Data collection is the initial stage of research, and the internet offers many data sources that can be used in the research process. The process of extracting data or information from websites is called web scraping. Web scraping methods include Regular Expression (Regex), HTML DOM, and XPath. This study aims to determine the performance of these three methods. The comparison is done by testing each method while it retrieves data from the target website, then measuring and comparing the performance of each process. Process time, memory usage, and data consumption are used as the measurement parameters. The experimental results show that web scraping with the Regex method uses the least memory compared to the HTML DOM and XPath methods, while HTML DOM requires the least time and the smallest data consumption compared to the Regex and XPath methods.
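As a rough illustration of the kind of comparison described above, the following minimal sketch extracts the same items with a regular expression, an HTML parser, and an XPath query, and records elapsed time and peak memory for each. It is not the study's code: the sample page, the `title` class, and all function names are hypothetical, Python's event-based `html.parser` stands in for an HTML DOM library, `xml.etree.ElementTree` supports only a subset of XPath, and data consumption is not measured because nothing is fetched over the network.

```python
import re
import time
import tracemalloc
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Hypothetical sample page; a real experiment would fetch the target website.
PAGE = (
    "<html><body><ul>"
    + "".join(f'<li class="title">Item {i}</li>' for i in range(1000))
    + "</ul></body></html>"
)


def scrape_regex(page):
    # Regex: match the raw markup directly, without parsing it into a tree.
    return re.findall(r'<li class="title">(.*?)</li>', page)


class TitleParser(HTMLParser):
    # Stand-in for the HTML DOM method: react to tags and collect matching text.
    def __init__(self):
        super().__init__()
        self.in_title = False
        self.titles = []

    def handle_starttag(self, tag, attrs):
        if tag == "li" and ("class", "title") in attrs:
            self.in_title = True

    def handle_data(self, data):
        if self.in_title:
            self.titles.append(data)

    def handle_endtag(self, tag):
        if tag == "li":
            self.in_title = False


def scrape_html_parser(page):
    parser = TitleParser()
    parser.feed(page)
    return parser.titles


def scrape_xpath(page):
    # XPath: ElementTree needs well-formed markup and handles a limited XPath subset.
    root = ET.fromstring(page)
    return [li.text for li in root.findall(".//li[@class='title']")]


def measure(func, page):
    # Return the scraped items, the elapsed seconds, and the peak traced memory in bytes.
    tracemalloc.start()
    start = time.perf_counter()
    result = func(page)
    elapsed = time.perf_counter() - start
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return result, elapsed, peak


for name, func in [
    ("regex", scrape_regex),
    ("html parser", scrape_html_parser),
    ("xpath", scrape_xpath),
]:
    items, seconds, peak_bytes = measure(func, PAGE)
    print(f"{name}: {len(items)} items, {seconds:.6f} s, peak {peak_bytes} bytes")
```

The numbers printed by this sketch depend on the machine, the page, and the libraries used, so they say nothing about the results reported in the abstract.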
Blood donation is a problem in every country, including Indonesia. Although the Indonesian Red Cross (Palang Merah Indonesia, PMI) already has a system in place, it has not resolved the problems of finding and distributing donated blood. Given the current prevalence of Android devices, an Android-based application is needed to address this problem. To integrate with the existing system, a web service is needed as the backend system so that blood donation services can be accessed from multiple platforms. The web service uses a REST architecture, but REST still has some problems, namely the security of the authentication process. A REST architecture requires a stateless authentication method, and JSON Web Token is one option. The results of this study show that using JSON Web Token Authentication in the Web Service and Backend System Blood Donors can produce a system that is highly scalable, secure, able to interact across platforms, and reliable.
Google Scholar is a web-based service for searching academic literature. Various types of references can be accessed, such as peer-reviewed papers, theses, books, abstracts, and articles from academic publishers, professional communities, preprint repositories, universities, and other academic organizations. Google Scholar provides a profile creation feature for every researcher, expert, and lecturer. The number of publications from an academic institution, along with detailed data on its published scientific articles, can be accessed through Google Scholar. A recapitulation of the scientific articles published by each researcher in an institution is needed to determine research performance collectively. However, a problem remains: no service is available that recapitulates the scientific article publications of each researcher in an institution. Therefore, this study attempts to produce such a recapitulation. Data collection from Google Scholar was carried out by applying web scraping technology. The scraping experiment in this study succeeded in retrieving the data of 238 researchers and 2,523 article files. The downloaded data was stored in a database and then used to recapitulate the publication of scientific articles, which can display a list of researcher profiles, a list of affiliations, a list of citations, and a list of article titles that can be printed in *.pdf or *.xlsx form and is equipped with data search and sorting features.
A system that survives in this digital era is one that can work on multiple platforms, and the use of web services is one solution. Data is exchanged in JSON format, and authentication is secured using JSON Web Token (JWT). Token-based authentication using JWT on web services can solve interoperability problems. JWT is stateless and can carry information inside the authorization token. JWT offers several signing algorithms, namely HMAC, RSA, and ECDSA. Unfortunately, it is unknown which algorithm performs better. Here, we directly tested signing with the three algorithms against several parameters. The experimental results showed that the HMAC algorithm produced an average token-generation time of 21.3 s, a token size of 109 bytes, and a token data transfer speed of 91.2 s. HMAC was therefore considered to have excellent performance.
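To make the HMAC case concrete, here is a minimal sketch of signing and verifying an HS256 (HMAC with SHA-256) token using only the Python standard library, together with a simple measurement of generation time and token size. It is not the study's test harness: the secret, the payload claims, and the function names `sign_hs256` and `verify_hs256` are assumptions for illustration, and RSA and ECDSA are left out because they need a third-party cryptography library.

```python
import base64
import hashlib
import hmac
import json
import time


def b64url(data: bytes) -> str:
    # JWT uses URL-safe base64 with the trailing "=" padding removed.
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def b64url_decode(text: str) -> bytes:
    # Restore the padding that b64url() stripped before decoding.
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))


def sign_hs256(payload: dict, secret: bytes) -> str:
    header = {"alg": "HS256", "typ": "JWT"}
    segments = [
        b64url(json.dumps(header, separators=(",", ":")).encode()),
        b64url(json.dumps(payload, separators=(",", ":")).encode()),
    ]
    signing_input = ".".join(segments).encode("ascii")
    signature = hmac.new(secret, signing_input, hashlib.sha256).digest()
    return ".".join(segments + [b64url(signature)])


def verify_hs256(token: str, secret: bytes) -> dict:
    # Stateless check: the server needs only the secret, not a session store.
    # A production verifier would also check the header "alg" and claims such as "exp".
    header_b64, payload_b64, signature_b64 = token.split(".")
    signing_input = f"{header_b64}.{payload_b64}".encode("ascii")
    expected = hmac.new(secret, signing_input, hashlib.sha256).digest()
    if not hmac.compare_digest(expected, b64url_decode(signature_b64)):
        raise ValueError("invalid signature")
    return json.loads(b64url_decode(payload_b64))


SECRET = b"hypothetical-demo-secret"  # assumption: a shared secret known to the server
CLAIMS = {"sub": "user-1", "role": "donor"}  # assumption: example claims

start = time.perf_counter()
token = sign_hs256(CLAIMS, SECRET)
elapsed = time.perf_counter() - start

print(f"token size: {len(token)} bytes, generated in {elapsed:.6f} s")
print("verified claims:", verify_hs256(token, SECRET))
```

In practice a maintained JWT library is the safer choice. The point of the sketch is only that HMAC signing is a single keyed hash over the encoded header and payload.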
The fingerprint-based attendance management process generally consists of recording, management, and reporting. Attendance data is recorded with the help of a fingerprint attendance machine. Even though the machine is connected to the computer online, the administrator still needs to process the data and generate reports because there is no real-time interconnection between devices. Web services such as Simple Object Access Protocol (SOAP) and Representational State Transfer (REST), which have been used in previous studies, are among the technologies that can support real-time processing. SOAP is more secure than REST. To overcome these problems, this study used a SOAP-based web service in the fingerprint attendance management system to support real-time data access. The research produced web services that overcome the interoperability problem between fingerprint machines and systems, so that data can be accessed in real time and more securely.