2020
DOI: 10.35470/2226-4116-2020-9-3-144-151
|View full text |Cite
|
Sign up to set email alerts
|

Web Crawler: Design And Implementation For Extracting Article-Like Contents

Abstract: The World Wide Web is a large, wealthy, and accessible information system whose users are increasing rapidly nowadays. To retrieve information from the web as per users’ requests, search engines are built to access web pages. As search engine systems play a significant role in cybernetics, telecommunication, and physics, many efforts were made to enhance their capacity.However, most of the data contained on the web are unmanaged, making it impossible to access the entire network at once by current search engin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 16 publications
0
5
0
1
Order By: Relevance
“…Therefore, the intelligent rules of rejection are designed to prevent the crawler from falling into infinite crawling loops. This technique outperforms the web crawler presented in [43]. On comparing accuracy and recall, in testing phase crawler in [43] has accuracy 81.06% and precision 84.62%, while both performance measures have reached above 95% in technique.…”
Section: Comparative Advantagesmentioning
confidence: 84%
See 1 more Smart Citation
“…Therefore, the intelligent rules of rejection are designed to prevent the crawler from falling into infinite crawling loops. This technique outperforms the web crawler presented in [43]. On comparing accuracy and recall, in testing phase crawler in [43] has accuracy 81.06% and precision 84.62%, while both performance measures have reached above 95% in technique.…”
Section: Comparative Advantagesmentioning
confidence: 84%
“…This technique outperforms the web crawler presented in [43]. On comparing accuracy and recall, in testing phase crawler in [43] has accuracy 81.06% and precision 84.62%, while both performance measures have reached above 95% in technique. Figures 9 and 10 show the comparison of precision and recall delivered by the proposed crawler.…”
Section: Comparative Advantagesmentioning
confidence: 84%
“…Hasat Oranı Kesinlik/ Hassasiyet(%) Geri Çağırma Oranı Taranan Sayfa Sayısı [43] 0.389 36.00 0.611 5000 [44] 0.411 ------1000 [45] 0.810 ------5000 [46] 0.850 ------5000 [47] 0.500 ---0.600 6500 [48] ---92.01 0.590 495 [49] 0.890 ------1000 [50] 0.500 32.00 ---3200 [51] 0.830 ------1200 [26] ---92.00 0.300 2000 [25] 0.750 ---0.400 10000 [52] ------0.820 400 [53] 0.850 ------6000 [54] 0.700 ---0.470 13377 Tablo 2' de görüldüğü gibi odaklı web tarayıcılarında en çok kullanılan performans ölçütü hasat oranıdır. Hasat oranı taranan sayfa sayıları arttıkça genel olarak artmaktadır.…”
Section: Kaynakunclassified
“…[41,42] has been employed in this study to estimate the total CO 2 emissions and fuel consumption of vehicles from multiple inputs. CNN is a form of deep neural network that is often used to explore visual imagery [37,43]. The deep learning model has been built using Google Collab and results are presented in Figure 19 and Table 14.…”
Section: Linear Regression and Univariate Polynomial Regressionmentioning
confidence: 99%