2021
DOI: 10.1007/978-981-15-9651-3_13

Web Scraping: From Tools to Related Legislation and Implementation Using Python

Cited by 16 publications (7 citation statements)
References 28 publications
“…[14] Retrieval of structured data through online table extraction. In such cases, a complex approach is applied that includes the detection of certain patterns across rows and columns and determining the search content [15]. Dynamic rendering usually demands more complicated technologies, which are particularly relevant for a page with JavaScript involved. To function, the headless browsers must allow for emulating user inputs and showing changing data.…”
Section: Methods For Extracting Data (citation type: mentioning)
confidence: 99%
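
The table extraction described in the statement above can be illustrated with a minimal sketch. It assumes a simple, well-formed table whose first row is the header, and it uses only Python's standard-library `html.parser`. The names `TableExtractor`, `extract_table`, and `SAMPLE_HTML`, as well as the sample rows, are hypothetical and do not come from the cited paper. It does not cover the dynamic-rendering case, which needs a headless browser.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # cells of the row being read, or None
        self._cell = None   # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Keep text only while inside a cell; whitespace between tags is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def extract_table(html):
    """Return the table as a list of dicts keyed by the header row."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


# Hypothetical sample page fragment (illustrative data only).
SAMPLE_HTML = """
<table>
  <tr><th>Name</th><th>Price</th></tr>
  <tr><td>Widget</td><td>4.50</td></tr>
  <tr><td>Gadget</td><td>12.00</td></tr>
</table>
"""

records = extract_table(SAMPLE_HTML)
for record in records:
    print(record)
```

Mapping each body row onto the header row is the "pattern across rows and columns" step in its simplest form; real pages with merged cells or nested tables would need more handling than this sketch provides.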
“…One web scraping technique consists of navigating the elements of the HTML page as a tree [15], an activity facilitated by tools available for the Python programming language. Among them, the following libraries stand out: Beautiful-Soup, which enables the interpretation of HTML elements as a tree; requests, used to make HTTP requests; and re, which provides operations with regular expressions (regex).…”
Section: Methodological Path (citation type: unclassified)
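
The tree-navigation idea in the statement above can be sketched as follows. To keep the example runnable without third-party packages, it swaps in the standard-library `html.parser` as a stand-in for Beautiful Soup and reads an inline string instead of fetching a page with requests; only `re` is used as named in the statement. `Node`, `TreeBuilder`, `SAMPLE_HTML`, and the sample items are hypothetical names and data chosen for illustration.

```python
import re
from html.parser import HTMLParser

# Elements that never have a closing tag, so the builder must not descend into them.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}


class Node:
    """One element of the HTML tree."""

    def __init__(self, tag, attrs=None, parent=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.parent = parent
        self.children = []
        self.text = ""  # text found directly inside this element

    def find_all(self, tag):
        """Depth-first search for descendants with the given tag name."""
        found = []
        for child in self.children:
            if child.tag == tag:
                found.append(child)
            found.extend(child.find_all(tag))
        return found


class TreeBuilder(HTMLParser):
    """Build a Node tree while the parser walks the markup."""

    def __init__(self):
        super().__init__()
        self.root = Node("[document]")
        self._current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, parent=self._current)
        self._current.children.append(node)
        if tag not in VOID_TAGS:
            self._current = node

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag, if there is one.
        node = self._current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self._current = node.parent

    def handle_data(self, data):
        self._current.text += data


# Hypothetical page fragment (illustrative data only).
SAMPLE_HTML = """
<html><body>
  <ul>
    <li><a href="/item/1">Widget</a> costs $4.50</li>
    <li><a href="/item/2">Gadget</a> costs $12.00</li>
  </ul>
</body></html>
"""

builder = TreeBuilder()
builder.feed(SAMPLE_HTML)
builder.close()

PRICE = re.compile(r"\$(\d+\.\d{2})")  # dollar amount with two decimals

items = []
for li in builder.root.find_all("li"):
    link = li.find_all("a")[0]       # navigate from the list item to its link
    match = PRICE.search(li.text)    # regex over the item's own text
    items.append((link.text, link.attrs["href"], float(match.group(1))))

print(items)
```

The tree walk locates elements by structure, while the regular expression handles the part of the content that has no markup of its own, which is roughly the division of labor the statement attributes to Beautiful-Soup and re.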
“…Several works in the literature have addressed the problem of data extraction from web pages either by accessing the databases through webpages or by APIs [9][10][11][12][13]. Furthermore, big data extraction has spread to many scientific fields, such as medicine, where the volume of medical data is exponentially increasing.…”
Section: Related Work (citation type: mentioning)
confidence: 99%
“…Many tools have been constructed to extract data for machine learning purposes. These include command-line-based methods, such as application programming interface (API) [9,10] and web scraping methods, that extract information from websites [11][12][13]. Alternatively, some researchers or companies hire people to extract data manually, which costs them time and money; therefore, a method that extracts candidate information automatically is desperately needed.…”
Section: Introduction (citation type: mentioning)
confidence: 99%