2006 7th International Baltic Conference on Databases and Information Systems
DOI: 10.1109/dbis.2006.1678494
|View full text |Cite
|
Sign up to set email alerts
|

Intelligent integration of information from semistructured web data sources on the basis of ontology and meta-models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(12 citation statements)
references
References 8 publications
0
12
0
Order By: Relevance
“…Sometimes one text (web pages in text form) has the same information found in other text or lesser for the same product, for example text 8 and text 10 have the same number of sub attributes. In this case, RIA deletes one of the texts for reducing the space of storage in the universal database (IS UDB) (Guntis Arnicans & Girts Karnitis 2006). IS UDB receives the rest of the texts from RIA and saved them in universal database.…”
Section: Relevant Information Analyzer (Ria)mentioning
confidence: 99%
See 3 more Smart Citations
“…Sometimes one text (web pages in text form) has the same information found in other text or lesser for the same product, for example text 8 and text 10 have the same number of sub attributes. In this case, RIA deletes one of the texts for reducing the space of storage in the universal database (IS UDB) (Guntis Arnicans & Girts Karnitis 2006). IS UDB receives the rest of the texts from RIA and saved them in universal database.…”
Section: Relevant Information Analyzer (Ria)mentioning
confidence: 99%
“…David Buttler et al (2001) observed in their tests of 50 web sites with over 2000 web pages that the tag <TABLE> is used as object separator (18% of time) more than the other tags such as tag <P> 10% of time, tag <li> 8% of time, tag <hr> 6% of time, tag <ul> 2% of time, tag <DIV> 2% of time, and tag <a> 2% of time. Therefore, the relevant information in a web page that the user needs which must be extracted by IE are found between the tag <TABLE> and </TABLE> (Guntis Arnicans and Girts Karnitis 2006;Fatima Ashraf et al 2008). Each table is formatted in rows and columns, whereas it is distinguished in head and body according to meaning.…”
Section: Concepts Of Information Extraction (Ie)mentioning
confidence: 99%
See 2 more Smart Citations
“…Many researchers such as [7,10,16,17] research on extraction of information from web pages in different domains (traveling, products, business intelligence) but these researches deal with limited web pages and the user still need to use the search engines such as Yahoo and Google to collect more information.…”
Section: Introductionmentioning
confidence: 99%