Abstract: Systems for helping users to select data sources in an environment, such as the Internet, must be expressive enough to allow a variety of data sources to be formally represented. We build upon and extend the concept language, description logic (DL), to propose a novel representation system to achieve that goal. We point out that there are technical barriers within description logic limiting the types of data sources that can be represented. Specifically, we show that (1) DL is awkward in representing sufficien…
“…In a highly diverse environment with hundreds and thousands of data sources, differences of content scopes can be valuably used to facilitate effective and efficient data source selection. Integrity constraints in COINL and the consistency checking component of the abductive procedure provide the basic ingredients to characterize the scope of information available from each source, to efficiently rule out irrelevant data sources and thereby speed up the selection process [TM98]. For example, a query requesting information about companies with assets lower than $2 million can avoid accessing a particular source based on knowledge of integrity constraints stating that the source only reports information about companies listed in the New York Stock Exchange (NYSE), and that companies must have assets larger than $10 million to be listed in the NYSE.…”
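The NYSE example in this excerpt can be illustrated with a minimal sketch. It is not COINL or the abductive procedure the excerpt refers to; it assumes a much simpler model in which each source's scope and each query are open numeric intervals per attribute, and a source is ruled out when the two intervals cannot overlap. The names `Interval`, `may_answer`, `nyse_source`, and `other_source` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
import math


@dataclass(frozen=True)
class Interval:
    """Open interval (low, high) over one numeric attribute (assumed model)."""
    low: float = -math.inf
    high: float = math.inf

    def intersects(self, other: "Interval") -> bool:
        # Two open intervals overlap iff the larger lower bound
        # lies strictly below the smaller upper bound.
        return max(self.low, other.low) < min(self.high, other.high)


def may_answer(source_scope: dict, query: dict) -> bool:
    """Return False only when a scope constraint contradicts the query."""
    for attribute, wanted in query.items():
        scope = source_scope.get(attribute)
        if scope is not None and not wanted.intersects(scope):
            return False  # inconsistent: the source cannot hold matching data
    return True


# Hypothetical sources: one reports only NYSE-listed companies
# (assets above $10 million), the other has no known constraint.
sources = {
    "nyse_source": {"assets": Interval(low=10_000_000)},
    "other_source": {},
}

# Query: companies with assets lower than $2 million.
query = {"assets": Interval(high=2_000_000)}

relevant = [name for name, scope in sources.items() if may_answer(scope, query)]
print(relevant)
```

Under these assumptions the NYSE-only source is dropped before any access is attempted, while a source with no known constraint is conservatively kept.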
“…In cases of violent conflict, casualty reports vary significantly, largely because of differences in definitions of the variable (i.e., who is being counted). {See [TM98] for more details on proposed solution approach.} 7.…”
Section: Research Tasks and Expected Contributions In Integrating Sys (mentioning)
“…A natural extension is to leverage context knowledge to achieve context-based automatic source selection. One particular kind of context knowledge useful to enable automatic source selection is the content scope of data sources [TM98]. Data sources differ either significantly or subtly in their coverage scopes.…”
Section: Extended Domain Of Knowledge - Equational and Temporal (mentioning)