Proceedings of the 7th ACM/IEEE-CS Joint Conference on Digital Libraries 2007
DOI: 10.1145/1255175.1255188
|View full text |Cite
|
Sign up to set email alerts
|

Integrating data and text mining processes for digital library applications

Abstract: This paper explores the integration of text mining and data mining techniques, digital library systems, and computational and data grid technologies with the objective of developing an online classification service exemplar. We discuss the current research issues relating to the use of data mining algorithms and toolkits for textual data; the necessary changes within the Cheshire3 Information Framework to accommodate analysis workflows; the outcomes of a demonstrator based on the National Library of Medicine's… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2008
2008
2015
2015

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(11 citation statements)
references
References 9 publications
0
11
0
Order By: Relevance
“…Mining and analysis related to these massive data has been the focus of many scientific researchers. [15] The literature from a variety of database, the analysis of dynamic interdisciplinary science paper changed in each historical period (research trends) over time.…”
Section: Research Status On Analysis Of the Research Trend Of The Scimentioning
confidence: 99%
“…Mining and analysis related to these massive data has been the focus of many scientific researchers. [15] The literature from a variety of database, the analysis of dynamic interdisciplinary science paper changed in each historical period (research trends) over time.…”
Section: Research Status On Analysis Of the Research Trend Of The Scimentioning
confidence: 99%
“…The computational expense to execute data or text mining based analysis as advanced services in real-world applications such as digital libraries has been identified as the major cause for the lack of more widespread use of such services [43]. For the easy deployment of text mining applications, a higher level of abstraction is needed than the ones commonly used in HPC.…”
Section: Distributed Computing In a Heterogeneous Environmentmentioning
confidence: 99%
“…This has lead to the development of GPU-based MapReduce models, which makes it easier to adapt algorithms to run on a distributed GPU cluster. A crucial advantage of MapReduce is that it also helps with the integration of other tasks, which in turn aids the development of advanced services, for instance, in digital libraries [43].…”
Section: Introductionmentioning
confidence: 99%
“…From human being perspective, such research assumes a particular interest when the involved data are natural language documents and the relationships are defined between entities described in text, e.g. [23,24,34,47].…”
Section: Introductionmentioning
confidence: 99%