Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing 2010
DOI: 10.1145/1851476.1851526
|View full text |Cite
|
Sign up to set email alerts
|

ParaText

Abstract: Automated analysis of unstructured text documents (e.g., web pages, newswire articles, research publications, business reports) is a key capability for solving important problems in areas including decision making, risk assessment, social network analysis, intelligence analysis, scholarly research and others. However, as data sizes continue to grow in these areas, scalable processing, modeling, and semantic analysis of text collections becomes essential. In this paper, we present the ParaText text analysis eng… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2010
2010
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 10 publications
0
1
0
Order By: Relevance
“…Many works apply the Term-Frequency and Inverse Document Frequency (TF-IDF) method [4,7,10,23,24]. It is based on the calculation of the similarity of two documents on the basis of frequency of matching terms occurring in the two documents.…”
Section: B Types Of Text Miningmentioning
confidence: 99%
“…Many works apply the Term-Frequency and Inverse Document Frequency (TF-IDF) method [4,7,10,23,24]. It is based on the calculation of the similarity of two documents on the basis of frequency of matching terms occurring in the two documents.…”
Section: B Types Of Text Miningmentioning
confidence: 99%