2004
DOI: 10.1073/pnas.0400341101
|View full text |Cite
|
Sign up to set email alerts
|

From paragraph to graph: Latent semantic analysis for information visualization

Abstract: Most techniques for relating textual information rely on intellectually created links such as author-chosen keywords and titles, authority indexing terms, or bibliographic citations. Similarity of the semantic content of whole documents, rather than just titles, abstracts, or overlap of keywords, offers an attractive alternative. Latent semantic analysis provides an effective dimension reduction method for the purpose that reflects synonymy and the sense of arbitrary word combinations. However, latent semantic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
84
0
3

Year Published

2006
2006
2023
2023

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 107 publications
(87 citation statements)
references
References 9 publications
0
84
0
3
Order By: Relevance
“…Internet term-document data are a common application of SVD-based techniques, often via latent semantic analysis (24,25). The Open Directory Project (ODP) (26) is a multilingual open content directory of World Wide Web links.…”
Section: Diagnostic Data Analysis Applicationsmentioning
confidence: 99%
“…Internet term-document data are a common application of SVD-based techniques, often via latent semantic analysis (24,25). The Open Directory Project (ODP) (26) is a multilingual open content directory of World Wide Web links.…”
Section: Diagnostic Data Analysis Applicationsmentioning
confidence: 99%
“…The frequency of latent dimensions identified by LSI follows the Zipf-distribution [4]. Dimensions with smaller contributions can be safely omitted.…”
Section: Visualizing An Lsi Latent Semantic Spacementioning
confidence: 99%
“…Landauer [4] describes a linear SVD technique and applies it to a collection of a half billion documents containing 750 000 unique word types. LSA presumes that the overall semantic content of a document can be approximated as a sum of the meaning of its words.…”
Section: Related Workmentioning
confidence: 99%
“…Many quantitative literature overview studies use LSA for document clustering purposes (Landauer et al 2004, Ord et al 2005, while some studies (Sidorova et al 2008) use factor analysis and others (Larsen et al 2008) use both clustering and factor analysis. In this section we discuss differences and similarities between clustering and factor analysis extensions to LSA.…”
Section: Clustering Versus Factor Analysismentioning
confidence: 99%
“…(1) quantitative literature reviews (as done in Landauer et al 2004, Ord et al 2005, Larsen et al 2008, Sidorova et al 2008, or Hovorka et al 2009 (Salton 1975), where a corpus of d documents using a vocabulary of t terms is used to compile a t×d matrix A, containing the number of times each term appears in each document (term frequencies). Some trivial terms such as "the", "of", etc.…”
Section: Lsa Applications Relevant To Is Researchmentioning
confidence: 99%