2016
DOI: 10.21105/joss.00037

tidytext: Text Mining and Analysis Using Tidy Data Principles in R

Abstract: The tidytext package (Silge, Robinson, and Hester 2016) is an R package (R Core Team 2016) for text mining using tidy data principles. As described by Hadley Wickham (Wickham 2014), tidy data has a specific structure:

Cited by 804 publications
(650 citation statements)
References 4 publications
“…We used text analyses to quantify the most frequently used words or terms in presentation titles. We used the tidytext (Silge & Robinson ) package to extract text‐based data into a format ready to be analyzed and parsed. To quantify the top terms, we split the titles into separate words or bigrams and removed stop words (the most common short function words, such as the, is, at, which, and on) based on a custom list (Supporting Information).…”
Section: Methods; citation type: mentioning
confidence: 99%
“…Developing these specialized tools using the core tidyverse principles means that the tools can more easily be adopted into workflows by a wider range of analysts. An example, well-suited to this sample analysis, is tidytext (Silge & Robinson 2016). Created by Julia Silge and David Robinson, tidytext provides a set of "tidy" tools for handling text data.…”
Section: Using Tidytext To Analyze Text; citation type: mentioning
confidence: 99%
“…We also build an interactive network visualization (see figure 3) using sigma.js to make it interoperable with TextImager. tidytext The tidytext package (Silge and Robinson, 2016) provides functionality to create datasets following the tidy data principles (Wickham, 2014). We used it with our tm based corpus to calculate TF-IDF information of documents.…”
Section: R-packages; citation type: mentioning
confidence: 99%