2021
DOI: 10.1002/smj.3317
|View full text |Cite
|
Sign up to set email alerts
|

Using machine learning to revisit the diversification–performance relationship

Abstract: Research Summary In this article, we examine the relationship between corporate diversification and firm performance using a machine learning technique called natural language processing (NLP). By applying a widely used NLP technique called topic modeling to unstructured text from annual reports, we create a new, multidimensional measure that captures the degree of diversification of both multisegment and single‐segment firms. Additionally, we introduce a novel method to incorporate human judgments into the in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
21
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 45 publications
(22 citation statements)
references
References 90 publications
(142 reference statements)
1
21
0
Order By: Relevance
“… 5 The commonly used rule in previous topic modeling studies is to filter out words that appear in less than 0.5 %–1 % of documents ( Denny and Spirling, 2018 ). Our choice of 1 % was consistent with Choi et al (2021) and Hickman et al (2022) , who argued that, with longer documents (compared with social media text), using higher thresholds can cut computational cost and hardly impact the result of the analysis. …”
supporting
confidence: 61%
See 2 more Smart Citations
“… 5 The commonly used rule in previous topic modeling studies is to filter out words that appear in less than 0.5 %–1 % of documents ( Denny and Spirling, 2018 ). Our choice of 1 % was consistent with Choi et al (2021) and Hickman et al (2022) , who argued that, with longer documents (compared with social media text), using higher thresholds can cut computational cost and hardly impact the result of the analysis. …”
supporting
confidence: 61%
“…We did so by applying the Jieba tokenizer, a Chinese word segmentation tool ( Chen and Hsu, 2018 ; Day and Lee, 2016 ). We then removed highly infrequent words that appeared in less than 1 % of documents 5 to reduce computational cost ( Choi et al, 2021 ; Hickman et al, 2022 ) and stop words (e.g., the, but, a, and, if) that contain little topical content.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Second, unsupervised ML approaches, such as topic modeling, have been used to identify dominant patterns or groups within text data or image data without the guidance of any preexisting theoretical concepts (Choudhury et al, 2019; Kaplan & Vakili, 2015; Choi, Menon, Tabakovic, 2021; Teodoridis et al, 2020; for a recent review, see Hannigan et al, 2019).…”
Section: Overview Of Machine Learning Methods In Strategic Management...mentioning
confidence: 99%
“…Social science has adapted machine learning, NLP, and CATA successfully, and a recent contribution in Strategic Management Journal (Choi et al, 2021) suggests that the modern methods are better predictors with accuracy and efficiency. While topic modeling is also gaining ground, CATA has been established since 2009 (Uotila et al, 2009) in strategic management discourse.…”
Section: Discussionmentioning
confidence: 99%