2021
DOI: 10.23919/jsc.2021.0011
|View full text |Cite
|
Sign up to set email alerts
|

From Symbols to Embeddings: A Tale of Two Representations in Computational Social Science

Abstract: Computational Social Science (CSS), aiming at utilizing computational methods to address social science problems, is a recent emerging and fast-developing field. The study of CSS is data-driven and significantly benefits from the availability of online user-generated contents and social networks, which contain rich text and network data for investigation. However, these large-scale and multi-modal data also present researchers with a great challenge: how to represent data effectively to mine the meanings we wa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 431 publications
(294 reference statements)
0
6
0
Order By: Relevance
“…This section carefully examines the experiment's findings. The proposed method splits the document length into eight (8) regions to analyze the keyphrase centroid and frequency (KCF). When a document's number of regions is increased by more than eight, the first region has a lower keyphrase frequency value as well as a lower centroid value than the eight-region.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…This section carefully examines the experiment's findings. The proposed method splits the document length into eight (8) regions to analyze the keyphrase centroid and frequency (KCF). When a document's number of regions is increased by more than eight, the first region has a lower keyphrase frequency value as well as a lower centroid value than the eight-region.…”
Section: Resultsmentioning
confidence: 99%
“…The split() method is then applied to the key files to calculate the number of keyphrases based on the newline (\n) (also shown in Figure 2). The proposed technique then considers the length of the document is broken into eight (8) regions and employs the first appearance keyphrase to analyze the centroid and frequency of keyphrases.…”
Section: Documents and Keys Pre-processingmentioning
confidence: 99%
See 1 more Smart Citation
“…Topic modeling is one of the most popular text‐mining method to analyze publications and form themes from a large amount of textual data. STM is a widely used topic modeling strategy in computational social science (Bai et al, 2021; Chen et al, 2021; Kang et al, 2022). Similar to other common topic modeling methods such as latent Dirichlet allocation (LDA), STM assumes that each document is a mixture of multiple latent topics, with each topic having a different probability to appear in the document.…”
Section: Methodsmentioning
confidence: 99%
“…is method can effectively classify sea ice regions, but they only analyse sea ice data provided by the Canadian Ice Service, which is only applicable to sea ice image processing in specific situations and has a smaller application [10]. Chen et al proposed a kernel singular vector decomposition (KSVD) algorithm, which is based on a learning decomposition method to generate adaptive bases and extends the KSVD algorithm in nonlinear feature space [11]. An algorithm is an iterative approach that alternates between the sparse encoding of the kernel feature space based on a nonlinear dictionary and an update process for each atom in the dictionary, ending with a linear support vector machine classifier to obtain the classification accuracy [12].…”
Section: Related Workmentioning
confidence: 99%