2021
DOI: 10.3390/fi14010004

Authorship Attribution of Social Media and Literary Russian-Language Texts Using Machine Learning Methods and Feature Selection

Abstract: Authorship attribution is an important field of natural language processing (NLP). Its popularity stems from its relevance to information security, copyright protection, and various linguistic studies, in particular research on social networks. This article continues a series of studies aimed at identifying the author of a Russian-language text and reducing the required text volume. The study focused on the attribution of textual da…

Cited by 14 publications
(7 citation statements)
References 39 publications
“…Some scholars emphasize that the core of public opinion guidance lies in the competition for the right to speak, think that the main elements of the right to speak are the right to speak, the right to spread, and the right to guide, and put forward a new path of public opinion guidance. Some scholars pointed out that news and public opinion are an important carrier for a political party to master the right to speak [9]. The discourse power of news public opinion should obey certain political needs and guide the trend of news public opinion.…”
Section: Literature Review (mentioning)
confidence: 99%
“…• saliency maps: elements in the input that have the largest influence in the prediction are identified (e.g., LIME); • feature attribution: attributing the classification to a small number of numeric/semantic features [47,48]; • metric learning [49]: mapping out data structures by deriving a metric from a classifier (explicit Siamese networks are very popular); • activation maximization: methods that are based on GAN.…”
Section: State of the Art in Explainable AI (mentioning)
confidence: 99%
“…An important part of the literature consists of studies on English language [4,5,6,7,8]. There are also many studies done in many different languages including Japanese [9], Mongolian [10], Persian [11], Albanian [12], Indian [13,14], Brazilian [15], Russian [16,17], German [18], and Arabic [19]. When the existing studies were examined, it was seen that different types of data sets were used for author identification tasks.…”
Section: Literature Review (mentioning)
confidence: 99%
“…Some studies have been carried out on newspaper articles [4,15,18,19], while others were carried out on poems [13], novels [11,12,16], email content [20], song lyrics [21], source codes [22], or tweets, blog posts, and forums [8,9,23]. In some cases, different types of data sources were combined or compared [17,25]. Early studies in author identification focused on different stylometric techniques. These techniques are based on identification of style markers including lexical and character features or syntactic and semantic features that quantify writing style [9,26].…”
Section: Literature Review (mentioning)
confidence: 99%