2019
DOI: 10.2197/ipsjjip.27.802
|View full text |Cite
|
Sign up to set email alerts
|

Understanding Attack Trends from Security Blog Posts Using Guided-topic Model

Abstract: Organizations are plagued by sophisticated and diversified cyber attacks. In order to prevent such attacks, it is necessary to understand threat trends and to take measures to protect their assets. Security vendors publish reports which contain threat trends or analysis of malware. These reports are useful for help in responding to a cyber security incident. However, it is difficult to collect threat information from multiple sources such as security blog posts. In this paper, we propose a method to efficientl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

1
3
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 8 publications
1
3
0
Order By: Relevance
“… b The top 20 topic words associated with the highest coefficients are listed. Consistent with previous studies (e.g., Nagai et al, 2019; Ramesh et al, 2014; Toubia et al, 2019; Watanabe & Zhou, 2020), many words derived from guided LDA were the same as seed words, but relevant topic words were also identified. The percentage of each topic under product attributes was rounded, and the sum is based on the numbers before rounding up. c The “Other” category refers to unseeded topics in our model, to account for tweets that did not fall into any classified topic.…”
Section: Methodssupporting
confidence: 84%
See 3 more Smart Citations
“… b The top 20 topic words associated with the highest coefficients are listed. Consistent with previous studies (e.g., Nagai et al, 2019; Ramesh et al, 2014; Toubia et al, 2019; Watanabe & Zhou, 2020), many words derived from guided LDA were the same as seed words, but relevant topic words were also identified. The percentage of each topic under product attributes was rounded, and the sum is based on the numbers before rounding up. c The “Other” category refers to unseeded topics in our model, to account for tweets that did not fall into any classified topic.…”
Section: Methodssupporting
confidence: 84%
“…For example, by changing seed confidence, researchers can tune their guided LDA models to classify tweets based not only on the selected seed words, but also on words’ co-occurrence patterns. Following Nagai et al’s (2019) recommendations, we set our seed confidence at .7, but future studies could usefully explore the potential impact of other seed-confidence levels on model performance.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations