The growing prevalence of multi-stage attacks in computer networks poses significant security risks and necessitates effective defense schemes that can autonomously respond to intrusions during vulnerability windows. However, the defender faces several real-world challenges, e.g., unknown likelihoods and unknown impacts of successful exploits. In this article, we leverage reinforcement learning to develop an adaptive cyber defense scheme that maximizes cost-effectiveness under these challenges. In particular, we use Bayesian attack graphs to model the interactions between the attacker and the network. We then formulate the defense problem as a partially observable Markov decision process (POMDP), in which the defender maintains belief states to estimate the system state, leverages Thompson sampling to estimate transition probabilities, and uses reinforcement learning to choose optimal defense actions based on measured utility values. The performance of the algorithm is verified via numerical simulations based on real-world attacks.
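To make the pipeline concrete, the following is a minimal Python sketch of the ideas summarized above: a toy Bayesian attack graph, per-node belief states, Beta posteriors with Thompson sampling over unknown exploit success probabilities, and a myopic utility-based defense policy. The graph (`EDGES`, `TRUE_P`), cost parameters, and helpers (`simulate`, `expected_loss`) are hypothetical illustrations, and the simplifications (a one-step greedy policy, exploit outcomes assumed observable for the posterior update) depart from the full POMDP treatment in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 4-node Bayesian attack graph: edge (i, j) means a compromised
# node i can attempt an exploit against node j. The true exploit success
# probabilities TRUE_P are unknown to the defender.
EDGES = [(0, 1), (0, 2), (1, 3), (2, 3)]
TRUE_P = {(0, 1): 0.7, (0, 2): 0.5, (1, 3): 0.6, (2, 3): 0.4}
N_NODES, NODE_VALUE, PATCH_COST = 4, 10.0, 2.0

# Beta(alpha, beta) posterior over each edge's exploit success probability;
# Thompson sampling draws one candidate model per episode from these posteriors.
alpha = {e: 1.0 for e in EDGES}
beta = {e: 1.0 for e in EDGES}

def simulate(compromised, patched, p):
    """One attacker step under exploit probabilities p; returns the new set of
    compromised nodes and the outcome of each attempted exploit."""
    outcomes, new = {}, set(compromised)
    for (i, j) in EDGES:
        if i in compromised and j not in compromised and j not in patched:
            success = rng.random() < p[(i, j)]
            outcomes[(i, j)] = success
            if success:
                new.add(j)
    return new, outcomes

def expected_loss(patched, p_hat, belief):
    """Approximate expected loss (value of compromised nodes) after one
    attacker step, given the belief state and a sampled model p_hat."""
    loss = 0.0
    for j in range(N_NODES):
        if j in patched:
            continue
        p_safe = 1.0 - belief[j]  # probability j is currently clean
        for (i, k) in EDGES:
            if k == j:
                p_safe *= 1.0 - belief[i] * p_hat[(i, k)]
        loss += 1.0 - p_safe
    return loss * NODE_VALUE

for episode in range(200):
    # Thompson sampling: draw one plausible model from the Beta posteriors.
    p_hat = {e: rng.beta(alpha[e], beta[e]) for e in EDGES}

    compromised, patched = {0}, set()
    # Belief state: per-node probability of compromise (node 0 is the entry).
    belief = np.array([1.0, 0.0, 0.0, 0.0])

    for step in range(3):
        # Myopic defense: patch the node whose removal most reduces expected
        # loss under the sampled model, if the reduction exceeds the cost.
        base = expected_loss(patched, p_hat, belief)
        gains = {j: base - expected_loss(patched | {j}, p_hat, belief)
                 for j in range(N_NODES) if j not in patched}
        best = max(gains, key=gains.get)
        if gains[best] > PATCH_COST:
            patched.add(best)

        compromised, outcomes = simulate(compromised, patched, TRUE_P)

        # Conjugate posterior update from observed exploit attempts (assumed
        # observable here for brevity; the paper maintains a belief instead).
        for e, success in outcomes.items():
            alpha[e] += success
            beta[e] += 1 - success

        # Crude one-step belief propagation under the sampled model.
        new_belief = belief.copy()
        for (i, j) in EDGES:
            if j not in patched:
                new_belief[j] = 1 - (1 - belief[j]) * (1 - belief[i] * p_hat[(i, j)])
        belief = new_belief

# Posterior-mean estimates of the exploit success probabilities.
print({e: alpha[e] / (alpha[e] + beta[e]) for e in EDGES})
```

As the posteriors concentrate around the true exploit probabilities, the sampled models, and hence the patch decisions, increasingly reflect the actual network dynamics; this is the exploration-exploitation mechanism the Thompson sampling step provides.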