Adversarial Reinforcement Learning in a Cyber Security Simulation}

Conference object English OPEN
Elderman, Richard; Pater, Leon J.J.; Thie, Albert S.; Drugan, Madalina M.; Wiering, Marco A.;
  • Publisher: SCITEPRESS-Science and Technology Publications, Lda.
  • Subject: Adversarial setting | Cyber security in networks | Security | Software | Markov games | Control and Systems Engineering | Artificial Intelligence | Reinforcement learning | reinforcement learning (RL) | Simulations

This paper focuses on cyber-security simulations in networks modeled as a Markov game with incomplete information and stochastic elements. The resulting game is an adversarial sequential decision making problem played with two agents, the attacker and defender. The two ... View more
  • References (14)
    14 references, page 1 of 2

    Auer, P., Cesa-Bianchi, N., and Fischer, P. (2002). Finitetime analysis of the multiarmed bandit problem. Machine Learning, 47:235-256.

    Chung, K., Kamhoua, C., Kwiat, K., Kalbarczyk, Z., and Iyer, K. (2016). Game theory with learning for cyber security monitoring. IEEE HASE, pages 1-8.

    Garivier, A. and Moulines, E. (2008). On upper-confidence bound policies for non-stationary bandit problems. ALT.

    Lin, L.-J. (1993). Reinforcement Learning for Robots Using Neural Networks. PhD thesis, Carnegie Mellon University.

    Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. In ICML, pages 157-163.

    Neumann, J. V. and Morgenstern, O. (2007). Theory of games and economic behavior. Princeton University Press.

    Sharma, A., Kalbarczyk, Z., Barlow, J., and Iyer, R. (2011). Analysis of security data from a large computing organization. In 2011 IEEE/IFIP DSN, pages 506-517.

    Sutton, R. S. and Barto, A. G. (1998). Reinforcement Learning: An Introduction. The MIT press, Cambridge MA.

    Szepesva´ri, C. (1997). The asymptotic convergence-rate of q-learning. In NIPS, pages 1064-1070.

    Tambe, M. (2011). Security and Game Theory: Algorithms, Deployed Systems, Lessons Learned. Cambridge University Press, New York, NY, USA, 1st edition.

  • Metrics
Share - Bookmark