1. 2018
  2. Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets

    Steckelmacher, D., Roijers, D., Harutyunyan, A., Vrancx, P., Plisnier, H. & Nowe, A., 4 Feb 2018, Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence. AAAI Press, p. 4099-4106 8 p. 4099. (AAAI Conference on Artificial Intelligence).

    Research output: Chapter in Book/Report/Conference proceedingConference paper

  3. Learning with options that terminate off-policy

    Harutyunyan, A., Vrancx, P., Bacon, P. L., Precup, D. & Nowé, A., 1 Jan 2018, 32nd AAAI Conference on Artificial Intelligence, AAAI 2018. AAAI Press, p. 3173-3182 10 p.

    Research output: Chapter in Book/Report/Conference proceedingConference paper

  4. 2017
  5. Multi-objectivization and ensembles of shapings in reinforcement learning

    Brys, T., Harutyunyan, A., Vrancx, P., Nowe, A. & Taylor, M., 8 Nov 2017, In : Neurocomputing. 263, p. 48-59

    Research output: Contribution to journalArticle

  6. Efficient evaluation of influenza mitigation strategies using preventive bandits

    Libin, P., Verstraeten, T., Theys, K., Roijers, D., Vrancx, P. & Nowe, A., 9 May 2017, p. 67-85. 19 p.

    Research output: Unpublished contribution to conferenceOther

  7. Analysing Congestion Problems in Multi-agent Reinforcement Learning

    Radulescu, R., Vrancx, P. & Nowe, A., 8 May 2017, Proceedings of the Adaptive and Learning Agents Workshop 2017 (ALA-17) at AAMAS. 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference paper

  8. Analysing Congestion Problems in Multi-agent Reinforcement Learning

    Radulescu, R., Vrancx, P. & Nowe, A., 8 May 2017, 16th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2017. Durfee, E., Winikoff, M., Larson, K. & Das, S. (eds.). Vol. 3. p. 1705-1707 3 p.

    Research output: Chapter in Book/Report/Conference proceedingConference paper

  9. Extending the Options Model for Reinforcement Learning in POMDPs

    Steckelmacher, D., Vrancx, P. & Nowe, A., 26 Jan 2017. 2 p.

    Research output: Unpublished contribution to conferenceUnpublished abstract

  10. Efficient evaluation of influenza mitigation strategies using preventive bandits

    Libin, P., Verstraeten, T., Theys, K., Roijers, D., Vrancx, P. & Nowe, A., 2017, AAMAS 2017: Autonomous Agents and Multiagent Systems . Springer, p. 67-85 ( Lecture Notes in Computer Science; vol. 10643).

    Research output: Chapter in Book/Report/Conference proceedingConference paper

Previous 1 2 3 4 5 6 7 8 Next

ID: 72031