1. 2018
  2. Off-Policy and Off-Actor Actor-Critic with Bootstrapped Dual Policy Iteration

    Steckelmacher, D., Plisnier, H., Roijers, D. & Nowe, A., 3 Oct 2018. 8 p.

    Research output: Unpublished contribution to conferencePoster

  3. Multi-objective Reinforcement Learning for the Expected Utility of the Return

    Roijers, D., Steckelmacher, D. & Nowe, A., 14 Jul 2018. 6 p.

    Research output: Unpublished contribution to conferenceUnpublished paper

  4. Ordered Preference Elicitation Strategies for Supporting Multi-Objective Decision Making

    Zintgraf, L., Roijers, D., Linders, S., Jonker, C. M. & Nowe, A., Jul 2018, AAMAS 2018: Proceedings of the Seventeenth International Joint Conference on Autonomous Agents and Multi-Agent Systems. 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference paper

  5. Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies

    Libin, P., Verstraeten, T., Roijers, D., Grujic, J., Theys, K., Lemey, P. & Nowe, A., 15 May 2018, Machine Learning and Knowledge Discovery in Databases, European Conference, ECML PKDD 2018, Dublin, Ireland, September 10–14, 2018, Proceedings, Part III. Springer, p. 456-471

    Research output: Chapter in Book/Report/Conference proceedingConference paper

  6. Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets

    Steckelmacher, D., Roijers, D., Harutyunyan, A., Vrancx, P., Plisnier, H. & Nowe, A., 4 Feb 2018, Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence. AAAI Press, p. 4099-4106 8 p. 4099. (AAAI Conference on Artificial Intelligence).

    Research output: Chapter in Book/Report/Conference proceedingConference paper

  7. Bootstrapping LPs in Value Iteration for Multi-Objective and Partially Observable MDPs

    Roijers, D., Walraven, E. & Spaan, M. T. J., 2018, (Accepted/In press) ICAPS 2018: Proceedings of the Twenty-Eighth International Conference on Automated Planning and Scheduling.

    Research output: Chapter in Book/Report/Conference proceedingConference paper

  8. Learning to Coordinate with Coordination Graphs in Repeated Single-Stage Multi-Agent Decision Problems

    Bargiacchi, E., Verstraeten, T., Roijers, D., Nowe, A. & van Hasselt, H., 2018, 35th International Conference on Machine Learning, ICML 2018. Dy, J. & Krause, A. (eds.). Vol. 2. p. 810-818 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference paper

  9. 2017
  10. Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies

    Libin, P., Verstraeten, T., Roijers, D. M., Grujic, J., Theys, K., Lemey, P. & Nowé, A., 14 Dec 2017.

    Research output: Unpublished contribution to conferencePoster

ID: 26429967