Research output

  1. Directed Policy Gradient for Safe Reinforcement Learning with Human Advice

    Research output: Unpublished contribution to conferenceUnpublished paper

  2. Directed Policy Gradient for Safe Reinforcement Learning with Human Advice

    Research output: Unpublished contribution to conferencePoster

  3. Adapting to Concept Drift in Credit Card Transaction Data Streams Using Contextual Bandits and Decision Trees

    Research output: Chapter in Book/Report/Conference proceedingConference paper

View all (38) »

Activities

  1. Multi-agent Reinforcement Learning in Traffic Light Control

    Activity: OtherResearch and Teaching at External Organisation

  2. Evolutionary Algorithms for Satisfiability Solving in Fuzzy Logics

    Activity: OtherResearch and Teaching at External Organisation

  3. Function Landscape of Satisfiability Problems in Fuzzy Logics

    Activity: OtherResearch and Teaching at External Organisation

View all (3) »

ID: 168212