How to train your agent: Active Learning from Human Preferences and Justifications in Safety-critical Environments

Ilias Kazantzidis, Yali Du, Christopher Freeman, Tim Norman

Research output: Contribution to conference typesAbstractpeer-review

Original languageUndefined/Unknown
Publication statusPublished - 2022

Cite this