Tiered Reward: Designing Rewards for Specification and Fast Learning of Desired Behavior

RLC 2024

1UC Berkeley 2Brown University