Maximum Entropy Inverse Reinforcement Learning: Understanding the Trajectory Formula
Inverse reinforcement learning (IRL) asks a different question from classical RL: instead of assuming a reward function and learning a policy, you observe expert behavior and infer what reward would …