Problem
Research questions and friction points this paper is trying to address.
Develops AI for competitive Pokémon using offline reinforcement learning
Reconstructs agent perspective from spectator logs for training data
Trains models to adapt without explicit search, outperforming humans
Innovation
Methods, ideas, or system contributions that make the work stand out.
Reconstruct first-person perspective from spectator logs
Train large sequence models without explicit search
Combine imitation learning and offline RL