Problem
Research questions and friction points this paper is trying to address.
Improving offline reinforcement learning data efficiency
Exploiting parametric dependencies in transition dynamics
Pruning redundant actions via game and SMT techniques
Innovation
Methods, ideas, or system contributions that make the work stand out.
Parametric SPI algorithm leveraging distribution correlations
Game-based abstraction pruning redundant actions
SMT solving for advanced action pruning