Developed the GMMVI software framework: a high-performance, well-documented framework for optimizing Gaussian mixture models for variational inference using natural gradient descent. The framework is highly modular: individual components, such as the natural-gradient estimator or the strategy for selecting samples for each update, can be exchanged independently, yielding a total of 432 supported combinations of design choices.
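The combination count follows directly from the modular design: each design choice exposes a small set of interchangeable options, and any option can be combined with any other. A minimal sketch of enumerating such a configuration space (the module names and option counts below are hypothetical, chosen only so that their product is 432; they are not GMMVI's actual option names):

```python
from itertools import product

# Hypothetical design choices; names and option counts are illustrative only.
design_choices = {
    "sample_selection": ["fixed", "adaptive", "mixture_based"],   # 3 options
    "ng_estimator": ["score_based", "quadratic_model"],           # 2
    "weight_stepsize": ["fixed", "decaying", "trust_region"],     # 3
    "component_stepsize": ["fixed", "decaying", "trust_region"],  # 3
    "component_adaptation": ["on", "off"],                        # 2
    "weight_update": ["direct", "trust_region"],                  # 2
    "temperature": ["annealed", "constant"],                      # 2
}

# Every configuration is one tuple from the Cartesian product of the options.
configs = list(product(*design_choices.values()))
print(len(configs))  # 3 * 2 * 3 * 3 * 2 * 2 * 2 = 432
```

A factored configuration space like this is what lets a framework support hundreds of algorithm variants while implementing only a handful of options per module.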
Research Experience
During his PhD, he investigated several learning problems for robotics, including reinforcement learning, inverse reinforcement learning, and variational inference, showing that they can all be framed as an information projection, a particular type of distribution-matching problem. Treating these learning problems as different instances of an information projection allows them to be solved with similar insights. For example, he derived an upper bound on the I-projection objective and used it, in combination with an expectation-maximization procedure, for variational inference, density estimation, and non-adversarial imitation learning.
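For reference, the information projection (I-projection) of a target distribution $p$ onto a tractable family $\mathcal{Q}$ is commonly written as minimizing the reverse KL divergence (standard notation; the thesis may use a different parameterization):

$$
q^{*} = \arg\min_{q \in \mathcal{Q}} \mathrm{KL}\!\left(q \,\|\, p\right)
      = \arg\min_{q \in \mathcal{Q}} \mathbb{E}_{q}\!\left[\log q(x) - \log p(x)\right].
$$

Because the expectation is taken under $q$, this objective is exactly the one optimized in variational inference, which is what makes the framing above applicable across the listed problems.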
Education
PhD (2015-2020), Advisor: Gerhard Neumann (currently full professor at the Karlsruhe Institute of Technology).
Background
Research Interests: Machine Learning, Robotics, Inverse Reinforcement Learning, Imitation Learning, Grasping and Manipulation, Reinforcement Learning, Variational Inference. Affiliated with the Intelligent Autonomous Systems group, Computer Science Department, TU Darmstadt.