π€ AI Summary
Current AI systems face fundamental limitations in solving real-world physics problems requiring rigorous physical reasoning and domain-specific knowledge.
Method: We propose a novel agent framework integrating physics-informed modeling, multi-step symbolic reasoning, and dynamic tool invocation. Our approach couples reinforcement learning with a symbolic engine within a prior-guided collaborative reasoning system, enabling stepwise problem modeling, analytical equation derivation, and numerical validation for complex theoretical physics problems.
Results & Contributions: The agent achieves 23.5/30 on the IPhO 2025 theoretical examβranking 14th out of 406 globally (top 3.5%) and surpassing over half of human gold medalists. Key contributions include: (1) establishing the first IPhO-level physics reasoning benchmark; (2) empirically demonstrating substantial performance gains from principled, domain-aware tool integration in scientific reasoning; and (3) introducing an interpretable, scalable agent paradigm for AI-augmented physical discovery.
π Abstract
Physics provides fundamental laws that describe and predict the natural world. AI systems aspiring toward more general, real-world intelligence must therefore demonstrate strong physics problem-solving abilities: to formulate and apply physical laws for explaining and predicting physical processes. The International Physics Olympiad (IPhO)--the world's most prestigious physics competition--offers a rigorous benchmark for this purpose. We introduce Physics Supernova, an AI agent system with superior physics problem-solving abilities that match elite IPhO gold medalists. In IPhO 2025 theory problems, Physics Supernova attains 23.5/30 points, ranking 14th of 406 contestants and surpassing the median performance of human gold medalists. We extensively analyzed Physics Supernova's capabilities and flexibility across diverse physics tasks. These results show that principled tool integration within agent systems can deliver competitive improvements in solving challenging science problems. The codes are available at https://github.com/CharlesQ9/Physics-Supernova.