Improving choice model specification using reinforcement learning

๐Ÿ“… 2025-06-06
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

200K/year
๐Ÿค– AI Summary
Conventional discrete choice models rely on manual trial-and-error and subjective assumptions, resulting in low efficiency and poor reproducibility; existing metaheuristic approaches treat model specification as a static optimization problem, ignoring historical estimation information and thus failing to enable dynamic search adaptation or cross-task knowledge transfer. Method: We propose the first deep reinforcement learningโ€“based (DQN) automated model search framework, formalizing model specification as a sequential decision-making process. We design a reward function jointly optimizing goodness-of-fit and model parsimony, and employ a serialized structural encoding scheme. Contribution/Results: The method requires no domain-specific prior knowledge, supports dynamic exploration control and cross-scenario knowledge transfer, and consistently converges to high-quality models under diverse data-generating processes. It significantly improves search efficiency, robustness, and generalization capability compared to state-of-the-art alternatives.

Technology Category

Application Category

๐Ÿ“ Abstract
Discrete choice modelling is a theory-driven modelling framework for understanding and forecasting choice behaviour. To obtain behavioural insights, modellers test several competing model specifications in their attempts to discover the 'true' data generation process. This trial-and-error process requires expertise, is time-consuming, and relies on subjective theoretical assumptions. Although metaheuristics have been proposed to assist choice modellers, they treat model specification as a classic optimisation problem, relying on static strategies, applying predefined rules, and neglecting outcomes from previous estimated models. As a result, current metaheuristics struggle to prioritise promising search regions, adapt exploration dynamically, and transfer knowledge to other modelling tasks. To address these limitations, we introduce a deep reinforcement learning-based framework where an 'agent' specifies models by estimating them and receiving rewards based on goodness-of-fit and parsimony. Results demonstrate the agent dynamically adapts its strategies to identify promising specifications across data generation processes, showing robustness and potential transferability, without prior domain knowledge.
Problem

Research questions and friction points this paper is trying to address.

Enhancing choice model specification via reinforcement learning
Overcoming trial-and-error limitations in discrete choice modeling
Adaptive model discovery without prior domain knowledge
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses deep reinforcement learning framework
Agent adapts strategies dynamically
Rewards based on goodness-of-fit
๐Ÿ”Ž Similar Papers
No similar papers found.