Pareto-NRPA: A Novel Monte-Carlo Search Algorithm for Multi-Objective Optimization

📅 2025-07-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses multi-objective optimization in discrete search spaces. We propose Pareto-NRPA, the first extension of the single-objective Nested Rollout Policy Adaptation (NRPA) algorithm to the multi-objective setting. Pareto-NRPA integrates nested Monte Carlo search with Pareto dominance analysis, dynamically maintaining a non-dominated front and incorporating diversity- and isolation-driven strategy updates to enable parallel exploration of multiple policies. Unlike conventional multi-objective evolutionary algorithms, Pareto-NRPA achieves significantly faster convergence and superior distribution quality of the solution set under constrained search budgets. Empirical evaluation on the multi-objective Traveling Salesman Problem with Time Windows (MO-TSPTW) and neural architecture search demonstrates state-of-the-art performance, particularly under limited computational budgets.

Technology Category

Application Category

📝 Abstract
We introduce Pareto-NRPA, a new Monte-Carlo algorithm designed for multi-objective optimization problems over discrete search spaces. Extending the Nested Rollout Policy Adaptation (NRPA) algorithm originally formulated for single-objective problems, Pareto-NRPA generalizes the nested search and policy update mechanism to multi-objective optimization. The algorithm uses a set of policies to concurrently explore different regions of the solution space and maintains non-dominated fronts at each level of search. Policy adaptation is performed with respect to the diversity and isolation of sequences within the Pareto front. We benchmark Pareto-NRPA on two classes of problems: a novel bi-objective variant of the Traveling Salesman Problem with Time Windows problem (MO-TSPTW), and a neural architecture search task on well-known benchmarks. Results demonstrate that Pareto-NRPA achieves competitive performance against state-of-the-art multi-objective algorithms, both in terms of convergence and diversity of solutions. Particularly, Pareto-NRPA strongly outperforms state-of-the-art evolutionary multi-objective algorithms on constrained search spaces. To our knowledge, this work constitutes the first adaptation of NRPA to the multi-objective setting.
Problem

Research questions and friction points this paper is trying to address.

Extends NRPA for multi-objective optimization problems
Explores diverse solution regions using policy sets
Benchmarks on MO-TSPTW and neural architecture search
Innovation

Methods, ideas, or system contributions that make the work stand out.

Extends NRPA to multi-objective optimization
Uses multiple policies for diverse exploration
Adapts policies based on Pareto front diversity
🔎 Similar Papers
No similar papers found.
N
Noé Lallouet
LAMSADE, Paris Dauphine-PSL University and Thales DMS
Tristan Cazenave
Tristan Cazenave
Professor Université Paris Dauphine - PSL CNRS
Artificial IntelligenceMonte Carlo SearchCombinatorial ProblemsGame PlayingDeep Learning
C
Cyrille Enderli
Thales DMS