Publications

Highlights

Learning Lagrangian Multipliers for the Travelling Salesman Problem

Lagrangian relaxation is a versatile mathematical technique employed to relax constraints in an optimization problem, enabling the generation of dual bounds to prove the optimality of feasible solutions and the design of efficient propagators in constraint programming (such as the weighted circuit constraint). However, the conventional process of deriving Lagrangian multipliers (e.g., using subgradient methods) is often computationally intensive, limiting its practicality for large-scale or time-sensitive problems. To address this challenge, we propose an innovative unsupervised learning approach that harnesses the capabilities of graph neural networks to exploit the problem structure, aiming to generate accurate Lagrangian multipliers efficiently.

A. Parjadis, Q. Cappart, B. Dilkina, A. Ferber, L.-M. Rousseau

Proceedings of CP 2024 conference (Best ML paper award)

Estimating Road Construction Costs with Explainable Machine Learning

A preliminary estimation of construction costs is a crucial operation of any project related to civil engineering. An accurate estimation ensures a proper management of the available funds and helps the project managers in their decision-making processes. In order to select a subcontractor, the project manager needs to have an accurate idea of a reasonable price for the subtask given. A price higher than expected is undesirable, but a price significantly lower than expected may also result in poor quality of service. This paper introduces a framework for estimating construction costs while tackling both challenges. It is based on six machine learning models and on Shapley additive explanations. This project was commissioned by the Ministry of Transport and Sustainable Mobility, a public agency responsible for transport infrastruc- ture in Quebec, Canada.

R. Larocque, A.-M. Boulé, Q. Cappart

INFORMS Journal on Applied Analytics (2024)

Acquiring Constraints for a Non-linear Transmission Maintenance Scheduling Problem

Modern electrical power utilities must maintain their electrical equipment and replace it when the end of its useful life arrives. Each year, a list of equipment (power lines, capacitors, transistors, etc.) that needs to be maintained or replaced is available and the goal is to generate an optimal maintenance plan. While most constraints can be naturally formalized in constraint programming (CP), there are some complex constraints like transit-power limits that are challenging to model because of their continuous and non-linear nature. This paper proposes a methodology based on active constraint acquisition to automatically approximate these constraints and to add them into a CP model.

H. Barral, M. Gaha, A. Dems, A. Côté, F. Nguewouo, Q. Cappart

Proceedings of CPAIOR 2024 conference

Towards a Generic Representation of Combinatorial Problems for Learning-Based Approaches

In recent years, there has been a growing interest in using learning-based approaches for solving combinatorial problems, either in an end-to-end manner or in conjunction with traditional optimization algorithms. In both scenarios, the challenge lies in encoding the targeted combinatorial problems into a structure compatible with the learning algorithm. this paper advocates for progress toward a fully generic representation of combinatorial problems for learning-based approaches. The approach we propose involves constructing a graph by breaking down any constraint of a combinatorial problem into an abstract syntax tree and expressing relationships (e.g., a variable involved in a constraint) through the edges.

L. Boisvert, H. Verhaeghe, Q. Cappart

Proceedings of CPAIOR 2024 conference

MARCO: A Memory-Augmented Reinforcement framework for Combinatorial Optimization

Neural Combinatorial Optimization (NCO) is an emerging domain where deep learning techniques are employed to address combinatorial optimization problems as a standalone solver. Despite their potential, existing NCO methods often suffer from inefficient search space exploration, frequently leading to local optima entrapment or redundant exploration of previously visited states. This paper introduces a versatile framework that can be used to enhance both constructive and improvement methods in NCO through an innovative memory module.

A. Garmendia, J. Ceberio, A. Mendiburu, Q. Cappart

Proceedings of IJCAI 2024 conference

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on measuring the agents’ ability to perform tasks that span the typical daily work of knowledge workers utilizing enterprise software systems. To this end, we propose WorkArena, a remote-hosted benchmark of 29 tasks based on the widely-used ServiceNow platform. We also introduce BrowserGym, an environment for the design and evaluation of such agents, offering a rich set of actions as well as multimodal observations.

A. Drouin, M. Gasse, M. Caccia, I. Laradji, M. Del Verme, T. Marty, L. Boisvert, M. Thakkar, Q. Cappart, D. Vazquez, N. Chapados, A. Lacoste

ICLR 2024 Workshop on Large Language Model (LLM) Agent

A machine-learning-based column generation heuristic for electric bus scheduling

Bus scheduling in public transit consists in determining a set of bus schedules to cover a set of timetabled trips at minimum cost. Column-generation algorithms have regained popularity for solving problems similar to the one considered in this paper, namely, the multi-depot electric vehicle scheduling problem (MDEVSP) with a piecewise linear charging function and capacitated charging stations. To tackle large-scale MDEVSP instances, we design a learning-based column generation heuristic that relies on reduced-sized networks to generate the bus schedules

J. Gerbaux, Q. Cappart, G. Desaulniers

Les Cahiers du GERAD (2024)

Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems

We study vehicle dispatching in autonomous mobility on demand (AMoD) systems, where a cen- tral operator assigns vehicles to customer requests or rejects these with the aim of maximizing its total profit. Recent approaches use multi-agent deep reinforcement learning (MADRL) to realize scalable yet performant algorithms, but train agents based on local rewards, which distorts the re- ward signal with respect to the system-wide profit, leading to lower performance. We therefore propose a novel global-rewards-based MADRL algorithm for vehicle dispatching in AMoD sys- tems, which resolves so far existing goal conflicts between the trained agents and the operator by assigning rewards to agents leveraging a counterfactual baseline

H. Hoppe, T. Enders, Q. Cappart, M. Schiffer

Proceedings of L4DC conference (to appear)

Improved Peel-and-Bound: Methods for Generating Dual Bounds with Multivalued Decision Diagrams

Decision diagrams are an increasingly important tool in cutting-edge solvers for discrete optimization. This paper extends our previously peel-and-bound approaches and close several benchmark instances on the traveling salesman problem with time windows problem

I. Rudich, Q. Cappart, L.-M. Rousseau

Journal of Artificial Intelligence Research (2023)

Combinatorial optimization and reasoning with graph neural networks

Combinatorial optimization is a well-established area in operations research and computer science. Until recently, its methods have focused on solving problem instances in isolation, ignoring the fact that they often stem from related data distributions in practice. However, recent years have seen a surge of interest in using machine learning, especially graph neural networks, as a key building block for combinatorial tasks. This paper presents a conceptual review of recent key advancements in this emerging field, aiming at researchers in both optimization and machine learning

Q. Cappart, D. Chételat, E. Khalil, A. Lodi, C. Morris, P. Veličković

Journal of Machine Learning Research (2023)

Learning a Generic Value-Selection Heuristic Inside a Constraint Programming Solver

Constraint programming is known for being an efficient approach for solving combinatorial problems. Important design choices in a solver are the branching heuristics, which are designed to lead the search to the best solutions in a minimum amount of time. This paper introduces a generic learning procedure that can be used to obtain a value-selection heuristic inside a constraint programming solver. This has been achieved thanks to the combination of a deep Q-learning algorithm, a tailored reward signal, and a heterogeneous graph neural network architecture.

T. Marty, T. François, P. Tessier, L. Gauthier, L.-M. Rousseau, Q. Cappart

Proceedings of CP 2023 conference (Distinguished paper)

Dynamic Routing and Wavelength Assignment with Reinforcement Learning

With the rapid developments in communication systems, and considering their dynamic nature, all-optical networks are becoming increasingly complex. This paper proposes a novel method based on deep reinforcement learning for the routing and wavelength assignment problem in all-optical wavelength-decision-multiplexing networks. We employ graph neural networks to capture crucial latent information from the graph-structured input to develop the optimal strategy.

P. Kafaei, Q. Cappart, N. Chapados, H. Pouya, L.-M. Rousseau

INFORMS Journal on Optimization (2023)

Explaining the Behavior of Reinforcement Learning Agents using Association Rules

Deep reinforcement learning algorithms are increasingly used to drive decision-making systems. However, there exists a known tension between the efficiency of a machine learning algorithm and its level of explainability. Generally speaking, increased efficiency comes with the cost of decisions that are harder to explain. This paper proposes to explain the behaviour of a deep reinforcement learning algorithm thanks to standard data mining tools, i.e. association rules. We apply this idea to the design of playing bots, which is ubiquitous in the video game industry.

Z. Parham, V.T. de Lille, Q. Cappart

Proceedings of LION 2023 conference

Repositioning Fleet Vehicles: a Learning Pipeline

Managing a fleet of vehicles under uncertainty requires careful planning and adaptability. We consider a ride-hailing problem where the operator manages vehicle repositioning to maximize responsiveness. This paper introduces a supervised learning pipeline that uses past trip data to reposition vehicles while adapting to fleet activity, a geographical zone, and seasonal or daily request variation. This tool has been developed for, and used by, operators of an ambulance company in Belgium. Using predictors for ambulance repositioning reduces at least 10% of the overall fleet reaction distance.

A. Parjadis, Q. Cappart, Q. Massoteau, L.-M. Rousseau,

Proceedings of LION 2023 conference

Peel-and-Bound: Generating Stronger Relaxed Bounds with Multivalued Decision Diagrams

Decision diagrams are an increasingly important tool in cutting-edge solvers for discrete optimization. However, the field of decision diagrams is relatively new, and is still incorporating the library of techniques that conventional solvers have had decades to build. We drew inspiration from the warm-start technique used in conventional solvers to address one of the major challenges faced by decision diagram based methods.

I. Rudich, Q. Cappart, L.-M. Rousseau

Proceedings of CP 2022 conference (Best paper award)

Scheduling the Equipment Maintenance of an Electric Power Transmission Network Using Constraint Programming

Modern electrical power utilities must maintain their electrical equipment and replace it when the end of its useful life arrives. The Transmission Maintenance Scheduling (TMS) problem consists in generating an annual maintenance plan for electric power transportation equipment while maintaining the stability of the network and ensuring a continuous power flow for customers. Each year, a list of equipment (power lines, capacitors, transistors, etc.) that needs to be maintained or replaced is available and the goal is to generate an optimal maintenance plan. This paper proposes a constraint-based scheduling approach for solving the TMS problem.

L. Popovic, A. Côté, M. Gaha, F. Nguewouo, Q. Cappart

Proceedings of CP 2022 conference

Improving Variable Orderings of Approximate Decision Diagrams Using Reinforcement Learning

This paper studies the use of reinforcement learning to compute a variable ordering of decision diagram-based approximations for discrete optimization problems. This is among the first works to propose the use of machine learning to improve upon generic bounding methods for discrete optimization problems, thereby establishing a critical bridge between optimization and learning.

Q. Cappart, D. Bergman, L.-M. Rousseau, I. Prémont-Schwarz, A. Parjadis

INFORMS Journal on Computing (2022)

Learning the travelling salesperson problem requires rethinking generalization

End-to-end training of neural network solvers for combinatorial problems such as the Travelling Salesman Problem is intractable and inefficient beyond a few hundreds of nodes. While state-of-the-art Machine Learning approaches perform closely to classical solvers for trivially small sizes, they are unable to generalize the learnt policy to larger instances. This paper proposes controlled experiments in the first principled investigation into such zero-shot generalization, revealing that extrapolating beyond training data requires rethinking the entire neural combinatorial optimization pipeline.

C. K. Joshi, Q. Cappart, L.-M. Rousseau, T. Laurent

Constraints Journal (2022)

The Machine Learning for Combinatorial Optimization Competition (ML4CO): Results and Insights

Combinatorial optimization is a well-established area in operations research and computer science. Until recently, its methods have focused on solving problem instances in isolation, ignoring that they often stem from related data distributions in practice. However, recent years have seen a surge of interest in using machine learning as a new approach for solving combinatorial problems, either directly as solvers or by enhancing exact solvers. Based on this context, the ML4CO aims at improving state-of-the-art combinatorial optimization solvers by replacing key heuristic components.

M. Gasse, Q. Cappart, A. Parjadis, et al.

NeurIPS 2021 Competitions and Demonstrations Track

Graph neural networks and deep reinforcement learning for simultaneous beam orientation and trajectory optimization of Cyberknife

This paper presents a deep Q-learning approach based on graph neural networks to improve cancer treatments performed by the Cyberknife system.

P. Kafaei, Q. Cappart, M.-A. Renaud, N. Chapados, L.-M. Rousseau

Physics in Medicine & Biology Journal (2021)

SeaPearl: A Constraint Programming Solver guided by Reinforcement Learning

This paper presents the proof of concept for SeaPearl, a new constraint programming solver implemented in Julia, that supports machine learning routines in order to learn branching decisions using reinforcement learning. Support for modeling the learning component is also provided.

F. Chalumeau, I. Coulon, Q. Cappart, L.-M. Rousseau

Proceedings of CPAIOR 2021 conference

Combining reinforcement learning and constraint programming for combinatorial optimization

We propose a general and hybrid approach, based on deep reinforcement learning and constraint programming, for solving combinatorial optimization problems. The core of the approach is based on a dynamic programming formulation, that acts as a bridge between both techniques.

Q. Cappart, T. Moisan, L.-M. Rousseau, I. Prémont-Schwarz, A. Cire

Proceedings of AAAI 2021 conference

 

Full List

Learning Valid Dual Bounds in Constraint Programming: Boosted Lagrangian Decomposition with Self-Supervised Learning
S. Bessa, D. Dabert, M. Bourgeat, L.-M. Rousseau, Q. Cappart
Arxived (2024)

WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
L. Boisvert, M. Thakkar, M. Gasse, M. Caccia, T. Le Sellier De Chezelles, Q. Cappart, N. Chapados, A. Lacoste, A. Drouin
Arxived (2024)

Learning Precedences for Scheduling Problems with Graph Neural Networks
H. Verhaeghe, Q. Cappart, G. Pesant, C.-G. Quimper
Proceedings of CP 2024 conference (to appear)

Learning Lagrangian Multipliers for the Travelling Salesman Problem
A. Parjadis, Q. Cappart, B. Dilkina, A. Ferber, L.-M. Rousseau
Proceedings of CP 2024 conference (Best ML paper award)

Estimating Road Construction Costs with Explainable Machine Learning
R. Larocque, A.-M. Boulé, Q. Cappart
INFORMS Journal on Applied Analytics (2024)

Winning the 2023 CityLearn Challenge: a Community-based Hierarchical Energy Systems Coordination Algorithm
A. Garmendia, F. Morri, Q. Cappart, H. Le Cadre
Proceedings of ECAI 2024 conference (to appear)

Acquiring Constraints for a Non-linear Transmission Maintenance Scheduling Problem
H. Barral, M. Gaha, A. Dems, A. Côté, F. Nguewouo, Q. Cappart
Proceedings of CPAIOR 2024 conference

Towards a Generic Representation of Combinatorial Problems for Learning-Based Approaches
L. Boisvert, H. Verhaeghe, Q. Cappart
Proceedings of CPAIOR 2024 conference

MARCO: A Memory-Augmented Reinforcement framework for Combinatorial Optimization
A. Garmendia, J. Ceberio, A. Mendiburu, Q. Cappart
Proceedings of IJCAI 2024 conference

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
A. Drouin, M. Gasse, M. Caccia, I. Laradji, M. Del Verme, T. Marty, L. Boisvert, M. Thakkar, Q. Cappart, D. Vazquez, N. Chapados, A. Lacoste
ICLR 2024 Workshop on Large Language Model (LLM) Agent

An Improved Neuro-Symbolic Architecture to Fine-Tune Generative AI Systems”
C. Yin, Q. Cappart, G. Pessant
Proceedings of CPAIOR 2024 conference

A machine-learning-based column generation heuristic for electric bus scheduling
J. Gerbaux, Q. Cappart, G. Desaulniers
Les Cahiers du GERAD (2024)

Deep Learning for Data-Driven Districting-and-Routing
A. Ferraz, Q. Cappart, T. Vidal
Arxived (2024)

Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems
H. Hoppe, T. Enders, Q. Cappart, M. Schiffer
Proceedings of L4DC conference (to appear)

Decision Diagrams in Space!
I. Rudich, Q. Cappart, M. López-Ibáñez, M. Römer, L.-M. Rousseau
Arxived (2023)

The Unsolved Challenges of LLMs as Generalist Web Agents: A Case Study
R. Assouel, T. Marty, M. Caccia, I. H. Laradji, A. Drouin, S. Rajeswar, H. Palacios, Q. Cappart, D. Vazquez, N. Chapados, M. Gasse, A. Lacoste
Foundation Models for Decision Making Workshop (affilated to NeurIPS 2023 conference)

Improved Peel-and-Bound: Methods for Generating Dual Bounds with Multivalued Decision Diagrams
I. Rudich, Q. Cappart, L.-M. Rousseau
Journal of Artificial Intelligence Research (2023)

Combinatorial optimization and reasoning with graph neural networks
Q. Cappart, D. Chételat, E. Khalil, A. Lodi, C. Morris, P. Veličković
Journal of Machine Learning Research (2023)

Learning a Generic Value-Selection Heuristic Inside a Constraint Programming Solver
T. Marty, T. François, P. Tessier, L. Gauthier, L.-M. Rousseau, Q. Cappart
Proceedings of CP 2023 conference (Distinguished paper)

Dynamic Routing and Wavelength Assignment with Reinforcement Learning
P. Kafaei, Q. Cappart, N. Chapados, H. Pouya, L.-M. Rousseau
INFORMS Journal on Optimization (2023)

Explaining the Behavior of Reinforcement Learning Agents using Association Rules
Z. Parham, V.T. de Lille, Q. Cappart
Proceedings of LION 2023 conference

Repositioning Fleet Vehicles: a Learning Pipeline
A. Parjadis, Q. Cappart, Q. Massoteau, L.-M. Rousseau,
Proceedings of LION 2023 conference

Peel-and-Bound: Generating Stronger Relaxed Bounds with Multivalued Decision Diagrams
I. Rudich, Q. Cappart, L.-M. Rousseau
Proceedings of CP 2022 conference (Best paper award)

Scheduling the Equipment Maintenance of an Electric Power Transmission Network Using Constraint Programming
L. Popovic, A. Côté, M. Gaha, F. Nguewouo, Q. Cappart
Proceedings of CP 2022 conference

Improving Variable Orderings of Approximate Decision Diagrams Using Reinforcement Learning
Q. Cappart, D. Bergman, L.-M. Rousseau, I. Prémont-Schwarz, A. Parjadis
INFORMS Journal on Computing (2022)

Learning the travelling salesperson problem requires rethinking generalization
C. K. Joshi, Q. Cappart, L.-M. Rousseau, T. Laurent
Constraints Journal (2022)

On Causal Inference for Data-free Structured Pruning
M. Ferianc, A. Sankaran, O. Mastropietro, E. Saboori, Q. Cappart
Proceedings of ITCI’22 workshop (affilated to AAAI 2022 conference)

The Machine Learning for Combinatorial Optimization Competition (ML4CO): Results and Insights
M. Gasse, Q. Cappart, A. Parjadis, et al.
NeurIPS 2021 Competitions and Demonstrations Track

Graph neural networks and deep reinforcement learning for simultaneous beam orientation and trajectory optimization of Cyberknife
P. Kafaei, Q. Cappart, M.-A. Renaud, N. Chapados, L.-M. Rousseau
Physics in Medicine & Biology Journal (2021)

Improving branch-and-bound using decision diagrams and reinforcement learning
A. Parjadis, Q. Cappart, L.-M. Rousseau, D. Bergman
Proceedings of CPAIOR 2021 conference

SeaPearl: A Constraint Programming Solver guided by Reinforcement Learning
F. Chalumeau, I. Coulon, Q. Cappart, L.-M. Rousseau
Proceedings of CPAIOR 2021 conference

Combining reinforcement learning and constraint programming for combinatorial optimization
Q. Cappart, T. Moisan, L.-M. Rousseau, I. Prémont-Schwarz, A. Cire
Proceedings of AAAI 2021 conference

Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning
Q. Cappart, E. Goutierre, D. Bergman, L.-M. Rousseau
Proceedings of AAAI 2019 conference

How to Evaluate Machine Learning Approaches for Combinatorial Optimization: Application to the Travelling Salesman Problem
A. François, Q. Cappart, L.-M. Rousseau
ArXived (2019)