The resilience of critical infrastructure networks (CINs) after disruptions, such as those caused by natural hazards, depends on both the speed of restoration and the extent to which operational functionality can be regained. Allocating resources for restoration is a combinatorial optimal planning problem that involves determining which crews will repair specific network nodes and in what order. This paper presents a novel graph-based formulation that merges two interconnected graphs, representing crew and transportation nodes and power grid nodes, into a single heterogeneous graph. To enable efficient planning, graph reinforcement learning (GRL) is integrated with bigraph matching. GRL is utilized to design the incentive function for assigning crews to repair tasks based on the graph-abstracted state of the environment, ensuring generalization across damage scenarios. Two learning techniques are employed: a graph neural network trained using Proximal Policy Optimization and another trained via Neuroevolution. The learned incentive functions inform a bipartite graph that links crews to repair tasks, enabling weighted maximum matching for crew-to-task allocations. An efficient simul
Existing collaborative WiFi sensing systems rely on perfect node synchronization and complete data availability. However, real-world edge deployments suffer from heterogeneous computing and network dropouts, leading to asynchronous and incomplete features. We propose CREWS, a robust collaborative sensing framework that inherently resists this network volatility. First, CREWS employs a topology-agnostic aggregator invariant to the arrival order and subset size of incoming features. Second, rather than discarding delayed observations, it utilizes a staleness-aware adaptive replay mechanism. By treating stale features from lagging nodes as system-induced hard samples, CREWS transforms synchronization delays into beneficial training regularization. We theoretically prove the joint convergence of this architecture and demonstrate how replay bounds the bias-variance trade-off. Extensive evaluations and an 8-node heterogeneous hardware testbed demonstrate its superior resilience. Under severe conditions, i.e., a 50\% transient dropout rate or out-of-distribution jitter, CREWS restricts accuracy degradation to merely 2.2 percentage points, substantially outperforming state-of-the-art baselin
Metro crew planning is a key component of smart city development as it directly impacts the operational efficiency and service reliability of public transportation. With the rapid expansion of metro networks, effective multi-line scheduling and emergency management have become essential for large-scale seamless operations. However, current research focuses primarily on individual metro lines, with insufficient attention to cross-line coordination and rapid replanning during disruptions. Here, a unified optimization framework is presented for multi-line metro crew planning and replanning with a heterogeneous workforce. Specifically, a hierarchical time-space network model is proposed to represent the unified crew action space, and computationally efficient constraints and formulations are derived for the crew's heterogeneous qualifications and preferences. Solution algorithms based on column generation and shortest path adjustment are further developed, utilizing the proposed network model. Experiments with real data from Shanghai and Beijing Metro demonstrate that the proposed methods outperform benchmark heuristics in both cost reduction and task completion, and achieve notable effici
Aircraft and crew are two major resources that ensure the smooth operations of airlines. However, with the anticipated growth in the aviation industry, the crew resource is predicted to be insufficient worldwide and has been one of the bottlenecks in fast-developing airlines. In addition, the mismatch between aircraft and crew has influenced the airlines' operation and limited the full usage of all resources. To resolve this problem, we propose a tactical fleet assignment and crew pairing problem with crew flight time allocation (TFACPP). The basic integrated model is reformulated using Benders decomposition, where the Benders master problem (BMP) poses the most significant computational barrier. To efficiently solve the BMP, we propose a column generation algorithm. The TFACPP can provide superior solutions compared to the equal allocation of crew resources. In addition, we provide a quantitative method for evaluating the scarcities of crew and aircraft resources and the matching degree between crew and aircraft based on the shadow prices of the proposed model. This information can provide rich managerial insights regarding the acquisition, replacement, and transition of crew a
In this paper, we reschedule the duties of train drivers one day before the operation. Due to absent drivers (e.g., because of sick leave), some trains have no driver. Thus, duties need to be rescheduled for the day of operation. We start with a feasible crew schedule for each of the remaining operating drivers, a set of unassigned tasks originally assigned to the absent drivers, and a group of standby drivers with fixed start time, end time, start depot, and end depot. Our aim is to generate a crew schedule with as few canceled or changed tasks as possible. We present a tabu-search-based approach for crew rescheduling. We also adapt a column-generation approach with the same objective function and equivalent restrictions as the benchmark for comparing the results, computational time, and space usage. Our tabu-search-based approach needs both less computation time and space than the column-generation approach to compute an acceptable result. We further test the performance of our approach under different settings. The data used in the experiments originated from a regional passenger-train system around Stockholm, Sweden and was provided by Mälartåg.
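As an illustration of the tabu-search idea described above, the following is a minimal sketch and not the authors' method. It assumes a simplified setting: each unassigned task is a hypothetical `(start, end)` pair, each standby driver is a hypothetical `(start, end)` availability window, depots are ignored, and the only objective is the number of canceled tasks. The move neighborhood, the tabu tenure, and all function names are assumptions made for this example.

```python
from collections import deque
from itertools import product


def feasible(assign, tasks, drivers):
    """True if each driver's tasks fit inside its window and do not overlap."""
    for d, (d_start, d_end) in enumerate(drivers):
        mine = sorted(tasks[t] for t, a in enumerate(assign) if a == d)
        for i, (s, e) in enumerate(mine):
            if s < d_start or e > d_end:
                return False
            if i > 0 and s < mine[i - 1][1]:
                return False
    return True


def cost(assign):
    """Number of canceled tasks (tasks assigned to no driver)."""
    return sum(a is None for a in assign)


def tabu_search(tasks, drivers, iterations=50, tenure=3):
    """Assign tasks to standby drivers, minimizing cancellations (sketch)."""
    current = [None] * len(tasks)          # start with every task canceled
    best = list(current)
    tabu = deque(maxlen=tenure)            # recently undone (task, driver) pairs
    options = [None] + list(range(len(drivers)))
    for _ in range(iterations):
        candidates = []
        for t, d in product(range(len(tasks)), options):
            if d == current[t]:
                continue
            neighbor = list(current)
            neighbor[t] = d                # move: reassign a single task
            if not feasible(neighbor, tasks, drivers):
                continue
            # Aspiration: a tabu move is allowed only if it beats the best.
            if (t, d) in tabu and cost(neighbor) >= cost(best):
                continue
            candidates.append((cost(neighbor), t, neighbor))
        if not candidates:
            break
        _, t, neighbor = min(candidates, key=lambda c: c[0])
        tabu.append((t, current[t]))       # forbid moving task t straight back
        current = neighbor
        if cost(current) < cost(best):
            best = list(current)
    return best


if __name__ == "__main__":
    tasks = [(8, 10), (9, 11), (10, 12)]   # hypothetical task times (hours)
    drivers = [(8, 12), (9, 13)]           # hypothetical standby windows
    plan = tabu_search(tasks, drivers)
    print(plan, "canceled:", cost(plan))
```

The tabu list here stores the assignment a task has just left, so the search cannot immediately undo its last few moves. A realistic implementation would also need depot continuity, labor rules, and changes to the duties of the remaining regular drivers, none of which are modeled in this sketch.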
The advent of large language models has ushered in a new era of agentic systems, where artificial intelligence programs exhibit remarkable autonomous decision-making capabilities across diverse domains. This paper explores agentic system workflows in the financial services industry. In particular, we build agentic crews with a human-in-the-loop module that can effectively collaborate to perform complex modeling and model risk management (MRM) tasks. The modeling crew consists of a judge agent and multiple agents who perform specific tasks such as exploratory data analysis, feature engineering, model selection/hyperparameter tuning, model training, model evaluation, and writing documentation. The MRM crew consists of a judge agent along with specialized agents who perform tasks such as checking compliance of modeling documentation, model replication, conceptual soundness, analysis of outcomes, and writing documentation. We demonstrate the effectiveness and robustness of modeling and MRM crews by presenting a series of numerical examples applied to credit card fraud detection, credit card approval, and portfolio credit risk modeling datasets.
Despite rapid progress in large language model (LLM)-based multi-agent systems, current benchmarks fall short in evaluating their scalability, robustness, and coordination capabilities in complex, dynamic, real-world tasks. Existing environments typically focus on small-scale, fully observable, or low-complexity domains, limiting their utility for developing and assessing next-generation multi-agent Agentic AI frameworks. We introduce CREW-Wildfire, an open-source benchmark designed to close this gap. Built atop the human-AI teaming CREW simulation platform, CREW-Wildfire offers procedurally generated wildfire response scenarios featuring large maps, heterogeneous agents, partial observability, stochastic dynamics, and long-horizon planning objectives. The environment supports both low-level control and high-level natural language interactions through modular Perception and Execution modules. We implement and evaluate several state-of-the-art LLM-based multi-agent Agentic AI frameworks, uncovering significant performance gaps that highlight the unsolved challenges in large-scale coordination, communication, spatial reasoning, and long-horizon planning under uncertainty. By providin
Post-disaster crew dispatch is a critical but computationally intensive task. Traditional mixed-integer linear programming methods often require minutes to several hours to compute solutions, leading to delays that hinder timely decision-making in highly dynamic restoration environments. To address this challenge, we propose a novel learning-based framework that integrates transformer architectures with deep reinforcement learning (DRL) to deliver near real-time decision support without compromising solution quality. Crew dispatch is formulated as a sequential decision-making problem under uncertainty, where transformers capture high-dimensional system states and temporal dependencies, while DRL enables adaptive and scalable decision-making. Earthquake-induced distribution network damage is first characterized using established seismic standards, followed by a scenario generation and reduction pipeline that aggregates probable outcomes into a single geospatial impact map. Conditioned on this map, the proposed framework generates second-level dispatch strategies, trained offline on simulated and historical events and deployed online for rapid response. In addition to substantial run
The crew rostering problem (CRP) for pilots is a complex crew scheduling task assigning pairings, or sequences of flights starting and ending at the same airport, to pilots to create a monthly schedule. In this paper, we propose an innovative solution method for the CRP that uses a windowing approach. First, using a combination of machine learning (ML) and combinatorial optimisation (CO), we quickly generate an initial solution. The solution is obtained with a sequential assignment procedure (\textit{seqAsg}) based on a neural network trained by an evolutionary algorithm. Then, this initial solution is reoptimized using a branch-and-price algorithm that relies on a windowing scheme to quickly obtain a CRP solution. This windowing method consists of decomposing the optimization horizon into several overlapping windows, and then optimizing each one sequentially. Although windowing has been successfully used in other airline applications, it had never been implemented for the CRP, due to its large number of horizontal constraints involving the whole planning horizon. We test our approach on two large real-world instances, and show that our method is over ten times faster than the stat
We study the practical setting in which regular- and reserve-crew schedules are dynamically maintained up to the day of executing the schedule. On each day preceding the execution of the schedule, disruptions occur due to sudden unavailability of personnel, making the planned regular and reserve-crew schedules infeasible for their execution day. This paper studies the fundamental question of how to repair the schedules' infeasibility in the days preceding the execution, taking into account labor regulations. We propose a robust repair strategy that maintains flexibility in order to cope with additional future disruptions. The flexibility in reserve-crew usage is explicitly considered through evaluating the expected shortfall of the reserve-crew schedule based on a Markov chain formulation. The core of our approach relies on iteratively solving a set-covering formulation, which we call the Robust Crew Recovery Problem, which encapsulates this flexibility notion for reserve crew usage. A tailored branch-and-price algorithm is developed for solving the Robust Crew Recovery Problem to optimality. The corresponding pricing problem is efficiently solved by a newly developed pulse algorithm. B
We introduce a set of open-source packages that form a highly extensible framework for quantum optimization. One design goal of the system is the inclusion of a command line based configuration system for setting up experiments. The possible options are derived using well-known Python packages and presented to the user intuitively, allowing the configuration of repeatable variational quantum optimization experiments. We give an example of using the system through the Airline Crew Pairing problem, a highly relevant industrial problem, and the MaxCut problem, for which instances of manageable size are readily available.
With the increasing deployment of artificial intelligence (AI) technologies, the potential of humans working with AI agents has been growing at a great speed. Human-AI teaming is an important paradigm for studying various aspects when humans and AI agents work together. The unique aspect of Human-AI teaming research is the need to jointly study humans and AI agents, demanding multidisciplinary research efforts from machine learning to human-computer interaction, robotics, cognitive science, neuroscience, psychology, social science, and complex systems. However, existing platforms for Human-AI teaming research are limited, often supporting oversimplified scenarios and a single task, or specifically focusing on either human-teaming research or multi-agent AI algorithms. We introduce CREW, a platform to facilitate Human-AI teaming research in real-time decision-making scenarios and engage collaborations from multiple scientific disciplines, with a strong emphasis on human involvement. It includes pre-built tasks for cognitive studies and Human-AI teaming with expandable potentials from our modular design. Following conventional cognitive neuroscience research, CREW also supports multi
Optimizing service schedules is pivotal to reliable, efficient, and inclusive on-demand mobility. This pressing challenge is further exacerbated by the increasing needs of an aging population, the oversubscription of existing services, and the lack of effective solution methods. This study addresses the intricacies of service scheduling by jointly optimizing rider trip planning and crew scheduling for a complex dynamic mobility service. The resulting optimization problems are extremely challenging computationally for state-of-the-art methods. To address this fundamental gap, this paper introduces the Joint Rider Trip Planning and Crew Shift Scheduling Problem (JRTPCSSP) and a novel solution method, called Attention and Gated GNN-Informed Column Generation (AGGNNI-CG), that hybridizes column generation and machine learning to obtain near-optimal solutions to the JRTPCSSP with real-life constraints of the application. The key idea of the machine-learning component is to dramatically reduce the number of paths to explore in the pricing problem, accelerating the most time-consuming component of the column generation. The machine learning component is a graph neural network with an
We consider robust tactical crew scheduling for a large passenger railway operator, who aims to inform crew early on about their work schedules while also maintaining the ability to respond to changes in the daily timetables. To resolve this conflict, the operator considers a template-based planning process, templates being time windows during which duties can later be scheduled. The goal is to select a cost-efficient set of templates that is robust with respect to uncertainty in the work to be performed in the operational phase. A set of templates is deemed robust when few excess duties are required to cover all work in the operational planning phase. To enable the construction of efficient template-based rosters, we impose several template rostering constraints that proxy the actual rostering rules of later planning steps. We propose a two-phase accelerated Benders decomposition algorithm that can incorporate these restrictions. Computational experiments on real-life instances from Netherlands Railways, featuring up to 948 tasks per day, show that historical planning information can be used to obtain robust templates and that parsimonious solutions can be obtained at negligible e
This paper considers a disrupted infrastructure network where the repair crew knows the locations of service outages but not the locations of actual faults. Our goal is to determine a route for a single crew to visit and repair the disruptions to restore service with minimum negative impact. We call this problem the Traveling Repairman Network Restoration Problem (TRNRP). This problem presents strong computational challenges due to the combinatorial nature of the decisions, inter-dependencies within the underlying infrastructure network, and incomplete information. Considering the dynamic nature of the decisions as a result of dynamic information revelation on the status of the nodes, we model this problem as a finite-horizon Markov decision process. Our solution approach uses value approximation based on reinforcement learning, which is strengthened by structural results that identify a set of suboptimal moves. In addition, we propose state aggregation methods to reduce the size of the state space. We perform extensive computational studies to characterize the performance of our solution methods under different parameter settings and to compare them with benchmark solution approac
This work focuses on exact methods for a Simultaneous Vehicle Routing and Crew Scheduling Problem in long-haul transport. Pickup-and-delivery requests with time windows must be fulfilled over a multi-day planning horizon. Unlike some classic approaches, the correspondence between trucks and drivers is not fixed and they can be exchanged in some locations and at any time. Drivers can also travel for free as truck passengers or take external taxis for an additional cost. The objective is to minimise the truck and taxi travel costs and the penalties for late deliveries. Routes for trucks and drivers are represented separately as directed paths in certain digraphs and then synchronised in time and space. Three compact Integer Linear Programming formulations are proposed and many families of valid inequalities are described. Extensive computational experiments are conducted on randomly generated instances. The formulations are experimentally compared and the effectiveness of the proposed valid inequalities as cutting planes in a branch-and-cut algorithm is evaluated.
Unmanned Aerial Vehicle (UAV) based communication networks (UCNs) are a key component in future mobile networking. To handle the dynamic environments in UCNs, reinforcement learning (RL) has been a promising solution attributed to its strong capability of adaptive decision-making free of the environment models. However, most existing RL-based research focuses on control strategy design assuming a fixed set of UAVs. Few works have investigated how UCNs should be adaptively regulated when the serving UAVs change dynamically. This article discusses RL-based strategy design for adaptive UCN regulation given a dynamic UAV set, addressing both reactive strategies in general UCNs and proactive strategies in solar-powered UCNs. An overview of the UCN and the RL framework is first provided. Potential research directions with key challenges and possible solutions are then elaborated. Some of our recent works are presented as case studies to inspire innovative ways to handle a dynamic UAV crew with different RL algorithms.
Deep space exploration offers the most profound opportunity for the expansion of humanity and our understanding of the Universe, but remains extremely challenging. Progress will continue to be paced by uncrewed missions followed up by crewed missions to ever further destinations. Major space powers continue to invest in crewed deep space exploration as an important national strategy. An improved model based on previous work is developed, which projects the earliest possible launch dates for human-crewed missions from cis-lunar space to selected destinations in the Solar System and beyond based on NASA's historic budget trend and overall development trends of deep space exploration research. The purpose of the analysis is to provide a projected timeframe for crewed missions beyond Mars. Our findings suggest the first human missions from a spacefaring nation or international collaboration to the Asteroid Belt and Jovian System could be scheduled as soon as ~2071 to ~2087 and ~2101 to ~2121, respectively, while a launch to the Saturn System may occur by the year ~2132, with an uncertainty window of ~2129 to ~2153.
The airline crew pairing optimization problem (CPOP) aims to find a set of flight sequences (crew pairings) that cover all flights in an airline's highly constrained flight schedule at minimum cost. Since crew cost is second only to the fuel cost, solving the CPOP is critically important for an airline. However, CPOP is NP-hard, and tackling it is quite challenging. The literature suggests that when the CPOP's scale and complexity are reasonably limited, and an enumeration of all crew pairings is possible, metaheuristics are used, predominantly Genetic Algorithms (GAs). Otherwise, Column Generation (CG) based Mixed Integer Programming techniques are used. Notably, as per the literature, a maximum of 45,000 crew pairings have been tackled by GAs. In a significant departure, this paper considers over 800 flights of a large US-based airline (with a monthly network of over 33,000 flights), and tests the efficacy of GAs by enumerating all 400,000+ crew pairings a priori. To this end, this paper proposes a domain-knowledge-driven customized-GA. The utility of incorporating domain-knowledge in GA operations, particularly initialization and crossover, is highlighted through suitable experiments
Crew Pairing Optimization (CPO) aims at generating a set of flight sequences (crew pairings), covering all flights in an airline's flight schedule, at minimum cost, while satisfying several legality constraints. CPO is critically important for airlines' business viability, considering that the crew operating cost is their second-largest expense. It poses an NP-hard combinatorial optimization problem, to tackle which, the state-of-the-art relies on relaxing the underlying Integer Programming Problem (IPP) into a Linear Programming Problem (LPP), solving the latter through the Column Generation (CG) technique, and integerization of the resulting LPP solution. However, with the growing scale and complexity of the flight networks (those with a large number of flights, multiple crew bases and/or multiple hub-and-spoke subnetworks), the utility of the conventional CG-practices has become questionable. This paper proposes an Airline Crew Pairing Optimization Framework, AirCROP, whose constitutive modules include the Legal Crew Pairing Generator, Initial Feasible Solution Generator, and an Optimization Engine built on heuristic-based CG-implementation. In this paper, besides the design of AirCROP's