001.891.573 Simulation model of information interaction in a population of agents
doi: 10.18698/2309-3684-2025-3-85102
The article is devoted to the development of a multi-agent evacuation model that takes into account the physical characteristics of agents (age categories, speed, maneuverability), the level of panic, social interactions in groups of the “leader-follower” type, and the presence of several evacuation exits opening at a given interval (an interval of 6 seconds was considered). The Multi-Agent Proximal Policy Optimization (MAPPO) algorithm is used to train the behavior of agents. A hybrid action space is used, combining discrete output selection and continuous motion control. Training is carried out according to the curriculum learning principle: with a gradual increase in the number of agents. This allows agents to adapt to complex scenarios with high crowding and improves the generalization ability of the model for experiments with different numbers of agents. The environment is a room of given dimensions (rooms of 15×20 m were considered) with a given number of exits of a certain width (3 exits of 1.5 m each were considered). The model includes the logic of disseminating information about exits. Individual agents learn about new open exits within a radius of 5 m and transmit the signal to their neighbors. Leaders initially know about all available exits regardless of the distance. A mechanism is provided for spreading panic depending on the crowding of agents, the distance to the exit, and the time elapsed since the start of the evacuation. Specific rules of behavior for social groups are introduced: leaders make strategic decisions, and elderly followers receive a speed bonus when following the leader. In the current implementation, the choice of exit for individual agents is based on the shortest distance from the agent to it. In social groups, the decision to choose an exit is made by the leader based on the average distance of all agents. Computational experiments were conducted for 40 agents in various scenarios: with a different number of leaders (2–16) and without groups (individual evacuation). The computational experiments showed that under the considered conditions, scenarios with social groups lead to faster evacuation (the total time was reduced by about 38%). Also, during group evacuation, vulnerable agents receive the greatest advantage, in this case, the elderly. The optimal number of leaders is 4–6: further increase in their number does not provide statistically significant improvements. According to the results of the experiments, a decrease in the number of collisions and a lower level of panic with this number of leaders was recorded. The obtained results demonstrate the practical applicability of the MAPPO approach to the problems of analyzing evacuation processes in realistic conditions.
Силинская А.А., Богомолов А.С., Кушников В.А. Моделирование эвакуации из помещений с учетом социальных групп и множественных выходов. Математическое моделирование и численные методы, 2025, № 3, с. 85–102.