Bahamid, Alala and Mohd Ibrahim, Azhar and Shafie, Amir Akramin (2024) Crowd evacuation with human-level intelligence via neuro-symbolic approach. Advanced Engineering Informatics, 60. pp. 1-17. ISSN 1474-0346
Abstract
Understanding human responses to crowd emergencies is extremely complex, yet it plays a significant role in engineering construction design and crowd safety. Individual choices, reasoning, and behaviours cannot be fully described by equations or rule-based methods. Accordingly, this research proposes a neuro-symbolic approach for modelling agents with human-level capabilities of reasoning and performance in an emergency evacuation. The proposed approach combines deep reinforcement learning with evaluative fuzzy logic to address two challenges: the large amounts of data, time, and trial and error required for policy optimization, and the reliance on an assumed reward function that may not be practical in real scenarios. The model handles the complexity of the environment and of the decision-making process via deep reinforcement learning, and enhances cognitive and visual intelligence via an evaluative fuzzy function. This function continuously evaluates agent actions during training to boost pedestrians' active response to their surroundings, with full awareness of time, and thereby supports a human-level capacity for reasoning. Moreover, the proposed model reduces the computational demands of deep reinforcement learning and enables faster learning of new situations. The findings indicate that the proposed model can produce behavioural patterns that align with real observations of crowd evacuation, such as laminar flow, stop-and-go flow, and crowd turbulence. In addition, a new evacuation behaviour is observed: some pedestrians avoid congestion at the exit until the density reduces, which reflects a level of human reasoning. The proposed model shows higher accuracy and much faster convergence than the pure proximal policy optimization model, while needing as little as 2 to 8 percent of the training timesteps. Meanwhile, the reliability study records an increase in the mean evacuation time from 39.7 s to 155.09 s, and in its standard deviation from 1.06 s to 7.39 s, as crowd size increases from 15 to 200 pedestrians, which implies a rise in uncertainty. Therefore, we believe that this work can provide crowd authorities and construction engineers with insights into complex behaviour and critical conditions, helping them make better evacuation plans and sustainable designs to ensure crowd safety. It also offers a promising way to compensate for the evident lack of data on critical crowd conditions.
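The reliability figures quoted in the abstract can be read through the coefficient of variation (standard deviation divided by mean), which expresses spread relative to the typical evacuation time. The short Python sketch below is only an illustrative check on the two reported data points, not a calculation taken from the paper; the names `reported`, `coefficient_of_variation`, and `cv_by_crowd_size` are hypothetical.

```python
# Summary statistics as quoted in the abstract (evacuation time in seconds).
# Keys are crowd sizes (number of pedestrians).
reported = {
    15: {"mean_s": 39.7, "std_s": 1.06},
    200: {"mean_s": 155.09, "std_s": 7.39},
}


def coefficient_of_variation(mean: float, std: float) -> float:
    """Return the relative dispersion: standard deviation divided by the mean."""
    if mean <= 0:
        raise ValueError("mean must be positive")
    return std / mean


# Relative spread of evacuation time for each reported crowd size.
cv_by_crowd_size = {
    size: coefficient_of_variation(stats["mean_s"], stats["std_s"])
    for size, stats in reported.items()
}

for size, cv in cv_by_crowd_size.items():
    print(f"{size:>3} pedestrians: CV = {cv:.2%}")
```

Under this reading, the relative spread grows from roughly 2.7% to roughly 4.8%, so the dispersion rises faster than the mean, which is consistent with the abstract's remark that uncertainty increases with crowd size.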
Item Type: | Article (Journal) |
---|---|
Uncontrolled Keywords: | Crowd evacuation; Neuro-symbolic approach; Decision intelligence; Deep reinforcement learning; Fuzzy logic |
Subjects: | T Technology > T Technology (General) |
Kulliyyahs/Centres/Divisions/Institutes: | Kulliyyah of Engineering; Kulliyyah of Engineering > Department of Mechatronics Engineering |
Depositing User: | Dr Azhar Mohd Ibrahim |
Date Deposited: | 24 Jan 2024 15:04 |
Last Modified: | 07 May 2024 14:53 |
URI: | http://irep.iium.edu.my/id/eprint/110539 |