Multi-UAV encirclement training optimization method based on implicit curriculum experience partitioning

This paper proposes an implicit curriculum learning framework for multi-drone coordination, employing autonomous phased training via dynamic experience replay. By integrating stage-critical sample prioritization and role-adaptive exploration policies, the method resolves local-global strategy confli...

Full description

Bibliographic Details
Published in:Инноватика-2025 : сборник материалов XXI Международной школы-конференции студентов, аспирантов и молодых ученых, 28-30 апреля 2025 г., г. Томск, Россия С. 551-558
Main Author: Ling, Y. H.
Format: Book Chapter
Language:English
Subjects:
Online Access:https://vital.lib.tsu.ru/vital/access/manager/Repository/koha:001272935
Перейти в каталог НБ ТГУ
Description
Summary:This paper proposes an implicit curriculum learning framework for multi-drone coordination, employing autonomous phased training via dynamic experience replay. By integrating stage-critical sample prioritization and role-adaptive exploration policies, the method resolves local-global strategy conflicts without manual curriculum design. Validated in dynamic environments, the framework demonstrates enhanced collaborative efficiency and hierarchical strategy learning capabilities, providing systematic solutions for phased multi-agent tasks.
Bibliography:Библиогр.: 12 назв.
ISBN:9785936297311