DSpace Repository

Reinforcement Learning-Based Human-Machine Co-Adaptation via Policy Gradient

Show simple item record

dc.contributor.advisor Tahboub, Karim
dc.contributor.author Maharmeh, Elias
dc.date.accessioned 2023-02-26T08:56:29Z
dc.date.available 2023-02-26T08:56:29Z
dc.date.issued 2023-02-01
dc.identifier.uri http://localhost:8080/xmlui/handle/123456789/8829
dc.description CD, no of pages 83, ماجستير انظمة ذكية 1/2023, 31163
dc.description.abstract Human Machine Co-adaptation (HMCo) is a critical problem in the design of intelligent systems that interact with humans. This thesis proposes a general framework for solving HMCo problems using a reinforcement-based approach called the policy gradient algorithm. The thesis goal is to empower the machine with the ability to learn a policy or a strategy in order to co-adapt to human behaviors. The proposed approach is based on the assumption of rationality on the human side and involves learning a policy that co-adapts to dynamic environments and aids the human while performing a specific task. The effectiveness of the proposed approach is demonstrated through case studies, including both direct and indirect shared control, and some of the challenges and limitations that must be addressed in order to further advance the field are highlighted. These challenges include the sensitivity of the algorithm to hyperparameters, the issue of local minima, and the complexity of the optimization process. The impact of the human factor during the training process is also considered, as is the need to enhance sampling complexity in order to handle the limitations of real-world interaction. This thesis makes several key contributions to the fields of HMCo and intelligent systems design. First, it provides a general framework for solving HMCo problems that is based on policy gradient methods and is applicable to a wide range of environments and tasks. Second, it demonstrates and tests the feasibility and effectiveness of the proposed approach through case studies involving both direct and indirect shared control. Third, it identifies key challenges and limitations that must be addressed in order to further advance the field, such as the sensitivity of the algorithm to hyperparameters and the complexity of the optimization process. en_US
dc.language.iso en en_US
dc.publisher جامعة بوليتكنك فلسطين - ماجستير انظمة ذكية en_US
dc.subject Co-Adaptation en_US
dc.subject Human-Machine en_US
dc.title Reinforcement Learning-Based Human-Machine Co-Adaptation via Policy Gradient en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account