Learning Gait Using a Neuromusculoskeletal Model and Imitation Learning

Can we learn natural looking humanoid locomotion skills with reinforcement learning?

Project Overview

This research addresses limitations in current neuromechanical control models, which struggle with dynamic tasks, environmental adaptation, and long-term planning. While deep reinforcement learning (RL) offers advantages for high-dimensional neuromusculoskeletal systems, existing implementations often produce unnatural movements despite achieving high task rewards. In this project, I explored combining reinforcement learning with imitation learning techniques to generate more physiologically plausible and human-like movements.

Report

Jumping
Large Knee
Unnatural walking examples: Walking by Jumping (left) and Large Knee Extension (right)

Methodology

Key Experiments

The research focused on two primary tasks:

  1. Balancing: Achieved successful static balance maintenance with convergence at approximately 400,000 environment steps, demonstrating robustness across both delayed and non-delayed sensory feedback conditions.
  2. Walking: Explored walking behaviors with non-delayed observations, achieving policy convergence at approximately 5 million environment steps, though with challenges in producing natural gait patterns.

Results and Insights

The preliminary findings revealed several important insights:

Conclusion and Future Directions

The project demonstrated that directly applying imitation rewards into DRL algorithms does not automatically yield natural walking behaviors. Success requires careful reward function design, extensive hyperparameter optimization, and consistent implementation environments. Future work will focus on improved imitation learning strategies and exploring unsupervised skill discovery to capture different movement strategies observed in humans.

This research contributes to the growing field of biomechanically accurate reinforcement learning for human movement simulation, with potential applications in rehabilitation robotics, prosthetics design, and human movement understanding.