This research masterfully leverages reinforcement learning to turn low gravity from a constraint into a mobility advantage, effectively bridging the gap between terrestrial locomotion and orbital mechanics. It represents a paradigm shift where robots no longer just navigate terrain, but actively manipulate their own physics to conquer extreme environments.
Deep Dive
Prerequisite Knowledge
- No data available.
Where to go next
- No data available.
Deep Dive
Towards Low-gravity Planetary Exploration using RL for Walking, Jumping & In-flight Attitude ControlAdded:
This video presents our work towards lowgravity planetary exploration using reinforcement learning for walking, jumping, and in-flight attitude control.
We present a series of reinforcement learning policies that when combined in a taskbased deployment in a simulated Mars environment, enables the robot to overcome obstacles larger than itself terrain where traditional legged robots and rovers would struggle. The system uses walking for normal traversal, forward jumps to clear obstacles, vertical jumps for elevated observation, and attitude control during all flight faces to achieve target orientation for safe controlled landings. In this simulated mission, a waypointbased state machine coordinates policy execution.
The robot walks or jumps to a specified waypoint while the state machine automatically transitions between policies based on sensor feedback and waypoint commands.
The quarterbird robot Olympus was designed and optimized for low-gravity jumping and attitude control. The illustrated control architecture was deployed in simulation and on the physical robot with policies generating actions based on task specific observations. The vertical jumping policy is trained to achieve a commanded height. We also investigated the effect of springs on achievable jump height by training a policy with simulated parallel springs. The forward jumping policy is tasked to perform forward jumps to a target position. This task was also investigated with and without springs to understand their effect on jump distance and control authority. The attack control policy is trained to regulate the body orientation by using its legs as reaction masses during flight. Its objective is to maintain the target orientation between takeoff and landing, ensuring safe feet first touchdowns. This was validated through single axis step responses and composite 3D reorientation maneuvers in simulation.
To showcase the systems capabilities, we commended aggressive inflight orientations that challenge the jumping policy during landing. Although the signal policy typically ensures safe landings, these tests deliberately created difficult conditions to demonstrate both the orientation authority and landing robustness.
To validate simp transfer of the control policy, we deployed it on the physical robot at ESS orbital robotics lab. The robot was mounted on a rotating rod attached to a air bearing platform floating on an ultra flat floor enabling near frictionless rotation and translation. The policy was evaluated through single axis tests for roll pitch and yaw commanding step changes of 90 and 180° to the target orientation.
Mounted on the free floating platform, the robot bounces between walls, receiving 180° reorientation command at each push off to maintain feet forward orientation towards the approaching wall. This validates the policy's ability to perform rapid attitude corrections during free flight under dynamic conditions. This work demonstrates that reinforcement learning can enable complex locomotion for planetary exploration in reduced gravity environments. By combining walking, jumping, and attitude control policies, we achieve capabilities that extend beyond those of traditional leg robots and wheeled rovers, opening new possibilities for exploring challenging terrains such as Martian lava tubes.
Thank you.
Related Videos
Beyond Robotics | European Rover Challenge 2026
beyondrobotics
189 views•2026-06-01
Beatbot Sora70: JetPulse Technology and AI obstacle avoidance and navigation!
DroidModderX
26K views•2026-06-02
Tesla FSD 14.3.3 Hits Phoenix Streets - FIRST LOOK
anthonystesla
114 views•2026-05-29
Elon Musk Just Revealed Fremont Line for Optimus Gen 3 Mass Production
TheAINexusOfficial
180 views•2026-05-30
人機一体「零式人機 ver.2」 子ども企画【おもしろ発見!モビリティー】 #乗り物 #automobile #robot #shorts
KyodoNews
1K views•2026-05-28
China’s New Luna AI Robot Looks Shockingly Human...
NextGenHumanoids
850 views•2026-05-28
Reachy Mini: the $300 open source robot you can actually hack — Andres Marafioti, Hugging Face
aiDotEngineer
662 views•2026-05-29
柔軟指×AI画像処理食品の仕分け作業システム!#柔軟指 #ロボット #自動化 #製造業をもっと盛り上げたい
KiQ_Robotics_Corp.
113 views•2026-05-28











