From simulation to reality: teaching a robot arm to pick, think, and adapt.
Explore on GitHub Twitter PostFrontier research endeavour to implement Flow Policy Optimization (FPO) for sim2real zero-shot deployment, comparing its effectiveness against baseline Proximal Policy Optimization (PPO) for the challenging cube pickup task.
The SO100 arm tackling the PickCube task in MuJoCo simulation.
Evaluation rewards during PPO training showing consistent improvement.
The same policy transferred to real hardware — picking cubes in the physical world.