Simulation Matters.

Herbie Wright

2026 June 9

1 Introduction

On February 16, 2026, I had the opportunity to participate in a structured debate—something I had never done before. The debate was centered around the role of simulation in robotics. See, there are a lot of problems in robotics that feel like they would just go away if simulation was better. But simulation in robotics, while being quite impressive, still lacks some things we really wish it had. Things like photo-realism, modeling certain non-rigid dynamics, and the whole sim-to-real gap in general come to mind. Despite this, I am optimistic; I don’t think we should abandon simulation to solely focus on real-world behavior cloning. Even now, our methods for simulating things lets you do a lot of useful stuff in robotics. Conversely, I also think there is quite a bit of room to improve current simulators. In this post, I want to talk about it.

2 Ways To Use Your Simulator

Building good simulation has been a long-standing pursuit in robotics, and for good reason. Simulators have proven to be useful for things like evaluation, data generation, and model-based policy synthesis, among others.

Evaluation: If you have multiple policies and want to compare them on a certain task, it can be quite costly to compare them in the real world. You need to obtain and set up an expensive robot and painstakingly reset the task until you have completed enough trials to make confident statements about which policy is better—not to mention safety concerns, and all the other added difficulty that comes from working with hardware. Simulation is a very attractive alternative. Modern physics simulation has been used to cheaply evaluate various vision-language-action models. For example, [1], where they find a very strong correlation between simulation performance and real-world performance on their benchmark. Similarly, the TRI team used high-quality simulation to evaluate their large behavior models in [2]. In both of these cases, simulation offered a cheap, useful alternative to the headaches of real-world robotic evaluations. Simulation is also much easier to standardize and benchmark than real-world setups. That’s why the most popular robot learning benchmarks are simulation-based (e.g. LIBERO [3])

Images from LIBERO [3].

Cheap data: Robotics has a bit of a data problem—there doesn’t seem to be enough of it [4]. A promising direction to get orders of magnitude more data is through simulation. This is the insight behind the concept of sim-and-real co-training, explored in [5]. They used simulation to generate much more data than they had from the real world, and trained policies by randomly sampling from both their sim and real datasets. They found that co-trained policies performed significantly better than those trained on only real world data—even when the sampling percentage was 99% from simulation data. Another good paper on co-training can be found in [6]. One big benefit of using simulation to generate data is that it can be procedurally done via leveraging classical methods and privileged information; when you are in simulation, you don’t have to deal with the pesky uncertainty present in the real world.

Model-based methods: A simulator is a model, and robotics has many techniques to turn a model into a policy. These include reinforcement learning (RL) techniques (like proximal policy optimization [7]), sampling-based model predictive control methods (like predictive sampling [8] and model predictive path integral [9]), and even methods that use easier-to-optimize simulation formulations used to build controllers (like differential simulation [10] or linear complementarity systems [11][13]). These techniques are made possible because of the progress in robotic simulation methods.

3 The Sim-to-Real Gap

The usual argument against simulation for robotics comes from the gap between the simulation dynamics and reality, commonly called the reality gap. This gap propagates through to policies trained in simulation into the sim-to-real gap. The sim-to-real gap is the performance drop of such policies when moved from simulation to reality. This gap, critics argue, will always be a bit too large [14]. I’m not so sure, and here I want to explain why. In short, I think it has been shown that you don’t need a perfect model to achieve sim-to-real transfer, you simply need a useful model; we already have examples of sim-to-real working well in robotics, even with imperfect models.

In Locomotion: Perhaps the most notable example is locomotion. Sim-to-real reinforcement learning has produced some incredibly robust locomotion policies. Locoformer [15] is a particular flashy example, where they can saw off the legs of a quadroped, and show that their controller can adapt to the new morphology.

Images from Locoformer [15]

In Manipulation: While perhaps less established, we also see inklings of sim-to-real reinforcement learning being possible in manipulation (e.g. ManipGen [16]). Additionally, an example of using incorrect-yet-useful models effectively in manipulation can be found in model-based control methods (e.g. Push Anything [12])

However, that is not to dismiss the sim-to-real gap entirely. The reality gap still very much exists [17], and it is a worthwhile aim to reduce it.

4 Towards Better Simulators

There are multiply ways that the reality gap manifests, and as such, there are multiple research directions aimed at closing various reality gaps in robotics simulation. A pretty easy split is rendering/observation gaps vs a physics/dynamics gaps.

Closing the Observational Gap: One of the most well-known gaps between current robot simulators and the real world is RGB rendering. It can be quite obvious what is a simulator and what is the real world by simply looking at an image—no actual motion required. Domain randomization is one technique to help overcome the visual gap, yet it is far from a perfect solution currently. Other work has explored leveraging Gaussian splatting [18] for more photorealistic rendering [19]. There was recent work at ICRA 2026 which used Unreal engine in combination with Mujoco for photorealism [20]. However, I am somewhat of the opinion that when it comes to visuals, domain randomization or inverse rendering for real-to-sim is the better way to close the observational gap, an idea that aligns with recent work from Ai2 [21]

Closing the Dynamics Gap The models used for simulation need to compress the true physics of the world into a slightly less accurate version. This means that there is a gap in the dynamics of the simulator and the real-world dynamics. One modern way to overcome this gap is to allow for black-box learning as part of the simulation (e.g: residual learning [22], learned deformable dynamics [23]). There are also efforts to improve classical simulation by developing new contact models or simulators (e.g. hydroelastic contact [24]). These efforts aim to reduce the sim-to-real gap by more closely aligning the simulation dynamics with the dynamics observed in the real world.

Image of the hydro-elastic contact model from [24]

5 Conclusion

All in all, I think improving simulation and real-to-sim methods constitute very fruitful research directions with many open problems. In my mind, better simulation is perhaps the biggest key to unlocking new robotics techniques and capabilities, and I am excited to see how research in robotic simulation evolves.

References

[1]
X. Li, K. Hsu, J. Gu, O. Mees, K. Pertsch, H. R. Walke, C. Fu, I. Lunawat, I. Sieh, S. Kirmani, and others, “Evaluating real-world robot manipulation policies in simulation,” in 8th annual conference on robot learning, 2024.
[2]
J. Barreiros, A. Beaulieu, A. Bhat, R. Cory, E. Cousineau, H. Dai, C.-H. Fang, K. Hashimoto, M. Z. Irshad, M. Itkina, and others, “A careful examination of large behavior models for multitask dexterous manipulation,” arXiv preprint arXiv:2507.05331, 2025.
[3]
B. Liu, Y. Zhu, C. Gao, Y. Feng, Q. Liu, Y. Zhu, and P. Stone, “Libero: Benchmarking knowledge transfer for lifelong robot learning,” Advances in Neural Information Processing Systems, vol. 36, pp. 44776–44791, 2023.
[4]
K. Goldberg, “Good old-fashioned engineering can close the 100,000-year ‘data gap’ in robotics,” Science Robotics, vol. 10. American Association for the Advancement of Science, p. eaea7390, 2025.
[5]
A. Maddukuri, Z. Jiang, L. Y. Chen, S. Nasiriany, Y. Xie, Y. Fang, W. Huang, Z. Wang, Z. Xu, N. Chernyadev, S. Reed, K. Goldberg, A. Mandlekar, L. Fan, and Y. Zhu, “Sim-and-real co-training: A simple recipe for vision-based robotic manipulation,” in Proceedings of robotics: Science and systems (RSS), 2025.
[6]
A. Wei, A. Agarwal, B. Chen, R. Bosworth, N. Pfaff, and R. Tedrake, “Empirical analysis of sim-and-real cotraining of diffusion policies for planar pushing from pixels,” in 2025 IEEE/RSJ international conference on intelligent robots and systems (IROS), 2025, pp. 5625–5632.
[7]
J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” arXiv preprint arXiv:1707.06347, 2017.
[8]
T. Howell, N. Gileadi, S. Tunyasuvunakool, K. Zakka, T. Erez, and Y. Tassa, “Predictive sampling: Real-time behaviour synthesis with mujoco,” arXiv preprint arXiv:2212.00541, 2022.
[9]
G. Williams, P. Drews, B. Goldfain, J. M. Rehg, and E. A. Theodorou, “Aggressive driving with model predictive path integral control,” in 2016 IEEE international conference on robotics and automation (ICRA), 2016, pp. 1433–1440.
[10]
J. Xu, M. Macklin, V. Makoviychuk, Y. Narang, A. Garg, F. Ramos, and W. Matusik, Accelerated policy learning with parallel differentiable simulation,” in International conference on learning representations, 2022.
[11]
A. Aydinoglu, A. Wei, W.-C. Huang, and M. Posa, “Consensus complementarity control for multicontact mpc,” IEEE Transactions on Robotics, vol. 40, pp. 3879–3896, 2024.
[12]
H. Bui, Y. Gao, H. Yang, E. Cui, S. Mody, B. Acosta, T. S. Felix, B. Bianchini, and M. Posa, “Push anything: Single-and multi-object pushing from first sight with contact-implicit MPC,” arXiv preprint arXiv:2510.19974, 2025.
[13]
H. Wright and M. Posa, “Uncertainty-aware contact-implicit MPC via GPU-parallelized ADMM,” in ICRA workshop on contact-rich manipulation, 2026.
[14]
S. Levine, Sporks of AGI.” Learning and Control (Substack), 2025.
[15]
M. Liu, D. Pathak, and A. Agarwal, “LocoFormer: Generalist locomotion via long-context adaptation,” in Conference on robot learning, 2025, pp. 532–546.
[16]
M. Dalal, M. Liu, W. Talbott, C. Chen, D. Pathak, J. Zhang, and R. Salakhutdinov, “Local policies enable zero-shot long-horizon manipulation,” in 2025 IEEE international conference on robotics and automation (ICRA), 2025, pp. 13875–13882.
[17]
E. Aljalbout, J. Xing, A. Romero, I. Akinola, C. R. Garrett, E. Heiden, A. Gupta, T. Hermans, Y. Narang, D. Fox, and others, “The reality gap in robotics: Challenges, solutions, and best practices,” Annual Review of Control, Robotics, and Autonomous Systems, vol. 9, 2025.
[18]
B. Kerbl, G. Kopanas, T. Leimkühler, G. Drettakis, and others, “3d gaussian splatting for real-time radiance field rendering.” ACM Trans. Graph., vol. 42, no. 4, pp. 139–1, 2023.
[19]
P. Dan, K. Kedia, A. Chao, E. W. Duan, M. A. Pace, W.-C. Ma, and S. Choudhury, “X-sim: Cross-embodiment learning via real-to-sim-to-real,” arXiv preprint arXiv:2505.07096, 2025.
[20]
J. Embley-Riches, J. Liu, S. Julier, and D. Kanoulas, “Unreal robotics lab: A high-fidelity robotics simulator with advanced physics and rendering,” in 2026 IEEE International Conference on Robotics and Automation (ICRA), 2026.
[21]
A. Deshpande, M. Guru, R. Hendrix, S. Jauhri, A. Eftekhar, R. Tripathi, M. Argus, J. Salvador, H. Fang, M. Wallingford, and others, “MolmoB0T: Large-scale simulation enables zero-shot manipulation,” arXiv preprint arXiv:2603.16861, 2026.
[22]
M. Saveriano, Y. Yin, P. Falco, and D. Lee, “Data-efficient control policy search using residual dynamics learning,” in 2017 IEEE/RSJ international conference on intelligent robots and systems (IROS), 2017, pp. 4709–4715.
[23]
K. Zhang, B. Li, K. Hauser, and Y. Li, “Particle-grid neural dynamics for learning deformable object models from RGB-d videos,” in Proceedings of robotics: Science and systems (RSS), 2025.
[24]
R. Elandt, E. Drumwright, M. Sherman, and A. Ruina, “A pressure field model for fast, robust approximation of net contact force and moment between nominally rigid objects,” in 2019 IEEE/RSJ international conference on intelligent robots and systems (IROS), 2019, pp. 8238–8245.

You can find my personal website here