Human3R is a unified, feed-forward framework for online 4D human-scene reconstruction, in the world frame, from casually captured monocular videos. It jointly recovers global multi-person SMPL-X bodies ("everyone"), dense 3D scene ("everywhere"), and camera trajectories in a single forward pass ("all-at-once").
TL;DR: Inference with One model, One stage; Training in One day using One GPU
Paper: Human3R: Everyone Everywhere All at Once Project Page: https://fanegg.github.io/Human3R/ Code: https://github.com/fanegg/Human3R
To run the inference demo, you can use the following command (assuming you have followed the installation steps from the GitHub repository):