REST3D:从单张图像重建物理稳定的 3D 场景
REST3D: Reconstructing Physically Stable 3D Scenes from a Single Image

原始链接: https://shirleymaxx.github.io/REST3D/

REST3D 是一个创新的框架,旨在从单张 RGB 图像重建物理稳定的 3D 场景,从而实现仿真就绪数字资产的创建。当前方法生成的模型往往看似合理,却存在物体悬浮或穿模等物理不稳定问题,而 REST3D 通过将物理推理与 3D 重建相结合,弥补了这一差距。 该过程始于一种“代理式”物理场景理解技术,它基于重力和支撑关系来映射物体及其关联,从而建立结构先验。在此基础上,该框架初始化 3D 模型,并结合树状引导对齐与物理约束优化。这种方法在严格保持原始图像视觉完整性的同时,解决了物理不一致的问题。 实验结果表明,REST3D 在合成数据集和真实世界数据集上均显著降低了物理误差,并增强了仿真稳定性。通过架起视觉表征与物理现实之间的桥梁,该框架在 VR 人机交互等沉浸式应用中表现出色,能够将静态图像转化为功能性、可交互的 3D 环境。

Sorry.
相关文章

原文

Reconstructing physically stable 3D scenes from a single RGB image enables casual images to be converted into simulation-ready digital assets for applications such as immersive interaction and content creation. However, existing single-image reconstruction methods fall short in capturing the physical structure of a scene. As a result, they often produce geometrically plausible but physically inconsistent results, including object floating and penetration, which lead to unstable behavior in physics simulations. Image-conditioned scene generation methods improve physical plausibility but often rely on strong scene priors, yielding plausible yet inaccurate object arrangements that fail to match the input image. We propose REST3D, a single-image reconstruction framework that can REconstruct physically STable 3D scenes by integrating physical scene understanding with physics-constrained refinement. We first introduce an agentic physical scene understanding technique that constructs a scene-tree representation capturing object physical states and inter-object relationships from a gravity-support perspective, providing a structural prior for reconstruction. Leveraging this structure, we initialize the scene using image-to-3D models, followed by scene-tree-guided alignment and physics-constrained optimization to resolve physical violations while preserving visual consistency with the input image. Experiments show that our method significantly reduces physical errors and improves simulation stability on both synthetic and real-world datasets while maintaining strong reconstruction quality. We further demonstrate the reconstructed scenes in VR-based human-object interaction, showing their potential for immersive applications.

联系我们 contact @ memedata.com