混元3D-2-Turbo：在4090显卡上，约1秒内即可生成高质量的快速形状。

混元3D-2-Turbo：在4090显卡上，约1秒内即可生成高质量的快速形状。
Hunyuan3D-2-Turbo: fast high-quality shape generation in ~1s on a 4090

原始链接: https://github.com/Tencent/Hunyuan3D-2/commit/baab8ba18e46052246f85a2d0f48736586b84a33

这段代码定义了一个FastAPI应用，作为模型工作器，使用Hunyuan3D模型根据文本提示生成3D网格。它使用了多个库，包括`fastapi`、`torch`、`trimesh`以及用于背景移除、形状生成和纹理应用的自定义模块。`ModelWorker`类初始化Hunyuan3D管道，包括基于DiT的形状生成管道和可选的纹理生成管道。`/generate`端点接收文本提示和诸如种子、分辨率和引导比例等参数。它生成一个3D网格，可以选择性地简化它并添加纹理。生成的网格被保存到文件中，并返回其路径。该应用还包括一个`/status`端点来检查生成任务的状态。CORS中间件已启用，允许跨域请求。脚本接受命令行参数来配置主机、端口、模型路径、设备、并发限制以及是否启用纹理生成。

Hacker News 最新 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交登录 Hunyuan3D-2-Turbo：在 4090 上约 1 秒内快速生成高质量形状 (github.com/tencent) 9 分，由 dvrp 发布，1 小时前 | 隐藏 | 过去 | 收藏 | 2 条评论 dvrp 1 小时前 [–] 另请参阅：https://github.com/Tencent/FlashVDM 回复 Flux159 1 分钟前 | 父评论 [–] 我认为链接应该更新为此链接，因为它目前仅指向一个 git 提交。回复加入我们，参加 6 月 16-17 日在旧金山举办的 AI 初创公司学校！指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请 YC | 联系我们搜索：

稳定快速的 3D：从单个图像快速生成 3D 资产 2024-08-03

Flux：具有 12B 参数的开源文本到图像模型 2024-08-02

Llama 3 用纯 NumPy 实现 2024-05-17

Bolt3D：秒速生成3D场景 2025-03-19

原文

1+# Hunyuan 3D is licensed under the TENCENT HUNYUAN NON-COMMERCIAL LICENSE AGREEMENT 2+# except for the third-party components listed below. 3+# Hunyuan 3D does not impose any additional limitations beyond what is outlined 4+# in the repsective licenses of these third-party components. 5+# Users must comply with all terms and conditions of original licenses of these third-party 6+# components and must ensure that the usage of the third party components adheres to 7+# all relevant laws and regulations. 8+ 9+# For avoidance of doubts, Hunyuan 3D means the large language models and 10+# their software and algorithms, including trained model weights, parameters (including 11+# optimizer states), machine-learning model code, inference-enabling code, training-enabling code, 12+# fine-tuning enabling code and other elements of the foregoing made publicly available 13+# by Tencent in accordance with TENCENT HUNYUAN COMMUNITY LICENSE AGREEMENT. 14+ 115""" 216A model worker executes the model. 317"""  2236from fastapi.responses import JSONResponse, FileResponse 2337 2438from hy3dgen.rembg import BackgroundRemover 25-from hy3dgen.shapegen import Hunyuan3DDiTFlowMatchingPipeline, FloaterRemover, DegenerateFaceRemover, FaceReducer 39+from hy3dgen.shapegen import Hunyuan3DDiTFlowMatchingPipeline, FloaterRemover, DegenerateFaceRemover, FaceReducer, \ 40+    MeshSimplifier 2641from hy3dgen.texgen import Hunyuan3DPaintPipeline 2742from hy3dgen.text2image import HunyuanDiTPipeline 2843 @@ -129,17 +144,31 @@ def load_image_from_base64(image):
 129144 130145 131146class ModelWorker: 132-    def __init__(self, model_path='tencent/Hunyuan3D-2', device='cuda'): 147+    def __init__(self, 148+                 model_path='tencent/Hunyuan3D-2mini', 149+                 tex_model_path='tencent/Hunyuan3D-2', 150+                 subfolder='hunyuan3d-dit-v2-mini-turbo', 151+                 device='cuda', 152+                 enable_tex=False): 133153        self.model_path = model_path 134154        self.worker_id = worker_id 135155        self.device = device 136156        logger.info(f"Loading the model {model_path} on worker {worker_id} ...") 137157 138158        self.rembg = BackgroundRemover() 139-        self.pipeline = Hunyuan3DDiTFlowMatchingPipeline.from_pretrained(model_path, device=device) 140-        self.pipeline_t2i = HunyuanDiTPipeline('Tencent-Hunyuan/HunyuanDiT-v1.1-Diffusers-Distilled', 141-                                               device=device) 142-        self.pipeline_tex = Hunyuan3DPaintPipeline.from_pretrained(model_path) 159+        self.pipeline = Hunyuan3DDiTFlowMatchingPipeline.from_pretrained( 160+            model_path, 161+            subfolder=subfolder, 162+            use_safetensors=True, 163+            device=device, 164+        ) 165+        self.pipeline.enable_flashvdm() 166+        # self.pipeline_t2i = HunyuanDiTPipeline( 167+        #     'Tencent-Hunyuan/HunyuanDiT-v1.1-Diffusers-Distilled', 168+        #     device=device 169+        # ) 170+        if enable_tex: 171+            self.pipeline_tex = Hunyuan3DPaintPipeline.from_pretrained(tex_model_path) 143172 144173    def get_queue_length(self): 145174        if model_semaphore is None: @@ -174,31 +203,42 @@ def generate(self, uid, params):
 174203        else: 175204            seed = params.get("seed", 1234) 176205            params['generator'] = torch.Generator(self.device).manual_seed(seed) 177-            params['octree_resolution'] = params.get("octree_resolution", 256) 178-            params['num_inference_steps'] = params.get("num_inference_steps", 30) 179-            params['guidance_scale'] = params.get('guidance_scale', 7.5) 180-            params['mc_algo'] = 'mc' 206+            params['octree_resolution'] = params.get("octree_resolution", 128) 207+            params['num_inference_steps'] = params.get("num_inference_steps", 5) 208+            params['guidance_scale'] = params.get('guidance_scale', 5.0) 209+            params['mc_algo'] = 'dmc' 210+            import time 211+            start_time = time.time() 181212            mesh = self.pipeline(**params)[0] 213+            logger.info("--- %s seconds ---" % (time.time() - start_time)) 182214 183215        if params.get('texture', False): 184216            mesh = FloaterRemover()(mesh) 185217            mesh = DegenerateFaceRemover()(mesh) 186218            mesh = FaceReducer()(mesh, max_facenum=params.get('face_count', 40000)) 187219            mesh = self.pipeline_tex(mesh, image) 188220 189-        with tempfile.NamedTemporaryFile(suffix='.glb', delete=False) as temp_file: 221+        type = params.get('type', 'glb') 222+        with tempfile.NamedTemporaryFile(suffix=f'.{type}', delete=True) as temp_file: 190223            mesh.export(temp_file.name) 191224            mesh = trimesh.load(temp_file.name) 192-            temp_file.close() 193-            os.unlink(temp_file.name) 194-            save_path = os.path.join(SAVE_DIR, f'{str(uid)}.glb') 225+            save_path = os.path.join(SAVE_DIR, f'{str(uid)}.{type}') 195226            mesh.export(save_path) 196227 197228        torch.cuda.empty_cache() 198229        return save_path, uid 199230 200231 201232app = FastAPI() 233+from fastapi.middleware.cors import CORSMiddleware 234+ 235+app.add_middleware( 236+    CORSMiddleware, 237+    allow_origins=["*"],  # 你可以指定允许的来源 238+    allow_credentials=True, 239+    allow_methods=["*"],  # 允许所有方法 240+    allow_headers=["*"],  # 允许所有头部 241+) 202242 203243 204244@app.post("/generate") @@ -260,14 +300,17 @@ async def status(uid: str):
 260300if __name__ == "__main__": 261301    parser = argparse.ArgumentParser() 262302    parser.add_argument("--host", type=str, default="0.0.0.0") 263-    parser.add_argument("--port", type=int, default=8081) 264-    parser.add_argument("--model_path", type=str, default='tencent/Hunyuan3D-2') 303+    parser.add_argument("--port", type=str, default="8081") 304+    parser.add_argument("--model_path", type=str, default='tencent/Hunyuan3D-2mini') 305+    parser.add_argument("--tex_model_path", type=str, default='tencent/Hunyuan3D-2') 265306    parser.add_argument("--device", type=str, default="cuda") 266307    parser.add_argument("--limit-model-concurrency", type=int, default=5) 308+    parser.add_argument('--enable_tex', action='store_true') 267309    args = parser.parse_args() 268310    logger.info(f"args: {args}") 269311 270312    model_semaphore = asyncio.Semaphore(args.limit_model_concurrency) 271313 272-    worker = ModelWorker(model_path=args.model_path, device=args.device) 314+    worker = ModelWorker(model_path=args.model_path, device=args.device, enable_tex=args.enable_tex, 315+                         tex_model_path=args.tex_model_path) 273316    uvicorn.run(app, host=args.host, port=args.port, log_level="info")

混元3D-2-Turbo：在4090显卡上，约1秒内即可生成高质量的快速形状。 Hunyuan3D-2-Turbo: fast high-quality shape generation in ~1s on a 4090

混元3D-2-Turbo：在4090显卡上，约1秒内即可生成高质量的快速形状。
Hunyuan3D-2-Turbo: fast high-quality shape generation in ~1s on a 4090