光谱实验室发布 SGS-1:首个用于结构化 CAD 的生成模型。
Spectral Labs releases SGS-1: the first generative model for structured CAD

原始链接: https://www.spectrallabs.ai/research/SGS-1

## SGS-1:用于3D几何生成的新基础模型 SGS-1是一种新型AI模型,能够从图像或现有3D网格生成完全可制造且参数化的3D几何体,输出可编辑的STEP格式CAD文件。与现有生成模型不同,SGS-1产生的结果准确,适用于实际工程任务——通过其根据部分上下文和描述设计滚子和输送机组件的支架的能力得到证明。 评估表明,SGS-1在性能上显著优于GPT-5和HoLa等现有模型,展现出更强的空间理解能力,并始终生成可用的设计。虽然其他模型在处理中等复杂度的形状时就面临困难,但SGS-1能够成功创建各种参数化设计的几何体。 SGS-1还可以将草图、工程图,甚至“哑”3D文件(如STL)转换为参数化CAD数据,从而自动化逆向工程流程。目前的局限性包括难以处理有机形状、薄结构以及单步生成完整组件——这些是未来开发的目标。目前提供研究预览版供测试。

## Spectral Labs 的 SGS-1:生成式 CAD 面临质疑 Spectral Labs 最近发布了 SGS-1,这是一种生成模型,旨在根据提示创建 CAD 设计,并输出 STEP 文件。该发布引发了争论,许多用户质疑其核心主张,特别是关于“参数化”设计的部分。 初步测试显示,生成的設計存在显著不准确之处——尺寸错误、功能缺失以及非标准几何形状,例如非圆形孔和未对齐的元素。批评者指出,STEP 文件本身缺乏传统 CAD 软件中常见的参数化特征历史记录,从而对易于编辑的说法提出了质疑。 Spectral Labs 澄清了他们对“参数化”的定义,指的是使用带有参数的原始几何体进行内部表示,然后转换为 STEP 文件输出。他们承认存在问题,并承诺在 SGS-2 中进行改进,包括特征树和更丰富的输入条件。 这场讨论凸显了人工智能在复杂工程领域面临的挑战,强调了在仅仅生成 3D 形状之外,需要可制造且准确的设计。虽然潜力令人兴奋,尤其是在简化原型设计和 3D 打印方面,但用户在看到可证明的改进之前仍然持怀疑态度。
相关文章

原文

SGS-1 in Fusion360 CAD software creating brackets for a roller assembly.

Today we are announcing SGS-1, a foundation model that can generate fully manufacturable and parametric 3D geometry. You can try a research preview of SGS-1 here.

Given an image or a 3D mesh, SGS-1 can generate CAD B-Rep parts in STEP format. Unlike all other existing generative models, SGS-1 outputs are accurate and can be edited easily in traditional CAD software.

Overview of SGS-1 - users can provide an image or “dumb” 3D file, and get back a parametric B-Rep file that can be easily edited to match specific dimensions

SGS-1 shows strong general results, producing much more complex and diverse CAD shapes than existing methods.

Illustrative results from SGS-1

SGS-1 can be used for real-world engineering tasks. In the below example, SGS-1 is used to design a bracket for a roller assembly from partial context and a text description (additional details below in Generating Parametric Geometry in Assembly Context section).

Bracket designed by SGS-1 as part of a simple roller assembly

Bracket designed by SGS-1 as part of a complex conveyor assembly

Results and comparing SGS-1 to prior models

We compare SGS-1 to SOTA multimodal reasoning LLMs and open-source image-to-CAD models: GPT-5 thinking, a large reasoning model by OpenAI that can produce CadQuery code to represent parametric geometry, and HoLa, a 205M parameter latent diffusion model with 181M parameter VAE that generate B-Rep geometry conditioned on a single input image. We develop a benchmark set of 75 images depicting medium to high complexity parametric geometry, sourced from CAD image renders of various styles, engineering sketches, and images generated by generative AI models. Model performance is evaluated by successful/failed creation of a single valid watertight solid that is an accurate representation of the input image using distance metrics (Success Ratio).

Quantitative evaluations

We run each model 10 times and show scores for all 10 runs, as well as for the best output of the 10. Although GPT-5 and HoLa BRep can attain non-zero performance on the easiest images, SGS-1 is the best performing model with at least one success for all but the most complex objects.

Outputs from the SOTA large reasoning model (GPT-5) demonstrate a clear lack of spatial understanding, producing outputs that are unusable or too simple to actually be useful. We use both SGS-1 and GPT-5 to generate the parametric geometry for the rail mount from the input image, in order to produce the desired target complete assembly.

Rail mount image provided to SGS-1 and GPT-5.
SGS-1 accurately captures geometric features, demonstrating correct spatial and geometric understanding.
GPT-5 fails to produce most features and incorrectly places others, showing a lack of spatial intelligence.

SGS-1 accurately represents the geometry and can be plugged into an assembly context, while the output from the large reasoning model is missing core spatial features.

This enables quick and correct integration into the assembly. As the output is parametric, dimensions can easily be adjusted.
The large reasoning model output is completely unusable within the assembly. There is no way to parametrically edit the output to represent the correct geometry - the designer must start from scratch.

Generating Parametric Geometry in Assembly Context

With SGS-1, you can create new parametric geometry within your current assembly context. In this example, SGS-1 takes in a partial CAD assembly and a text description/image of a bracket, and produces a 3D design for a bracket that is feasible for the context.

First, render the partial assembly and come up with a text description of the parts you want to add. Next, run it through SGS-1, which will output a parametric B-Rep in the form of a downloadable STEP fileFinally, import the STEP file into your partial assembly and adjust dimensions until the part fits correctly into the assembly

SGS-1 is capable of generating diverse designs for tasks like this - several bracket designs created by SGS-1 are shown below:

Converting Sketches and Engineering Drawings to B-Rep

SGS-1 can be used to convert simple freehand sketches and engineering drawings into geometry that you can work in in your CAD editor. In this example, we run sketches and drawings through SGS-1 to create parametric geometry.

Use SGS-1 to transform sketches and drawings into 3D CAD files

This works well on simple hand sketches, enabling powerful design workflows.

This also works on structured engineering drawings.

Automating Reverse Engineering and STL to STEP File Conversion

SGS-1 can be used to convert scans and standalone STL or other mesh files to parametric STEP files without any human input, automating reverse engineering of many shapes.

Use SGS-1 to convert dumb 3D representations to parametric geometry

Limitations

SGS-1 is designed to generate parametric 3D geometry for engineering use cases, and struggles when tasked with generating creative assets and organic shapes with complex curvature. In addition, SGS-1 has a limited 3D resolution and struggles with generating very thin structures. Finally, SGS-1 cannot create full assemblies in one shot. We plan to address these limitations with our next model generation.

Next Steps

SGS-1 represents a significant step forward for foundation models that can generate 3D geometry for engineering tasks. We plan to continue pushing forward the frontier, by training models that can engineer physical systems of increasing complexity. The next generation of models will be natively multimodal, support larger and more complex spatial context, and will be capable of performing more advanced physical reasoning through longer range planning. As we continue to scale up these models, we are excited about scaling up reinforcement learning using physical simulation feedback, which will unlock new physical reasoning capabilities for our models.

If you are interested in deploying SGS-1 or collaborating on research, please contact us through this form.

We are also hiring! Our team is composed of top AI researchers and engineers with previous experience at institutions such as Autodesk Research, Samsung Research, CMU, and Meta. If you're interested in our work and mission, please get in touch.

联系我们 contact @ memedata.com