使用 JavaScript 进行统计物理的逆重力
Using Antigravity for Statistical Physics in JavaScript

原始链接: https://christopherkrapu.com/blog/2025/antigravity-stat-mech/

作者探讨了近期大型语言模型(LLM)的进步,以及这如何改变了他们个人的“基准”测试。过去,在晦涩、有记录的个人身上测试模型很有启发性,但现在模型很容易通过。他们转向了更复杂的任务:生成统计物理概念的Javascript可视化图。 作者使用谷歌的Antigravity IDE(由Gemini 3 Pro提供支持),成功地在一个小时内创建了铁磁性的Ising模型的可视化图。该模型准确地生成了数学框架和使用随机扫描吉布斯采样器的模拟。 虽然OpenAI和Anthropic的先前模型也表现良好,但Antigravity的DOM检索功能被证明特别有用。作者认为一些UI元素是不必要的,但总体上赞扬了Antigravity/Windsurf团队和谷歌员工的工作,强调了LLM能力的显著飞跃。

黑客新闻 新 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交 登录 使用 JavaScript 中的反重力进行统计物理 (christopherkrapu.com) 3 分,由 ckrapu 1 小时前发布 | 隐藏 | 过去 | 收藏 | 讨论 指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请 YC | 联系 搜索:
相关文章

原文

I like learning about the hidden benchmarks that everyone seems to bring out when a new large language model drops. Mine used to be asking the model about obscure but well-documented people on the internet like family members or acquaintances in the sciences or with IMDB credits. Since ~late 2024, most models are nailing that one so it’s not as interesting. Instead, I’ve moved onto Javascript-based visualizations. of statistical physics

Since Gemini 3 and Google’s Antigravity IDE were released recently (and yes, I am aware it is basically Windsurf), I wanted to give it a try with an easy one - the Ising model of ferromagnetism.

Here’s what Antigravity with Gemini 3 Pro cooked up in an hour:

A mathematical model of ferromagnetism in statistical mechanics. The grid consists of discrete variables (spins) that can be in one of two states (+1 or -1).

H(σ) = -J ∑<ij> σiσj - h ∑j σj

Simulation: This visualization uses a Random Scan Gibbs Sampler. In each step, a single spin is chosen at random and updated based on the Boltzmann distribution determined by its neighbors and the external field.

Named after physicist Ernst Ising, who solved the 1D model in his 1924 thesis.

I’ve gotten decent results out of the last crop of OpenAI and Anthropic models, but the Chrome browser extension for retrieving the DOM really helped too. It’s a great feature, and I expect Cursor to have something similar soon. I think some of the other UI features like showing subtasks and intermediate steps were a little unnecessary. Overall, great work by the former Windsurf team and the other G staff!

联系我们 contact @ memedata.com