牺牲质量求速度:光标AI在开源项目中的应用研究
Speed at the cost of quality: Study of use of Cursor AI in open source projects

原始链接: https://arxiv.org/abs/2511.04427

这项研究调查了使用LLM驱动的编码助手Cursor对开源软件项目的影响。通过比较采用Cursor的项目与匹配的对照组,研究人员发现开发速度最初有显著提升,但这种提升是暂时的。 该研究揭示了一种权衡:虽然Cursor在短期内可以加速编码,但同时也会导致代码复杂性和静态分析警告的长期增加——这些是潜在质量问题的指标。进一步分析表明,这些质量下降最终会随着时间的推移导致开发速度减慢。 研究结果表明,虽然Cursor等AI编码工具可以提供立竿见影的生产力提升,但优先保证质量至关重要。该研究强调需要在这些工具和工作流程中更好地集成质量检查,以避免长期并发症并保持可持续的开发速度。

最近的一项研究(arxiv.org)调查了Cursor AI对开源项目的影响,发现开发速度加快的同时,代码警告和复杂性也增加了。然而,Hacker News上的评论员质疑该研究的方法论,认为它没有考虑到代码库规模扩大带来的自然复杂性增加。 一位用户指出,AI工具*降低*了复杂性的成本,能够更快地理解、测试和重构复杂代码——即使AI最初*创建*了这种复杂性。另一位评论员强调了AI发展的快速步伐,质疑2025年4月数据的相关性。 最后一条评论强调,代码质量差会带来超越开发者时间的深远后果,可能导致法律问题和信任丧失。讨论的中心在于,AI辅助编码是否以牺牲可维护性和长期质量为代价来优先速度。
相关文章

原文

View a PDF of the paper titled Speed at the Cost of Quality: How Cursor AI Increases Short-Term Velocity and Long-Term Complexity in Open-Source Projects, by Hao He and 4 other authors

View PDF HTML (experimental)
Abstract:Large language models (LLMs) have demonstrated the promise to revolutionize the field of software engineering. Among other things, LLM agents are rapidly gaining momentum in software development, with practitioners reporting a multifold increase in productivity after adoption. Yet, empirical evidence is lacking around these claims. In this paper, we estimate the causal effect of adopting a widely popular LLM agent assistant, namely Cursor, on development velocity and software quality. The estimation is enabled by a state-of-the-art difference-in-differences design comparing Cursor-adopting GitHub projects with a matched control group of similar GitHub projects that do not use Cursor. We find that the adoption of Cursor leads to a statistically significant, large, but transient increase in project-level development velocity, along with a substantial and persistent increase in static analysis warnings and code complexity. Further panel generalized-method-of-moments estimation reveals that increases in static analysis warnings and code complexity are major factors driving long-term velocity slowdown. Our study identifies quality assurance as a major bottleneck for early Cursor adopters and calls for it to be a first-class citizen in the design of agentic AI coding tools and AI-driven workflows.
From: Hao He [view email]
[v1] Thu, 6 Nov 2025 15:00:51 UTC (265 KB)
[v2] Thu, 13 Nov 2025 15:51:45 UTC (265 KB)
[v3] Mon, 26 Jan 2026 03:02:33 UTC (309 KB)
联系我们 contact @ memedata.com