努力与创意作品的 perceived quality 成超线性关系。
Why Effort Scales Superlinearly with the Perceived Quality of Creative Work

原始链接: https://markusstrasser.org/creative-work-landscapes.html

## 创造的碎形本质 马库斯·斯特拉瑟认为,创造性工作并非一个从想法到执行的线性过程,而是一种递归的、嵌套式的探索。他将常常感受到的“最后阶段的修改”重新定义为并非润色,而是*在日益受限的参数下进行的更高分辨率探索*。 随着质量的提高,“可接受范围”——能够*提升*作品的改变范围——会急剧缩小。这与验证所需的时间(反馈延迟)相结合,形成了一种“精度税”,即努力与感知质量不成比例地增加。本质上,实现精炼需要指数级更精确的修改。 这种动态因媒介而异。音乐需要微观层面的精确,而散文则更宽容。生成式人工智能目前运行在“宽阔盆地”区域,容易接受粗略的改变,但真正的技巧在于驾驭那些接近最佳峰值的缩小范围。熟练的练习,例如音乐排练,*缓存*解决方案,减少实时探索。最终,创造性努力并非关于执行计划,而是对改进进行持续、嵌套式的搜索。

一个由一篇关于努力与感知创造力质量之间超线性关系的文章引发的黑客新闻讨论,触及了创造性工作的迭代本质。最初的文章强调,大量的努力往往不被看到,但却能显著提升最终产品。 一位评论者随后质疑,在不同的创作领域——绘画、软件、建筑,甚至像水彩这样传统的艺术形式中,通常有多少工作被丢弃和重做。他们指出,实践范围从频繁的重新开始到拥抱快速、不完美的执行,与像“画50”系列这样逐步构建细节的系统方法形成对比。 这场讨论本质上探讨了不同创作过程中固有的不同程度的完善和修改,以及有多少“不可见”的努力促成了精美的最终结果。
相关文章

原文

· 1 min read · Send your thoughts via twitter or mail.

Abstract claim: The act of creation is fractal exploration–exploitation under optimal feedback control. When resolution increases the portion of parameter space that doesn't make the artifact worse (acceptance volume) collapses. Verification latency and rate–distortion combine into a precision tax that scales superlinearly with perceived quality.

When I make something good, I often spend most of my time making thousands of high-precision edits on an artifact that I thought should have been finished hours ago. Previously, I called this 'last-mile edits', but that was the wrong mental image.

"Last mile" implies executing a known plan with diminishing returns but "last mile" at one level just becomes "early exploration" at higher resolution at the next level. Instead of treating exploration (idea) and exploitation (execution) as temporally separated phases, they nest recursively. That nested search is where the effort goes.

Once you commit to D minor, this scene, that argument structure you've constrained the search space and now you search again within it.

Take some of my quicker five-minute, gestural sketches below. You'd think they break this nested search dynamic but with a closer look it becomes clear that I just front-loaded my taxes by caching motor heuristics.

Figure 1: A closer look shows the same set of practiced, comfortable gestures that click with my hand shape. They gravitate toward broad, confident circular strokes and shorter straight lines, just short enough to keep them stable. I'm executing cached heuristics, not exploring. I revert to face-like abstractions and focus on having the center hold. I do not like when *The Center Does Not Hold*.

Domains and modalities differ in how wide and forgiving their basins are and how quickly you can verify the edit (feedback latency). Music timing has a narrow basin at the micro-level (±20 ms can kill a groove) but can be more forgiving higher up: key and pitch changes can be interchangeable without loss of quality, not often though. Prose has a wide basin (many phrasings work). Abstract, contemporary art has extremely wide basin, so much so that nobody with any self-respect even bothers anymore. Renaissance paintings have more constraints and less distortion tolerance.

ModalityBasinVerifierSpeed
Text (prose)WideHuman readMinutes
CodeWide (design) / Narrow (syntax)Compiler/testsms-seconds
Music timingNarrowEar–body~20-40ms
Line drawingNarrowEye–hand~100ms

Let's take the following optimization landscape and assume it's for the process of writing a song. To make it simpler, let's constrain like this: We've written the lyrics and picked a BPM of 80.

The wider, more forgiving hill corresponds to choosing C major on the macro level, but there might be a higher, sharper peak in E minor that's trickier—i.e., it demands more precision edits.

Figure 2: Z-axis is quality (warmer = better). X and Y are arbitrary parameters.

Wide basins let coarse proposals land. This is where almost all generative AI outputs live and the oxygen is still plenty. Near a sharp peak, the acceptance volumea a We call it acceptance volume, but in the 3D example above, it’d be the area of a tiny slice of the hill. shrinks rapidly and you can’t reliably see micro-improvements without averaging more evidence or trials. The controller (often the hand) makes many tiny corrections after some latency. Rinse and repeat until the piece sits on a hard-to-vary peak.

That's why effort seems like it scales superlinearly as perceived quality rises. Judging the intermediate artifact takes more time and most edits (the search) make it worse. Geometrically bad edits become more likely and land you lower in the landscape.

Craft, then, is the slog of closing ever-less-perceivable gaps.

Don't bands sometimes record a banger song in an hour together?

The tower-climbing happened during practice (*muscle memory*), not recording. Jazz is closer to real-time exploration and mistakes are more accepted and expected.

Drawing takes forever because you're exploring AND refining simultaneously.

We don't "rehearse" a specific drawing, we solve a novel problem in real-time. There's no cached motor sequence to execute.

BibTeX Citation
@misc{strasser2025,
  author = {Strasser, Markus},
  title = {Why Effort Scales Superlinearly with the Perceived Quality of Creative Work},
  year = {2025},
  url = {https://markusstrasser.org/},
  note = {Accessed: 2025-11-11}
}
联系我们 contact @ memedata.com