I got paid minimum wage to solve an impossible problem

Original link: https://tiespetersen.substack.com/p/i-got-paid-minimum-wage-to-solve

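## The Trap of Over-Optimization: A Grand Tale

A computer science student, asked to sweep the floor of a supermarket, could not resist applying his skills to "optimize" the process. He turned the floor plan into a grid, built a visual editor, and wrote a C++ path optimizer using simulated annealing, a sophisticated algorithm designed to find efficient routes. The first "optimized" path, however, was a chaotic tangle of sharp turns, completely impractical for a human sweeper even though it was the shortest by distance. This exposed a key flaw: he had optimized the *wrong* metric. Distance matters less than maneuverability and sanity. Adding a "turn penalty" to the algorithm produced a more realistic, more walkable path, illustrating the trade-off between pure efficiency and usability. The experience became a broader lesson: an algorithm can perfectly achieve the *wrong* goal. Just as social media algorithms maximize engagement (often at the expense of truth and well-being), or LLMs favor confident-sounding answers over accuracy, optimizing a metric that is easy to measure does not guarantee a good outcome. What matters is not *how* you optimize but *what* you optimize. In the end, the student realized that technical perfection is useless when it solves the wrong problem, and that sometimes sweeping the floor like a normal person is the best approach.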

Original text

Sweeping the entire Albert Heijn floor. Sounds simple. And should’ve been simple.

But I’m a Computer Science student, with a problem: I can’t stop trying to optimize things that (probably) don’t need optimizing.

So instead of just doing my job and, well… sweeping… I did what any “reasonable” person would do: I turned the supermarket floor plan into a grid graph, built a visual editor and wrote a C++ path optimizer using simulated annealing.

But before we dive into how this went spectacularly wrong, and how this made me realize how this makes everyone miserable, I need you to answer a quick question:

If you were to take over my job for one day (I wouldn’t recommend it but hypothetically speaking) and needed to sweep the entire Albert Heijn floor, would you take path A or B?

Path A (top) and path B (bottom).

Seriously. Look at them. Which one seems more efficient for sweeping a supermarket floor?

If you picked path A: congratulations, you think like an algorithm and are most likely a robot. (Good luck with CAPTCHA questions.)

But you are technically right. Path A is shorter by distance. It is, however, absolutely useless.

Look at those turns. Actually imagine for a second that you would walk around taking those turns. You’d look insane, like some Roomba having a seizure.

Path A is what happens when you optimize for the wrong thing.

Which, spoiler alert, is the entire point of this story. But we’ll get there, let me first explain how we got here:

First, I took the Albert Heijn floor plan and converted it into a grid. Each tile is either empty (should be swept) or an obstacle (wall, checkout counter, yoghurt package somebody threw on the ground).
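To make the grid idea concrete, here is a minimal sketch of how such a floor plan could be stored. The `Tile`, `Grid`, `parseGrid` and `countSweepable` names and the `#`-for-obstacle text format are all made up for illustration; the post does not show the actual data structures or export format.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical tile types: the post only distinguishes "sweep me" from "obstacle".
enum class Tile { Empty, Obstacle };

struct Grid {
    std::size_t width = 0;
    std::size_t height = 0;
    std::vector<Tile> tiles;  // row-major storage: index = y * width + x

    Tile at(std::size_t x, std::size_t y) const {
        assert(x < width && y < height);
        return tiles[y * width + x];
    }
    bool sweepable(std::size_t x, std::size_t y) const {
        return at(x, y) == Tile::Empty;
    }
};

// Parse an assumed text format: '#' is an obstacle, anything else is floor.
// All rows are expected to have the same length.
Grid parseGrid(const std::vector<std::string>& rows) {
    Grid g;
    g.height = rows.size();
    g.width = rows.empty() ? 0 : rows.front().size();
    g.tiles.reserve(g.width * g.height);
    for (const std::string& row : rows) {
        assert(row.size() == g.width);
        for (char c : row) {
            g.tiles.push_back(c == '#' ? Tile::Obstacle : Tile::Empty);
        }
    }
    return g;
}

// Number of tiles that actually need sweeping.
std::size_t countSweepable(const Grid& g) {
    std::size_t n = 0;
    for (Tile t : g.tiles) {
        if (t == Tile::Empty) ++n;
    }
    return n;
}
```

Calling `parseGrid({"..#", "...", "#.."})` would give a 3 by 3 grid with two obstacles in opposite corners and seven tiles left to sweep.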

I built a visual editor in Processing (a Java tool for people who like making things look cool), so I could easily map out the store and export the resulting graph.

Converting the floorplan into the grid structure was therefore quite easy.

Grid floorplan of the Albert Heijn supermarket.

The tiling of the actual floor helped to subdivide the area into bite-sized chunks.

Subdivided floorplan using tile structure.

This could then easily be converted into a network structure (also called a graph), by interpreting each tile as a node and then connecting them to neighboring tiles.

Interpreting each tile as a node in a graph.

Resulting network of tiles.

As you can see, I allowed for horizontal and vertical movement, as well as diagonal movement (as long as you don’t fly through walls).

Final graph of the Albert Heijn.
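The tile-to-graph step could look roughly like the sketch below. It assumes the same made-up `#`-for-obstacle rows, and it reads "don't fly through walls" as "a diagonal step is only allowed when both tiles it squeezes between are free". That rule is a guess at what the editor does, and `TileGraph` and `buildGraph` are hypothetical names.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Adjacency list: adjacency[i] holds the node ids reachable from node i.
struct TileGraph {
    std::vector<std::vector<int>> adjacency;
    std::vector<int> nodeOfTile;  // node id per tile (row-major), -1 for obstacles
};

// '#' marks an obstacle, anything else is floor (an assumed format).
TileGraph buildGraph(const std::vector<std::string>& rows) {
    const int h = static_cast<int>(rows.size());
    const int w = h == 0 ? 0 : static_cast<int>(rows[0].size());
    auto isFree = [&](int x, int y) {
        return x >= 0 && y >= 0 && x < w && y < h && rows[y][x] != '#';
    };

    // First pass: give every free tile a node id.
    TileGraph tg;
    tg.nodeOfTile.assign(static_cast<std::size_t>(w) * static_cast<std::size_t>(h), -1);
    int next = 0;
    for (int y = 0; y < h; ++y) {
        assert(static_cast<int>(rows[y].size()) == w);
        for (int x = 0; x < w; ++x) {
            if (isFree(x, y)) tg.nodeOfTile[y * w + x] = next++;
        }
    }

    // Second pass: connect each free tile to its free neighbours (8 directions).
    tg.adjacency.resize(static_cast<std::size_t>(next));
    for (int y = 0; y < h; ++y) {
        for (int x = 0; x < w; ++x) {
            if (!isFree(x, y)) continue;
            for (int dy = -1; dy <= 1; ++dy) {
                for (int dx = -1; dx <= 1; ++dx) {
                    if (dx == 0 && dy == 0) continue;
                    if (!isFree(x + dx, y + dy)) continue;
                    // Assumed rule: no cutting corners around an obstacle.
                    const bool diagonal = dx != 0 && dy != 0;
                    if (diagonal && !(isFree(x + dx, y) && isFree(x, y + dy))) continue;
                    tg.adjacency[tg.nodeOfTile[y * w + x]].push_back(
                        tg.nodeOfTile[(y + dy) * w + (x + dx)]);
                }
            }
        }
    }
    return tg;
}
```

On an open 3 by 3 floor the center tile ends up with eight neighbours. Put an obstacle in the center instead and the top-middle tile keeps only its left and right neighbours, because both diagonals would clip the obstacle.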

The only thing to do next was to find a cycle through this network while making sure to visit all nodes (tiles). This would then be the solution to my sweeping problem.

(This problem is also called the Traveling Salesman Problem, see article for more details and why it is so hard to “solve”.)

Since it is computationally infeasible to find the best path in a graph of this size (the number of possible tours explodes as you add tiles), we have to resort to heuristics. Heuristics basically try to find a very good solution in a short time, instead of trying to find the perfect solution (which is more or less impossible).

So I implemented the path optimizer in C++.

The underlying heuristic algorithm: simulated annealing.

If you’re not familiar, simulated annealing is essentially trying a bunch of little changes (also called local moves).

At first, you just accept every little change (even if it makes the path worse), but throughout the algorithm you slowly get more picky and at the end only allow changes that strictly improve the path.

This is inspired by how metals cool down: you start with a high temperature (just trying different moves) to explore a lot, and then gradually cool down to settle into a low energy state (close to optimal).
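The "getting more picky" part is usually implemented with the Metropolis acceptance rule: a change that makes the path longer by `delta` is accepted with probability exp(-delta / temperature). The post does not say which rule or cooling schedule the optimizer used, so treat the sketch below as the textbook version. `acceptMove`, `cool` and the default `alpha` are illustrative assumptions.

```cpp
#include <cassert>
#include <cmath>

// Metropolis-style acceptance rule (an assumption, not confirmed by the post).
//   delta       : newLength - oldLength (negative means the path got shorter)
//   temperature : current temperature, must be > 0
//   u           : uniform random number in [0, 1), passed in to keep this testable
bool acceptMove(double delta, double temperature, double u) {
    assert(temperature > 0.0);
    if (delta <= 0.0) return true;              // improvements always pass
    return u < std::exp(-delta / temperature);  // worse moves pass only sometimes
}

// Geometric cooling: multiply the temperature by a constant factor each step.
// The default factor is an arbitrary illustrative value.
double cool(double temperature, double alpha = 0.999) {
    assert(alpha > 0.0 && alpha < 1.0);
    return temperature * alpha;
}
```

At a temperature of 100, a move that adds 1 to the length passes about 99% of the time. At a temperature of 0.01 the same move is practically never accepted, which is the "picky" end of the run.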

Simulated annealing slowly improving the path throughout many iterations.

Watch this gif. See how it starts chaotic and gradually settles into something stable? That’s simulated annealing doing its thing.

For the local move, I used the 2-opt move. You take two edges in your path, remove them, and reconnect them in a different way. If this tiny change makes the path better, keep it. If not, either keep it (if the temperature is still high) or discard it.
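A 2-opt move is easier to see in code than in words. The sketch below works on plain points with straight-line distances, which is a simplification of the supermarket graph, and `Point`, `twoOptDelta` and `applyTwoOpt` are names invented for this example. The trick is that removing two edges and reconnecting them is the same as reversing the stretch of the tour between them.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

struct Point { double x, y; };

double dist(const Point& a, const Point& b) {
    return std::hypot(a.x - b.x, a.y - b.y);
}

// Length of the closed tour that visits pts in the given order.
double tourLength(const std::vector<Point>& pts, const std::vector<int>& tour) {
    double total = 0.0;
    for (std::size_t k = 0; k < tour.size(); ++k) {
        total += dist(pts[tour[k]], pts[tour[(k + 1) % tour.size()]]);
    }
    return total;
}

// Change in length if edges (tour[i], tour[i+1]) and (tour[j], tour[j+1])
// are replaced by (tour[i], tour[j]) and (tour[i+1], tour[j+1]).
double twoOptDelta(const std::vector<Point>& pts, const std::vector<int>& tour,
                   std::size_t i, std::size_t j) {
    assert(i < j && j < tour.size());
    const Point& a = pts[tour[i]];
    const Point& b = pts[tour[i + 1]];
    const Point& c = pts[tour[j]];
    const Point& d = pts[tour[(j + 1) % tour.size()]];
    return dist(a, c) + dist(b, d) - dist(a, b) - dist(c, d);
}

// Apply the move: reversing tour[i+1 .. j] reconnects the two edges.
void applyTwoOpt(std::vector<int>& tour, std::size_t i, std::size_t j) {
    assert(i < j && j < tour.size());
    std::reverse(tour.begin() + static_cast<std::ptrdiff_t>(i) + 1,
                 tour.begin() + static_cast<std::ptrdiff_t>(j) + 1);
}
```

Take the four corners of a unit square visited in the order 0, 2, 1, 3, so the tour crosses itself along both diagonals. Applying the move with `i = 0` and `j = 2` reverses the middle two stops, uncrosses the tour, and brings the length down to 4. Computing `twoOptDelta` first means you only touch four distances instead of re-measuring the whole path.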

Then just do that 1 billion times. Or well… let your computer do that 1 billion times.
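Put together, the whole loop fits in a few dozen lines. This is a rough sketch and not the optimizer from the post: it assumes straight-line distances between points, a Metropolis acceptance rule and geometric cooling, and every name and parameter (`anneal`, `startTemp`, `alpha`, `seed`) is made up for illustration. It also remembers the best tour seen so far, so a late bad move cannot ruin the result.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <random>
#include <vector>

struct Point { double x, y; };

double dist(const Point& a, const Point& b) {
    return std::hypot(a.x - b.x, a.y - b.y);
}

// Length of the closed tour that visits pts in the given order.
double tourLength(const std::vector<Point>& pts, const std::vector<int>& tour) {
    double total = 0.0;
    for (std::size_t k = 0; k < tour.size(); ++k) {
        total += dist(pts[tour[k]], pts[tour[(k + 1) % tour.size()]]);
    }
    return total;
}

// Simulated annealing with random 2-opt moves. All parameters are illustrative.
std::vector<int> anneal(const std::vector<Point>& pts, std::vector<int> tour,
                        long iterations, double startTemp, double alpha,
                        unsigned seed) {
    assert(tour.size() == pts.size() && tour.size() >= 4);
    assert(startTemp > 0.0 && alpha > 0.0 && alpha < 1.0);
    const std::size_t n = tour.size();
    std::mt19937 rng(seed);
    std::uniform_int_distribution<std::size_t> pick(0, n - 1);
    std::uniform_real_distribution<double> unit(0.0, 1.0);

    double current = tourLength(pts, tour);
    std::vector<int> best = tour;
    double bestLength = current;
    double temperature = startTemp;

    for (long it = 0; it < iterations; ++it) {
        std::size_t i = pick(rng);
        std::size_t j = pick(rng);
        if (i > j) std::swap(i, j);
        if (i < j) {
            // Cost change of swapping edges (i, i+1) and (j, j+1).
            const Point& a = pts[tour[i]];
            const Point& b = pts[tour[i + 1]];
            const Point& c = pts[tour[j]];
            const Point& d = pts[tour[(j + 1) % n]];
            const double delta = dist(a, c) + dist(b, d) - dist(a, b) - dist(c, d);
            // Always take improvements; take worse moves with a
            // probability that shrinks as the temperature drops.
            if (delta <= 0.0 || unit(rng) < std::exp(-delta / temperature)) {
                std::reverse(tour.begin() + static_cast<std::ptrdiff_t>(i) + 1,
                             tour.begin() + static_cast<std::ptrdiff_t>(j) + 1);
                current += delta;
                if (current < bestLength) {
                    bestLength = current;
                    best = tour;
                }
            }
        }
        temperature *= alpha;  // geometric cooling
    }
    return best;
}
```

A call like `anneal(pts, startTour, 20000, 1.0, 0.999, 42u)` runs twenty thousand moves from a starting temperature of 1. Swap in a billion iterations and a slower cooling factor and you have the "go make coffee" version.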

After letting it run for a while, I got my first “optimized” path. Here’s what it came up with:

First “optimized” path.

Look at it. That path has more sharp turns than a Christopher Nolan movie. There is no way anybody is crazy enough to actually sweep like this. You would probably throw up afterwards.

Technically, it covers the entire floor. Technically, it’s (almost) the shortest sweeping path. Technically, it’s perfect.

There are some good parts to it, but practically, it’s absolutely useless.

The algorithm did exactly what I asked for. (Thankfully. Imagine if it just did something else entirely; that would be scary.)

I just asked it the wrong question.

I quickly realized I was optimizing for the wrong thing. Distance isn’t everything.