It's all a blur

Original link: https://lcamtuf.substack.com/p/its-all-a-blur

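## Image blurring is not a safe form of redaction

Despite widespread advice, simply blurring an image does not reliably hide information. Blurring is essentially an averaging of pixel values, which looks like it discards data, yet it is often *reversible*. A basic "moving average" blur can be undone with simple arithmetic: because neighboring averages share most of their input pixels, the original pixel values can be reconstructed from the blurred ones, even with a large averaging window, and the recovered image can be surprisingly detailed. A simple 1D blur is easy to invert, but applying it in 2D introduces significant noise because of quantization. That noise can be reduced by weighting the original pixel value slightly more heavily during averaging, which produces an "adversarial" blur. Notably, a great deal of detail can still be recovered after the blurred image has been saved in a lossy format such as JPEG, until the compression level becomes too extreme. The takeaway is that blurring as commonly implemented does not truly hide information, and stronger redaction techniques are needed.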

Hacker News: It's all a blur (lcamtuf.substack.com), 16 points, posted by zdw 1 hour ago

Original text

If you follow information security discussions on the internet, you might have heard that blurring an image is not a good way of redacting its contents. This is supposedly because blurring algorithms are reversible.

But then, it’s not wrong to scratch your head. Blurring amounts to averaging the underlying pixel values. If you average two numbers, there’s no way of knowing if you’ve started with 1 + 5 or 3 + 3. In both cases, the arithmetic mean is the same and the original information appears to be lost. So, is the advice wrong?

Well, yes and no! There are ways to achieve non-reversible blurring using deterministic algorithms. That said, in many cases, the algorithm preserves far more information than would appear to the naked eye — and does it in a pretty unexpected way. In today’s article, we’ll build a rudimentary blur algorithm and then pick it apart.

If blurring is the same as averaging, then the simplest algorithm we can choose is the moving mean. We take a fixed-size window and replace each pixel value with the arithmetic mean of n pixels in its neighborhood. For n = 5, the process is shown below:

Moving average as a simple blur algorithm.

Note that for the first two cells, we don’t have enough pixels in the input buffer. We can use fixed padding, “borrow” some available pixels from outside the selection area, or simply average fewer values near the boundary. Either way, the analysis doesn’t change much.
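To make the moving mean concrete, here is a minimal Python sketch of a 1D blur. It assumes the fixed-padding option for the boundary; the function name `blur_moving_mean`, the `pad` parameter, and the sample pixel values are illustrative choices, not something taken from the article.

```python
def blur_moving_mean(img, n=5, pad=0.0):
    """Blur a 1D list of pixel values with a centered moving mean of width n.

    Positions outside the image are filled with the constant `pad`
    (the fixed-padding option; the other boundary options are not shown).
    """
    if n % 2 != 1:
        raise ValueError("n must be odd so the window can be centered")
    half = n // 2
    padded = [pad] * half + list(img) + [pad] * half
    # Output pixel x averages padded[x .. x+n-1], i.e. img(x-half) .. img(x+half).
    return [sum(padded[x:x + n]) / n for x in range(len(img))]


pixels = [10, 20, 30, 40, 50, 60, 70]  # made-up sample row
blurred = blur_moving_mean(pixels, n=5, pad=0.0)
print(blurred)  # [12.0, 20.0, 30.0, 40.0, 50.0, 44.0, 36.0]
```

The first two and last two outputs are pulled toward the padding value, which is the boundary effect described above.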

Let’s assume that we’ve completed the blurring process and no longer have the original pixel values. Can the underlying image be reconstructed? Yes, and it’s simpler than one might expect. We don’t need big words like “deconvolution”, “point spread function”, “kernel”, or any scary-looking math.

We start at the left boundary (x = 0). Recall that we calculated the first blurred pixel by averaging the following pixels in the original image:

\(blur(0) = {img(-2) \ + \ img(-1) \ + \ img(0) \ +\ img(1)\ +\ img(2) \over 5}\)

Next, let’s have a look at the blurred pixel at x = 1. Its value is the average of:

\(blur(1) = {img(-1)\ +\ img(0)\ +\ img(1)\ +\ img(2)\ +\ img(3) \over 5}\)

We can easily turn these averages into sums by multiplying both sides by the number of averaged elements (5):

\(\begin{align} 5 \cdot blur(0) &= img(-2) + \underline{img(-1) + img(0) + img(1) + img(2)} \\ 5 \cdot blur(1) &= \underline{img(-1) + img(0) + img(1) + img(2)} + img(3) \end{align} \)

Note that the underlined terms repeat in both expressions; this means that if we subtract the expressions from each other, we end up with just:

\(5 \cdot blur(1) - 5 \cdot blur(0) = img(3) - img(-2) \)
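The identity can be checked numerically. The sketch below assumes a padding constant of 0 and a made-up pixel row; `img_at` and `blur_at` are hypothetical helpers that mirror the img(x) and blur(x) notation used in the equations.

```python
def img_at(img, x, c=0.0):
    """Return img(x), or the padding constant c for positions outside the image."""
    return img[x] if 0 <= x < len(img) else c


def blur_at(img, x, n=5, c=0.0):
    """Return blur(x): the mean of the n pixels centered on x."""
    half = n // 2
    return sum(img_at(img, i, c) for i in range(x - half, x + half + 1)) / n


pixels = [10, 20, 30, 40, 50, 60, 70]  # made-up sample row
c = 0.0                                # assumed padding value

lhs = 5 * blur_at(pixels, 1, c=c) - 5 * blur_at(pixels, 0, c=c)
rhs = img_at(pixels, 3, c) - img_at(pixels, -2, c)
print(lhs, rhs)  # 40.0 40.0
```

The four shared terms cancel, so the difference between two neighboring sums depends only on the pixel entering the window and the pixel leaving it.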

The value of img(-2) is known to us: it’s one of the fixed padding pixels used by the algorithm. Let’s shorten it to c. We also know the values of blur(0) and blur(1): these are the blurred pixels that can be found in the output image. This means that we can rearrange the equation to recover the original input pixel corresponding to img(3):