DEDA——追踪点提取、解码和匿名化工具包
DEDA – Tracking Dots Extraction, Decoding and Anonymisation Toolkit

原始链接: https://github.com/dfd-tud/deda

Deda是一个Python工具,用于分析和匿名化打印机跟踪点——那些编码打印机/打印信息的小黄点。它允许你解码扫描文档中的跟踪数据(无损压缩,推荐300 dpi),比较打印输出以识别不同的打印机,并提取点模式进行分析。 使用Deda,需要安装Python 3,然后通过pip安装Deda以及可选的Wand库(用于在匿名化过程中处理白色区域)。GUI可以通过`deda_gui`启动。常用功能包括解析打印数据(`deda_parse_print`),比较打印件(`deda_compare_prints`),提取点(`deda_extract_yd`),创建自定义点(`deda_create_dots`)以及匿名化扫描件(`deda_clean_document`)。对于打印匿名化,需要先生成特定打印机的掩码(`deda_anonmask_create`),然后在打印前将其应用于你的PDF(`deda_anonmask_apply`)。常见问题包括需要设置`PATH`、扫描设置问题以及`wand`库问题。请参考文档进行故障排除。

Hacker News 最新 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交 登录 DEDA – 追踪点提取、解码和匿名化工具 (github.com/dfd-tud) 12 分,来自 pavel_lishin,16 分钟前 | 隐藏 | 过去 | 收藏 | 讨论 加入我们,参加 6 月 16-17 日在旧金山举办的 AI 初创公司学校! 指导原则 | 常见问题 | 列表 | API | 安全 | 法律 | 申请 YC | 联系我们 搜索:
相关文章

原文

Document Colour Tracking Dots, or yellow dots, are small systematic dots which encode information about the printer and/or the printout itself. This process is integrated in almost every commercial colour laser printer. This means that almost every printout contains coded information about the source device, such as the serial number.

On the one hand, this tool gives the possibility to read out and decode these forensic features and on the other hand, it allows anonymisation to prevent arbitrary tracking.

If you use this software, please cite the paper: Timo Richter, Stephan Escher, Dagmar Schönfeld, and Thorsten Strufe. 2018. Forensic Analysis and Anonymisation of Printed Documents. In Proceedings of the 6th ACM Workshop on Information Hiding and Multimedia Security (IH&MMSec '18). ACM, New York, NY, USA, 127-138. DOI: https://doi.org/10.1145/3206004.3206019


  • Install Python 3
  • Install Deda

From PyPI: $ pip3 install --user deda

Or from current directory: $ pip3 install --user .

  • Optional requirement by deda_anonmask_apply (Unix and GNU/Linux only): $ pip3 install --user wand

Without Wand, pages containing white areas on images cannot be anonymised.


  • To open the GUI type: $ deda_gui

Tracking data can be read and sometimes be decoded from a scanned image. For good results the input shall use a lossless compression (e.g. png) and 300 dpi. Make sure to set a neutral contrast $ deda_parse_print INPUTFILE

2. Find a divergent printer in a set of scanned documents

$ deda_compare_prints INPUT1 INPUT2 [INPUT3] ...

3. Analysing an unknown tracking pattern

New patterns might not be recognised by parse_print. The dots can be extracted for further analysis.
$ deda_extract_yd INPUTFILE

4. Create your own tracking dots

If you want to create your own tracking dots matrix and add it to a pdf document, pass the contents as parameters (see deda_create_dots -h). $ deda_create_dots PDFINPUT

The calibration page ($ deda_anonmask_create -w) may be used as an input.

5. Anonymise a scanned image

This (mostly) removes tracking data from a scan:
$ deda_clean_document INPUTFILE OUTPUTFILE

6. Anonymise a document for printing
  • Save your document as a PDF file and call it DOCUMENT.PDF.

  • Print the testpage.pdf file created by
    $ deda_anonmask_create -w
    without any page margin.

  • Scan the document (300 dpi) and pass the lossless file to
    $ deda_anonmask_create -r INPUTFILE
    This creates 'mask.json', the individual printer's anonymisation mask.

  • Now apply the anonymisation mask:
    $ deda_anonmask_apply mask.json DOCUMENT.PDF This creates 'masked.pdf', the anonymised document. It may be printed with a zero page margin setting.

Check whether a masked page covers your printer's tracking dots by using a microscope. The mask's dot radius, x and y offsets can be customised and passed to deda_anonmask_apply as parameters.

Note that if DOCUMENT.PDF contains graphics with white or light coloured parts, these can only be masked if "wand" is installed (see above).


deda_parse_print: command not found

Possible solutions:

  • Install deda accordig to chapter 0
  • Execute $ export PATH="$PATH:$(python -c 'import site,os; print(os.path.join(site.USER_BASE, "bin"))')"
Deda does not recognise my tracking dots

Set up your scan program so that it does not eliminate the paper structure nor tracking dots by some threshold and check again. Remember that monochrome pages as well as inkjet prints might not contain tracking dots.

My printer does not print tracking dots. Can I hide this fact?

If there are really no tracking dots, you can either create your own ones (deda_create_dots) or print the calibration page (deda_anonmask_create -w) with another printer and use the mask for your own printer. You can use the anonymised version of the tracking dots or just copy them (deda_anonmask_create --copy). See chapters "Anonymise a document for printing" and "Create your own tracking dots".

Install Error: command 'x86_64-linux-gnu-gcc' failed with exit status 1

This may be caused by the eel dependency which is needed for the GUI. Try $ sudo apt-get install build-essential autoconf libtool pkg-config python3.6-dev gcc && pip3 install --user eel

wand.exceptions.PolicyError: attempt to perform an operation not allowed by the security policy PDF' @ error/constitute.c/IsCoderAuthorized/408

This is being caused by ImageMagick. Either remove Wand (pip3 uninstall wand) or add <policy domain="coder" rights="read | write" pattern="PDF" /> just before </policymap> in /etc/ImageMagick-*/policy.xml. See also https://stackoverflow.com/questions/52998331/imagemagick-security-policy-pdf-blocking-conversion.

联系我们 contact @ memedata.com