picol：一个500行代码的Tcl解释器

picol：一个500行代码的Tcl解释器
picol: A Tcl interpreter in 500 lines of code

## Picol：一个微型的Tcl类解释器 Picol是一个500行的C语言解释器，于2007年发布，设计目的是作为学习工具和实践C语言编程风格的演示。它不像追求最小的代码量，而是以清晰和功能为目标——能够运行非平凡的程序，而不仅仅是“Hello World！”。该解释器具有受Tcl启发的语法，包括字符串插值（例如，`set a "pu" set b {ts} $a$b`）、具有适当作用域的用户自定义过程，以及`if`、`while`（带有`break`和`continue`）和递归等控制流结构。基本的算术运算（+、-、*、/）和`set`和`puts`等命令也受支持。 Picol的一个关键方面是其手工编写的解析器，它占据了大约250行代码。它识别标记并构建一个解析结构，该结构由`picolEval`函数用于执行程序。变量和命令替换在评估期间处理，利用调用帧来管理过程作用域。 Picol在GitHub上可用，作为解释器设计和解析器实现的存档示例。可以使用`gcc -O2 -Wall -o picol picol.c`编译它，并使用`picol filename.tcl`运行它。

## 微型 Tcl 解释器引起关注由 Redis 作者 antirez 创建的一个新的 Tcl 解释器 **picol** 已发布，代码量惊人的简洁，仅有 500 行。该项目在 Hacker News 上引发了关于 Tcl 当前用法的讨论。虽然 Python 和 Lua 是流行的选择，但 Tcl 仍然是**ASIC/FPGA 设计和仿真**等专业领域中的关键脚本语言，充当工具脚本的“通用语言”。虽然 Python 正在缓慢地占据优势，但通常被认为在这些任务中速度较慢。有趣的是，antirez 本身就使用 Tcl 来进行 Redis 测试套件。其他人发现它对特定任务很有用，例如在 Windows 上访问 ODBC。相关的项目 **JimTcl** 提供了更多功能，具有相似的小型足迹，由同一作者维护和扩展。

原文

Picol is a Tcl-alike interpreter in 500 lines of code that I released 15th of March 2007. Recentely I looked at the source code and realized this was a better C programming example compared to what I recalled, so I'm putting this on GitHub to archive it, together with the main points of the original article.

When I built this code, I had some rule in mind:

I wanted to use more or less my normal C style. In Picol you'll find normal C spacing and even comments.
I wanted to write an interpreter with a design similar to a real one. One of the few useful things you can do with Picol is to learn how to write a Tcl interpreter if you are a newbie programmer, I guess, so the point was to write a simple to understand program, not just a short program.
The resulting interpreter should be able to run some kind of non trivial program: to just set few vars and print hello world was not an option.

The resulting interpreter: Picol

The parser is very similar to the Tcl one, Picol supports interpolation as well, for example you can write:

set a "pu"
set b {ts}
$a$b "Hello World!"

Note that Picol has an interactive shell! so just launch it without arguments to start to play (to compile the code use gcc -O2 -Wall -o picol picol.c).

To run a program stored in a file, use: picol filename.tcl.

Probably the parser could be rewritten in order to take less space, currently it takes almost 250 lines of code: this is too much and leaves little room for all the rest. On the other side, it's a decent example about writing parsers by hand.

A Raw list of the supported features:

Interpolation, as seen above. You can also write "2+2 = [+ 2 2]" or "My name is: $foobar".
Procedures, with return. Like Tcl if return is missing the result of the last command executed is returned.
If, if .. else .., while with break and continue.
Recursion.
Variables inside procedures are limited in scope like Tcl, i.e. there are real call frames in Picol.
The following other commands: set + - * / == != > < >= <= puts.

This is an example of programs Picol can run:

proc fib {x} {
    if {== $x 0} {
        return 0
    }
    if {== $x 1} {
        return 1
    }
    return [+ [fib [- $x 1]] [fib [- $x 2]]]
}

puts [fib 20]
that of course will output fib(20). Another example:
proc square {x} {
    * $x $x
}

Or:

set a 1
while {<= $a 10} {
    if {== $a 5} {
        puts {Missing five!}
        set a [+ $a 1]
        continue
    }
    puts "I can compute that $a*$a = [square $a]"
    set a [+ $a 1]
}

It's pretty straightforward, the first important part you see in the source code is an hand written parser. The main function of the parser is picolGetToken that just calls functions able to parse the different parts of a Tcl program and return in the parsing structure the type of the token and start/end pointers in order to extract it.

This parsing function is in turn used by picolEval in order to execute the program. Every token is used either to form a new argument if a separator token was found before, or concatenated to the last argument (this is how interpolation is performed in Picol). Once an EOL (end of line) token is returned picolEval will call the command looking it up in a linked list of commands stored inside the interpreter structure.

Variables and commands substitution is performed by picolEval itself. The parser is able to return variables and commands tokens already stripped by $ and [], so all it's required to do is to lookup the variable in the call frame and substitute the value with the token, or to recursively call picolEval if it's a command substitution, using the result instead of the original token.

Commands are described by a name and a pointer to a C function implementing the command. In the command structure there is also a private data void pointer used in order to store data private to the command. This makes you able to implement multiple Picol commands using a single C function. User defined procedures are just like commands, but they are implemented by passing as private data the argument list and the body of the procedure, so a single C function is able to implement all the existing user defined procedures.

Procedures call is trivial. The interpreter structure contains a call frame structure having more or less just a pointer to a liked list of variables (that are in turn structures with two fileds: name and value). When a procedure is called a new call frame is created and put at the top of the old one. When the procedure returns the top call frame is destroyed.

Inside every large program there is a small program trying to get out -- Sir Tony Hoare.

picol：一个500行代码的Tcl解释器 picol: A Tcl interpreter in 500 lines of code

The resulting interpreter: Picol

picol：一个500行代码的Tcl解释器
picol: A Tcl interpreter in 500 lines of code