将 Rust 的核心和分配箱翻译给 Coq 进行形式验证
Translation of Rust's core and alloc crates to Coq for formal verification

原始链接: https://formal.land/blog/2024/04/26/translation-core-alloc-crates

研究人员正在致力于使用他们的工具 coq-of-rust,通过将 Rust 代码转换为形式化证明系统 Coq 来形式化验证 Rust 程序。 他们在处理 Rust 标准库中的原语(例如 Option::unwrap_or_default)时面临挑战。 他们没有在 Coq 中手动定义每个函数的行为,而是通过翻译 Rust 的 core 和 alloc crates 来自动化这个过程。 结果是生成了大量的 Coq 代码,但不幸的是,由于错误和命名冲突,它无法编译。 他们通过基于 impl 块的 where 子句使生成的模块名称更加唯一,解决了命名冲突问题。 该团队还报告了少量未编译文件,约占总文件的 4%。 尽管如此,他们认为进展很重要,因为他们现在可以在验证 Rust 程序时更好地信任标准库的形式化。 他们未来的目标包括简化证明过程。 有兴趣的各方可以通过 [email protected] 联系他们,了解有关正式验证其 Rust 项目的更多信息。 形式验证为关键应用程序提供针对软件错误的最有力保证。

Coq-of-rust 是一个旨在将 Rust 代码转换为 Coq 逻辑框架以进行形式验证的项目。 尽管是用 Rust 本身开发的,但创建者提出了一种通过反向工程 Coq 版本并验证其与 Rust 实现匹配来确保其正确性的方法。 通过提供经过验证正确的 Coq 版本,用户可以通过对照原始 Rust 实现检查 Coq 对应版本来确保其 Rust 代码的完整性。 尽管由于递归性质,该过程涉及一定程度的复杂性,但它增加了对 Rust 代码准确性的信心,特别是在金融或安全等关键行业的背景下。 此外,工业支持凸显了 coq-ofrus 等项目的潜在影响和重要性。
相关文章

原文

We continue our work on formal verification of Rust programs with our tool coq-of-rust, to translate Rust code to the formal proof system Coq. One of the limitation we had was the handling of primitive constructs from the standard library of Rust, like Option::unwrap_or_default or all other primitive functions. For each of these functions, we had to make a Coq definition to represent its behavior. This is both tedious and error prone.

To solve this issue, we worked on the translation of the core and alloc crates of Rust using coq-of-rust. These are very large code bases, with a lot of unsafe or advanced Rust code. We present what we did to have a "best effort" translation of these crates. The resulting translation is in the following folders:

Crab with a pen

A crab in a library

Initial run 🐥

An initial run of coq-of-rust on the alloc and core crates of Rust generated us two files of a few hundred thousands lines of Coq corresponding to the whole translation of these crates. This is a first good news, as it means the tool runs of these large code bases. However the generated Coq code does not compile, even if the errors are very rare (one every few thousands lines).

To get an idea, here is the size of the input Rust code as given by the cloc command:

  • alloc: 26,299 lines of Rust code
  • core: 54,192 lines of Rust code

Given that this code uses macros that we expand in our translation, the actual size that we have to translate is even bigger.

Splitting the generated code 🪓

The main change we made was to split the output generated by coq-of-rust with one file for each input Rust file. This is possible because our translation is insensitive to the order of definitions and context-free. So, even if there are typically cyclic dependencies between the files in Rust, something that is forbidden in Coq, we can still split them.

We get the following sizes as output:

  • alloc: 54 Coq files, 171,783 lines of Coq code
  • core: 190 Coq files, 592,065 lines of Coq code

The advantages of having the code split are:

  • it is easier to read and navigate in the generated code
  • it is easier to compile as we can parallelize the compilation
  • it is easier to debug as we can focus on one file at a time
  • it is easier to ignore files that do not compile
  • it will be easier to maintain, as it is easier to follow the diff of a single file

Fixing some bugs 🐞

We had some bugs related to the collisions between module names. These can occur when we choose a name for the module for an impl block. We fixed these by adding more information in the module names to make them more unique, like the where clauses that were missing. For example, for the implementation of the Default trait for the Mapping type:

#[derive(Default)]
struct Mapping<K, V> {

}

we were generating the following Coq code:

Module Impl_core_default_Default_for_dns_Mapping_K_V.

End Impl_core_default_Default_for_dns_Mapping_K_V.

We now generate:

Module Impl_core_default_Default_where_core_default_Default_K_where_core_default_Default_V_for_dns_Mapping_K_V.

with a module name that includes the where clauses of the impl block, stating that both K and V should implement the Default trait.

Here is the list of files that do not compile in Coq, as of today:

  • alloc/boxed.v
  • core/any.v
  • core/array/mod.v
  • core/cmp/bytewise.v
  • core/error.v
  • core/escape.v
  • core/iter/adapters/flatten.v
  • core/net/ip_addr.v

This represents 4% of the files. Note that in the files that compile there are some unhandled Rust constructs that are axiomatized, so this does not give the whole picture of what we do not support.

Example 🔎

Here is the source code of the unwrap_or_default method for the Option type:

pub fn unwrap_or_default(self) -> T
where
T: Default,
{
match self {
Some(x) => x,
None => T::default(),
}
}

We translate it to:

Definition unwrap_or_default (T : Ty.t) (τ : list Ty.t) (α : list Value.t) : M :=
let Self : Ty.t := Self T in
match τ, α with
| [], [ self ] =>
ltac:(M.monadic
(let self := M.alloc (| self |) in
M.read (|
M.match_operator (|
self,
[
fun γ =>
ltac:(M.monadic
(let γ0_0 :=
M.get_struct_tuple_field_or_break_match (|
γ,
"core::option::Option::Some",
0
|) in
let x := M.copy (| γ0_0 |) in
x));
fun γ =>
ltac:(M.monadic
(M.alloc (|
M.call_closure (|
M.get_trait_method (| "core::default::Default", T, [], "default", [] |),
[]
|)
|)))
]
|)
|)))
| _, _ => M.impossible
end.

We prove that it is equivalent to the simpler functional code:

Definition unwrap_or_default {T : Set}
{_ : core.simulations.default.Default.Trait T}
(self : Self T) :
T :=
match self with
| None => core.simulations.default.Default.default (Self := T)
| Some x => x
end.

This simpler definition is what we use when verifying code. The proof of equivalence is in CoqOfRust/core/proofs/option.v. In case the original source code changes, we are sure to capture these changes thanks to our proof. Because the translation of the core library was done automatically, we trust the generated definitions more than definitions that would be done by hand. However, there can still be mistakes or incompleteness in coq-of-rust, so we still need to check at proof time that the code makes sense.

Conclusion

We can now work on the verification of Rust programs with more trust in our formalization of the standard library. Our next target is to simplify our proof process, which is still tedious. In particular, showing that simulations are equivalent to the original Rust code requires doing the name resolution, introduction of high-level types, and removal of the side-effects. We would like to split these steps.

If you are interested in formally verifying your Rust projects, do not hesitate to get in touch with us at [email protected] 💌! Formal verification provides the highest level of safety for critical applications, with a mathematical guarantee of the absence of bugs for a given specification.

联系我们 contact @ memedata.com