Using the `propverify` library to verify Rust programs
The goal of the tools and library in this repository is to let you verify interesting things about non-trivial programs. Unfortunately, interesting/non-trivial programs are too large for introducing you to a tool so, for now, we will consider this trivial program.
proptest! {
#[test]
fn multiply(a in 1..=1000u32, b in 1..=1000u32) {
let r = a*b;
prop_assert!(1 <= r && r < 1000000);
}
}
This program
- generates two values
a
andb
in the range [1..1000] - multiplies
a
andb
- asserts that their product is in the range 1..1000000
In this note, we shall do the following to check this code
- create a test crate
- fuzz the crate using proptest
- verify the crate using propverify and cargo-verify
- verify the crate using propverify and Crux-mir
Workaround for “compilation error”
Before we really get started…
The cargo-verify
tool sometimes gets confused by previous compilations.
If you get an error message involving compilation error
, try running cargo
clean
and try again.
Hopefully we’ll make cargo-verify more robust soon.
Creating a test crate
The Rust compiler, KLEE, and Seahorn are in the Dockerfile so start the Docker image by running
../docker/run
All remaining commands in this file will be run in this docker image.
(It is usually easiest to run this in one terminal while using a separate editor to edit the files in another terminal.)
To try the above example, we will create a crate in which to experiment with this code.
cargo new try-propverify
cd try-propverify
cat >> Cargo.toml << "EOF"
[target.'cfg(verify)'.dependencies]
propverify = { path="/home/rust-verification-tools/propverify" }
[target.'cfg(not(verify))'.dependencies]
proptest = { version = "*" }
[features]
verifier-klee = ["propverify/verifier-klee"]
verifier-crux = ["propverify/verifier-crux"]
verifier-seahorn = ["propverify/verifier-seahorn"]
EOF
cat > src/main.rs << "EOF"
#[cfg(not(verify))]
use proptest::prelude::*;
#[cfg(verify)]
use propverify::prelude::*;
proptest! {
#[test]
fn multiply(a in 1..=1000u32, b in 1..=1000u32) {
let r = a*b;
prop_assert!(1 <= r && r < 1000000);
}
}
EOF
Fuzzing with proptest
Since proptest is a normal library, we can fuzz the
program using cargo test
.
This will generate output a bit like this
running 1 test
test multiply ... ok
test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out
Which means that the fuzzer has failed to find a bug.
(As we shall see in a moment, the test actually has a bug. We have deliberately chosen ranges that are large enough that the current version of proptest is unlikely to find the bug.)
Verifying with propverify using cargo-verify
To verify the program using propverify, we use the cargo-verify
tool to
compile the program and verify the program using one of the verification
backends (currently we support KLEE and Seahorn).
cargo clean
cargo-verify --backend=klee --tests --verbose
The program above has a deliberate error and KLEE reports the error
Running 1 test(s)
test multiply ... ERROR
test result: ERROR. 0 passed; 1 failed
VERIFICATION_RESULT: ERROR
Running cargo-verify
with the option --backend=seahorn
will give the same
result, but with Seahorn as the verification backend, instead of KLEE.
While it is nice that it has found a bug, we need more detail before we can understand and fix the bug. After finding the bug, we can “replay” the test to see some concrete data values that it fails on.
(The Seahorn backend does not support replay at the moment)
(Although it makes no difference in this small example, it is usually best to
use --test=...
instead of --tests
to focus on a single failing test at
a time.)
cargo-verify --backend=klee --test=multiply --replay
This produces additional output that shows that KLEE explored two paths through the program: one that passes and one that fails.
The first path has value a = 1
and b = 1
and it passes the test.
Test input try-propverify/kleeout-multiply/test000001.ktest
running 1 test
Value a = 1
Value b = 1
test multiply ... ok
test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out
The second path has value a = 1000
and b = 1000
and it fails the test that a*b < 1000000
.
Test input try-propverify/kleeout-multiply/test000002.ktest
Finished test [unoptimized + debuginfo] target(s) in 0.02s
Running target/x86_64-apple-darwin/debug/deps/try_propverify-bb37abd6d1dc60ef
thread 'multiply' panicked at 'assertion failed: 1 <= r && r < 1000000', src/main.rs:10:9
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
error: test failed, to rerun pass '--bin try-propverify'
running 1 test
Value a = 1000
Value b = 1000
test multiply ... FAILED
failures:
failures:
multiply
test result: FAILED. 0 passed; 1 failed; 0 ignored; 0 measured; 0 filtered out
Seeing this failing example, it is obvious that the comparision ‘<’ should be changed to ‘<=’
prop_assert!(1 <= r && r <= 1000000);
With that fix, we can rerun KLEE and see that the test passes
cargo-verify --backend=klee --tests --replay
Running 1 test(s)
test multiply ... ok
test result: ok. 1 passed; 0 failed
VERIFICATION_RESULT: VERIFIED
Verifying with propverify using Crux-mir
[The following does not run in docker at present]
(if you fixed the assertion as discussed above, revert the fix)
To verify the program using propverify and Crux-mir
cargo clean
RUSTFLAGS='--cfg verify' cargo crux-test --features verifier-crux
The program above has a deliberate error and Crux-mir reports the error
test try_propverify/17clqo26::multiply[0]: FAILED
failures:
---- try_propverify/17clqo26::multiply[0] counterexamples ----
Failure for panicking::begin_panic, called from try_propverify/17clqo26::multiply[0]
in try_propverify/17clqo26::multiply[0] at internal
As the test uses Rust’s std::assert
macro which invokes panic!
the failure
Crux-mir detects is the call to panic.
To get a more descriptive failure we can replace assert!
in src/main.rs with
verifier::assert!
, and run the verifier again
RUSTFLAGS='--cfg verify' cargo crux-test --features verifier-crux
This time we get the following report which shows the assertion that failed and its line number
test try_propverify/17clqo26::multiply[0]: FAILED
failures:
---- try_propverify/17clqo26::multiply[0] counterexamples ----
Failure for MIR assertion at src/main.rs:10:9:
VERIFIER: assertion failed: 1 <= r && r < 1000000
in try_propverify/17clqo26::multiply[0] at ./lib/crucible/lib.rs:50:17
Which is better: fuzzing or verification?
This example is intended as a quick introduction to using property-based testing using either fuzzing (with proptest) or verification tools (with propverify).
In this carefully chosen example, proptest missed a bug that propverify found. We could also have chosen an example where proptest quickly finds a bug but propverify runs for hours or days without producing a result.
The point of this example (and the point of making propverify compatible with proptest) is not that one approach is better than another but that they are complementary and you should use both fuzzing and verification tools.