We assume you have VeriFIT/smt-bench set up, according to its instructions, on the evaluation server where you run the experimental evaluation.
- Set up the Python virtual environment:

  ```shell
  python -m venv .venv
  source .venv/bin/activate
  pip install -r requirements.txt
  ```
- Get new tasks from the server with the experimental results (change the host, port, etc. if running from a different server):

  ```shell
  ./get_tasks_and_generate_csv.sh
  ```
- Process the results (choose one):
  - Run the Jupyter evaluation notebook `eval.ipynb`:
    - Set the correct tools and benchmarks to evaluate (in particular, set the version of NOODLER).
    - Run the first 4 cells to load the benchmarks, then run the remaining cells as needed.
  - Only prepare the results for manual processing: store the processed results and evaluate them manually (see the sketch after these steps):

    ```shell
    ./pyco_proc.py [options] <requested_tasks_file_with_results.tasks>
    ```
- Exit the Python virtual environment:

  ```shell
  deactivate
  ```
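
To illustrate the manual route, below is a minimal sketch of evaluating the generated results table with pandas. Everything specific in it is an assumption made for illustration: the file name `results.csv`, the delimiter, the tool names, and the `<tool>-result`/`<tool>-runtime` column layout are hypothetical, not the actual format produced by `pyco_proc.py`; inspect your output and adapt accordingly.

```python
# Minimal sketch of manually evaluating the processed results.
# ASSUMPTIONS: the file name, the delimiter, the tool names, and the
# column layout below are hypothetical; inspect the actual output of
# pyco_proc.py / get_tasks_and_generate_csv.sh and adjust accordingly.
import pandas as pd

# Hypothetical file name; the delimiter may also differ (e.g., sep=";").
df = pd.read_csv("results.csv")

tools = ["z3", "cvc5", "noodler"]  # hypothetical tool column prefixes
for tool in tools:
    result_col = f"{tool}-result"    # assumed per-tool result column
    runtime_col = f"{tool}-runtime"  # assumed per-tool runtime column
    # Count instances the tool decided; "sat"/"unsat" values are assumed.
    solved = df[result_col].isin(["sat", "unsat"])
    print(
        f"{tool}: solved {solved.sum()}/{len(df)} instances, "
        f"mean runtime on solved {df.loc[solved, runtime_col].mean():.2f} s"
    )
```

From such a table, per-benchmark breakdowns or pairwise comparisons can be built the same way; for the standard analysis, prefer the `eval.ipynb` notebook described above.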