Skip to content

Latest commit

 

History

History
368 lines (275 loc) · 23.5 KB

README.md

File metadata and controls

368 lines (275 loc) · 23.5 KB

torte: feature-model experiments à la carte 🍰

torte is a declarative workbench for reproducible experiments in feature-model analysis research.

Why torte? Take your pick:

  • "Tseitin or not Tseitin?" Evaluator
  • CNF Transformation Workbench
  • KConfig Extractor that Tackles Evolution
  • Towards Reproducible Feature-Model Transformation and Extraction
  • That's an Obviously Reverse-Engineered Tool Name
  • KConfig = 🍰 config ∧ 🍰 = torte ∎

torte can be used to

  • extract feature models from KConfig-based configurable software systems (e.g., the Linux kernel),
  • transform feature models between various formats (e.g., FeatureIDE, UVL, and DIMACS), and
  • analyze feature models with solvers to evaluate the extraction and transformation impact,

all in a fully declarative and reproducible fashion backed by reusable Docker containers. This way, you can

  • draft experiments for selected feature models first, then generalize them to a larger corpus later,
  • execute experiments on a remote machine without having to bother with technical setup,
  • distribute fully-automated reproduction packages when an experiment is ready for publication, and
  • adapt and update existing experiments without needing to resort to clone-and-own practices.

Getting Started: The Quick Way

This one-liner will get you started with the default experiment (Docker required).

curl -s https://ekuiter.github.io/torte/ | sh

Read on if you want to know more details.

Getting Started: In Detail

To run torte, you need:

Experiment files in torte are self-executing - so, you can just create or download an experiment file (e.g., from the experiments directory) and run it.

The following instructions will get you started on a fresh system. By default, each of these instruction sets will install torte into the torte directory. All experiment data will then be stored in the directories input and output in your working directory.

Ubuntu 22.04

# install and set up dependencies
sudo apt-get update
sudo apt-get install -y curl git make uidmap dbus-user-session

# install Docker (see https://docs.docker.com/desktop/install/linux-install/)
curl -fsSL https://get.docker.com | sh
dockerd-rootless-setuptool.sh install

# download and run the default experiment
curl -s https://ekuiter.github.io/torte/ | sh

macOS 14

# install and set up dependencies (this will replace macOS' built-in bash with a newer version)
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
(echo; echo 'eval "$(/opt/homebrew/bin/brew shellenv)"') >> $HOME/.zprofile
eval "$(/opt/homebrew/bin/brew shellenv)"
brew install bash coreutils gnu-sed grep

# install Docker (see https://docs.docker.com/desktop/install/mac-install/)
curl -o Docker.dmg https://desktop.docker.com/mac/main/arm64/149282/Docker.dmg
sudo hdiutil attach Docker.dmg
sudo /Volumes/Docker/Docker.app/Contents/MacOS/install --accept-license
sudo hdiutil detach /Volumes/Docker
rm Docker.dmg
open /Applications/Docker.app

# download and run the default experiment
curl -s https://ekuiter.github.io/torte/ | sh

Windows 11

# install WSL (see https://learn.microsoft.com/windows/wsl/install)
powershell
wsl --install

# install Docker (see https://docs.docker.com/desktop/install/windows-install/)
Invoke-WebRequest https://desktop.docker.com/win/main/amd64/149282/Docker%20Desktop%20Installer.exe -OutFile Docker.exe
Start-Process Docker.exe -Wait -ArgumentList 'install', '--accept-license'
Remove-Item Docker.exe

# restart your computer, start Docker, then install and set up dependencies
wsl
sudo apt-get update
sudo apt-get install -y curl git make

# download and run the default experiment
curl -s https://ekuiter.github.io/torte/ | sh

Above, we run the default experiment, which extracts, transforms, and analyzes the feature model of BusyBox 1.36.0 as a demonstration. To execute another experiment, run curl -s https://ekuiter.github.io/torte/ | sh -s - <experiment> (a list of predefined experiments is available here). You can also write your own experiments by adapting an existing experiment file.

Further Tips

  • As an alternative to the self-extracting installer shown above, you can clone this repository and run experiments with ./torte.sh <experiment-file>.
  • A running experiment can be stopped with Ctrl+C. If this does not respond, try Ctrl+Z, then ./torte.sh stop.
  • Run ./torte.sh help to get further usage information (e.g., running an experiment over SSH and im-/export of Docker containers).
  • Developers are recommended to use ShellCheck to improve code quality.
  • If Docker is running in rootless mode, experiments must not be run as sudo. Otherwise, experiments must be run as sudo.
  • The first execution of torte can take a while (~30 minutes), as several complex Docker containers need to be built. This can be avoided by loading a reproduction package that includes Docker images (built by ./torte.sh export).

Supported Subject Systems

This is a list of all subject systems for which feature-model extraction has been tested and confirmed to work for at least one extraction tool. Other systems or revisions may also be supported. Detailed system-specific information on potential threats to validity is available in the scripts/subjects directory.

System Revisions Notes
axtls 1.0.0 - 2.0.0
buildroot 2009.02 - 2024.05
busybox 1.3.0 - 1.36.0
embtoolkit 1.0.0 - 1.8.0
fiasco 5eed420 (2023-04-18) 2
freetz-ng d57a38e (2023-04-18) 2
linux 2.5.45 - 6.11 3 4 5 6
toybox 0.4.5 - 0.8.9 7
uclibc-ng 1.0.2 - 1.0.47

Bundled Tools

Extraction, Transformation, and Analysis

The following tools are bundled with torte and can be used in experiments for extracting, transforming, and analyzing feature models. Most tools are not included in this repository, but cloned and built with tool-specific Docker files in the docker directory. The bundled solvers are listed in a separate table below.

For transparency, we document the changes we make to these tools and known limitations. There are also some general known limitations of torte. 8

Tool Version Date Notes
arminbiere/cadiback 2e912fb 2023-07-21
ckaestne/kconfigreader 913bf31 2016-07-01 9 10 11 12 13 14
ekuiter/clausy 6b816a9 2024-01-15
ekuiter/SATGraf 2677015 2023-04-05 15
FeatureIDE/FeatJAR e27aea7 2023-04-11 16 17
FeatureIDE/FeatureIDE 3.9.1 2022-12-06 18 19 17
paulgazz/kmax 4.5.2 2023-12-20 10 11 20 21 14
Z3Prover/z3 4.11.2 2022-09-04 22

Solvers

The following solvers are bundled with torte and can be used in experiments for analyzing feature-model formulas. The bundled solver binaries are available in the docker/solver directory. Solvers are grouped in collections to allow several versions of the same solver to be used.

In addition to the solvers listed below, z3 (already listed above) can be used as a satisfiability and SMT solver.

Collection: emse-2023

These #SAT solvers (available here) were used in the evaluations of several papers:

The #SAT solvers from the collection model-counting-competition-2022 should be preferred for new experiments.

Solver Version Date Notes
countAntom 1.0 2015-05-11 23
d4 ? ?
dSharp ? ? 24
Ganak ? ?
sharpSAT ? ?

Collection: model-counting-competition-2022

These #SAT solvers (available here) were used in the model-counting competition 2022. Not all evaluated solvers are included here, as some solver binaries (i.e., for MTMC and ExactMC) have not been disclosed.

Solver Notes
c2d
d4
DPMC
gpmc
TwG 25
SharpSAT-TD 23
SharpSAT-td+Arjun 23 26

Collection: other

These are miscellaneous solvers from various sources.

Solver Version Date Class Notes
ApproxMC 4.1.9 2023-02-22 Approximate #SAT Solver
backbone_kissat.py - - Backbone Extractor
d4v2 c1f6842 2023-02-15 #SAT Solver, d-DNNF compiler, PMC
kissat_MAB-HyWalk ? ? SAT Solver
SAT4J 2.3.6 2020-12-14 SAT Solver

Collection: sat-competition

A subset of these SAT solvers was used in the evaluation of the paper Tseitin or not Tseitin? The Impact of CNF Transformations on Feature-Model Analyses (ASE 2022). Each solver is the gold medal winner in the main track (SAT+UNSAT) of the SAT competition in the year encoded in its file name. These binaries were obtained from the SAT competition, SAT heritage, and SAT museum initiatives.

Year Solver
2002 zchaff
2003 Forklift
2004 zchaff
2005 SatELiteGTI
2006 MiniSat
2007 RSat
2008 MiniSat
2009 precosat
2010 CryptoMiniSat
2011 glucose
2012 glucose
2013 lingeling-aqw
2014 lingeling-ayv
2015 abcdSAT
2016 MapleCOMSPS_DRUP
2017 Maple_LCM_Dist
2018 MapleLCMDistChronoBT
2019 MapleLCMDiscChronoBT-DL-v3
2020 Kissat-sc2020-sat
2021 Kissat_MAB
2022 Kissat_MAB-HyWalk
2023 sbva_cadical

Predefined Experiments

This is a list of all predefined experiments in the experiments directory and their purposes. Please create a pull request if you want to publish your own experiment. Experiments starting with draft- are experimental.

Experiment Purpose
busybox-history-full Extraction of all feature models of BusyBox (for every commit that touches the feature model) 27
default "Hello-world" experiment that extracts and transforms a single feature model
feature-model-collection Extraction, transformation, and analysis of several feature-model histories
feature-model-collection-learning Learning from feature-model histories
feature-model-differences Extraction and comparison of all feature models of several feature-model histories
linux-history-releases Extraction, transformation, and analysis of a history of Linux feature models
linux-history-weekly Extraction of a weekly history of Linux feature models
linux-recent-release Extraction and transformation of a recent Linux feature model
prepare-linux-fork Clones and rewrites the Linux Git repository to avoid issues with case-insensitive file systems
tseitin-or-not-tseitin Evaluation for the paper Tseitin or not Tseitin? The Impact of CNF Transformations on Feature-Model Analyses (ASE 2022)

Project History

This project has evolved through several stages and intends to replace them all:

kmax-vm > feature-model-repository-pipeline > tseitin-or-not-tseitin > torte

  • kmax-vm was intended to provide an easy-to-use environment for integrating kmax with PCLocator in a virtual machine using Vagrant/VirtualBox. It is now obsolete due to our Docker integration of kmax.
  • feature-model-repository-pipeline extended kmax-vm and could be used to extract feature models from Kconfig-based software systems with kconfigreader and kmax. The results were stored in the feature-model-repository. Its functionality is completely subsumed by torte and more efficient and reliable due to our Docker integration.
  • tseitin-or-not-tseitin extended the feature-model-repository-pipeline to allow for transformation and analysis of feature models. It was mostly intended as a reproduction package for a single academic paper. Its functionality is almost completely subsumed by torte, which can be used to create reproduction packages for many different experiments.

If you are looking for a curated collection of feature models from various domains, have a look at our feature-model-benchmark.

If you have any feedback, please contact me at kuiter@ovgu.de. New issues, pull requests, or any other kinds of feedback are always welcome.

License

The source code of this project is released under the LGPL v3 license. To ensure reproducibility, we also provide binaries (e.g., for solvers) in this repository. These binaries have been collected or compiled from public sources. Their usage is subject to each binaries' respective license - please contact me if you perceive any licensing issues.

Footnotes

  1. On arm64 systems (e.g., Windows tablets and Apple Silicon Macs), torte cross-compiles some Docker images to ensure that precompiled binaries (e.g., JavaSMT, Z3, and all solvers) function correctly. This may negatively impact performance on some systems (e.g., ARM-based Windows tablets), although recent Macs should not be affected due to Rosetta. (If you encounter errors like this one, try to disable "Use Rosetta for x86_64/amd64 emulation on Apple Silicon" in the Docker settings. This setting can be re-enabled after the Docker images have been built.) Executing torte from within a virtual machine has only been confirmed to work with Linux guest systems on x86_64 host systems. Despite our efforts, some functionality involving precompiled binaries is still known to cause problems on arm64 systems. If such functionality is required, the easiest solution is to switch to an x86_64 system (e.g., with SSH).

  2. This system does not regularly release tagged revisions, so only a single revision has been tested. 2

  3. Most revisions and architectures of Linux (since the introduction of KConfig) can be extracted successfully. The user-mode architecture um is currently not supported, as it requires setting an additional sub-architecture.

  4. Due to extractor limitations, we ignore the more recently introduced KConfig constructs defined in Linux' scripts/Kconfig.include. Most of these only add machine specific-default values or dependencies (affecting about 100 features in the kernel's history up to v6.3). However, these constructs do not affect our feature-model extraction, as we want to ignore machine-dependent restrictions.

  5. Currently, we use the KConfig parser of Linux 2.6.9 for all revisions of Linux up to Linux 2.6.9, as older versions of the parser cannot be compiled. However, our experiments showed that the chosen parser version typically does not seem to affect the extracted formula, should it succeed in extracting a formula.

  6. For Linux, specifying arbitrary commit hashes is not enabled by default, because we must perform a complete Git history rewrite (resetting the commit hashes in the process) in order to ensure that checking out the repository also succeeds cross-platform on case-insensitive file systems (e.g., APFS). To specify arbitrary and up-to-date commit hashes, use LINUX_CLONE_MODE=original|filter (see scripts/subject/linux.sh#post-clone-hook-linux: original only works on case-sensitive file systems, while filter is cross-platform, but takes several hours to run). This does not affect typical use cases that involve tag and branch identifiers.

  7. Feature models for this system are currently likely to be incomplete due to an inaccurate extraction.

  8. Currently, non-Boolean variability (e.g., constraints on numerical features) is only partially supported (e.g., encoded naively into Boolean constraints). It is recommended to check manually whether non-Boolean variability is represented as desired in generated files.

  9. We added the class TransformIntoDIMACS.scala to kconfigreader to decouple the extraction and transformation of feature models, so kconfigreader can also transform feature models extracted with other tools (e.g., kmax).

  10. We majorly revised the native C bindings dumpconf.c (kconfigreader) and kextractor.c (kmax), which are intended to be compiled against a system's Kconfig parser to get accurate feature models. Our improved versions adapt to the KConfig constructs actually used in a system, which is important to extract evolution histories with evolving KConfig parsers. Our changes are generalizations of the original versions of dumpconf.c and kextractor.c and should pose no threat to validity. Specifically, we added support for E_CHOICE (treated as E_LIST), P_IMPLY (treated as P_SELECT, see smba/kconfigreader), and E_NONE, E_LTH, E_LEQ, E_GTH, E_GEQ (ignored). 2

  11. Compiling the native C bindings of kconfigreader and kmax is not possible for all KConfig-based systems (e.g., if the Python-based Kconfiglib parser is used). In that case, you can try to reuse a C binding from an existing system with similar KConfig files; however, this may limit the extracted model's accuracy. 2

  12. The DIMACS files produced by kconfigreader may contain additional variables due to Plaisted-Greenbaum transformation (i.e., satisfiability is preserved, model counts are not). Currently, this behavior is not configurable.

  13. Feature models and formulas produced by kconfigreader have nondeterministic clause order. This does not impact semantics, but it possibly influences the efficiency of solvers.

  14. The formulas produced by kconfigreader and kmax do not explicitly mention unconstrained features (i.e., features that do not occur in any constraints). However, for many analyses that depend on knowing the entire feature set (e.g., simply listing all configurable features or calculating model counts), this is a threat to validity. We do not modify the extracted formulas, to preserve the original output of kconfigreader and kmax. To address this threat, we instead offer the transformation stage transform-into-unconstrained-features, which explicitly computes these features. 2

  15. We forked the original SATGraf tool and migrated it to Gradle. We also added a new feature for exporting the community structure visualization as a JPG file, avoiding the graphical user interface.

  16. FeatJAR is still in an experimental stage and its results should generally be cross-validated with FeatureIDE.

  17. DIMACS files produced by FeatJAR and FeatureIDE do not contain additional variables (i.e., equivalence is preserved). Currently, this behavior is not configurable. 2

  18. We perform all transformations with FeatureIDE from within a FeatJAR instance, which does not affect the results.

  19. Transformations with FeatureIDE into XML and UVL currently only encode a flat feature hierarchy, no feature-modeling notation is reverse-engineered.

  20. We added the script kclause2model.py to kmax to translate kclause's pickle files into the kconfigreader's feature-model format. This file translates Boolean variability correctly, but non-Boolean variability is not supported.

  21. We do not use kmax's kclause_to_dimacs.py script for CNF transformation, as it has had some issues in the past. Instead, we have a separate Docker container for Z3.

  22. The DIMACS files produced by Z3 may contain additional variables due to Tseitin transformation (i.e., satisfiability and model counts are preserved). Currently, this behavior is not configurable.

  23. This solver currently crashes on some or all inputs. 2 3

  24. This version of dSharp is known to produce inaccurate results for some inputs, so use it with caution.

  25. For TwG, two configurations were provided by the model-counting competition (TwG1 and TwG2). As there was no indication as to which configuration was used in the competition, we arbitrarily chose TwG1.

  26. For SharpSAT-td+Arjun, two configurations were provided by the model-counting competition (conf1 and conf2). As only the second configuration actually runs SharpSAT-td, we chose conf2 (conf1 probably implements the approximate counter SharpSAT-td-Arjun+ApproxMC).

  27. As noted by Kröher et al. 2023, the feature model of BusyBox is scattered across its .c source code files in special comments and therefore not trivial to extract as a full history (because we need to detect changes in any KConfig files to identify relevant commits). We solve this problem by iterating over all commits to generate all feature models, committing them to a new busybox-models repository, in which each commit represents one version of the feature model.