SING: Improving the efficiency of Secure Multi-Party Computation Protocol Assignment using Neural Networks

This repository contains the full code and a small excerpt of our dataset for reproducing our training and evaluation results.

Initial Setup

python -m venv venv
source venv/bin/activate

# may differ depending on your platform
#
# see https://pytorch.org/get-started/locally/
pip install torch torchvision torchaudio

pip install -r requirements.txt


# unpack dataset
tar xf dataset-excerpt.tar.gz
tar xf dataset-excerpt-c.tar.gz

Optional: Compiling ABY

This step is not necessary for training or evaluation of our models. ABY is required for benchmarking MPC performance.

cd ABY-vendor
mkdir build
cd build
cmake .. -DCMAKE_BUILD_TYPE=Release -DABY_BUILD_EXE=On
make

Optional: Compiling Silph

This step is not necessary for training, evaluation, or benchmarking. Compiling Silph is a precondition for comparing SING and Silph performance. We have applied minor fixes to the build system of the original Silph repository. The build system as a whole is identical to the original release. We thus refer to the documentation and paper for more detailed instructions.

Silph is written in Rust and thus requires a stable Rust toolchain which is commonly installed with rustup.

Furthermore, an installation of coinor-cbc is required for ILP solving. This library can be found in some distribution repositories or built from source. Other libraries that Silph depends on (e.g., KaHyPar) will be built from source automatically.

cd silph
python driver.py --features aby bench c lp r1cs smt
python driver.py --install
python driver.py --build --mode release

Note that this build process may take multiple hours as tests will be run as part of the build process.

Dataset

Due to the large size of the dataset, we only provide a small excerpt in this repository.

dataset-excerpt-c contains the original C source files for the circuits.
dataset-excerpt contains a processed excerpt of the dataset consisting of compiled circuits, alternative share assignments, and benchmarked metrics.

Training scripts will process the dataset further into the PyTorch Geometric format. This happens automatically. We refer to the following sections on training for more details.

Note that the training and evaluation performance of this dataset excerpt will vary from the results in our paper.

Optional: Bootstrapping the dataset from C source files

The dataset processing starts with a directory containing C source files (dataset-excerpt-c).

Step 1: Compile all circuits

Compilation requires a compiled version of Silph in $CARGO_MANIFEST_DIR (cf. Initial Setup).

scripts/generate_dataset.sh

Step 2: Clean up the dataset

pushd dataset-compiled
scripts/find_duplicate_circuits.sh failed.log
popd

Step 3: Generating alternative share assignments

This is a necessary step for training our cost prediction model.

python generate_alternative_share_assignments.py

Step 4: Benchmark MPC runtimes

This step is necessary for training our cost prediction model on real-world benchmarks (SING 3).

Benchmarking requires ABY (cf. Initial Setup).

python benchmark_mpc.py --mode dataset --network-setting LAN --cost-name runtime-neon-lan --metric runtime

Calculate dataset splits

Training and evaluation requires splitting the dataset into training, validation, and test sets. This is done as follows:

python generate_dataset_split.py

generate_dataset_split.py includes many configuration options that influence the resulting split (e.g., setting a maximum size threshold for circuits included in the dataset). Run

python generate_dataset_split.py --help

for a detailed overview of all command-line options.

Optional: Generating circuits with LLMs

We use Ollama as a local LLM inference engine. Instructions on installing Ollama can be found on the official website.

Once Ollama is set up, various open-source LLMs can be downloaded, e.g.,

ollama pull gemma3:4b

Using a script, all locally downloaded models can be queried with all available prompts respectively.

cd llm-generate
bash generate_missing_combinations.sh

Optional: Generating random circuits

cd grammar-generate
python generate.py

Several options of the generation process (e.g., operation budget) can be configured via command-line options. Run

python generate.py --help

for a full list of options.

Cost Prediction Model

Our cost prediction model $C^\text{SING}$ predicts a numeric cost given a circuit $c$ and a share assignment $s$.

We provide model checkpoints used in our evaluation in the pretrained directory.

Train

Depending on whether the model predicts Silph costs or benchmarked runtimes, the training process uses a different directory to store the dataset in PyTorch Geometric format.

Silph costs: dataset-cost-prediction
Benchmarked runtimes: dataset-cost-prediction-measured

# train on Silph costs
mkdir -p dataset-cost-prediction
ln -s dataset-excerpt dataset-cost-prediction/raw

python train_cost_prediction.py --lr 0.001 --cost-name silph


# train on benchmarked costs (e.g., runtime-neon-lan)
mkdir -p dataset-cost-prediction-measured
ln -s dataset-excerpt dataset-cost-prediction-measured/raw

python train_cost_prediction.py --lr 0.001 --cost-name runtime-neon-lan

Trained models will be saved in the checkpoints directory.

Evaluation

Use eval_cost_prediction.py to calculate metrics on how the share assignment of SING differs from that of Silph (e.g., MSE, R2-score).

python eval_cost_prediction.py

Using the --checkpoint <PATH> flag, the evaluation can be performed on a specific model checkpoint.

This script supports multiple command-line options to load a specific model checkpoint, filter the dataset, or configure visulization. Run

python eval_cost_prediction.py --help

for a detailed overview of all command-line options.

Share Assignment Model

Our share assignment model $S^\text{SING}$ outputs a share assignment $s$ for a circuit $c$.

We provide model checkpoints used in our evaluation in the pretrained directory.

Train

The dataset in PyTorch Geometric format will be stored in dataset.

mkdir -p dataset

ln -s dataset-excerpt dataset/raw

# supervised (SING 1)
python train.py --lr 0.01 --alpha 0.5

# semi-supervised (SING 2, SING 3)
python train.py --lr 0.01 --alpha 0.1 --predicted-cost

Trained models will be saved in the checkpoints directory.

Evaluation

Use eval.py to calculate metrics on how the share assignment of SING differs from that of Silph (e.g., accuracy, confusion matrix).

python eval.py

Use benchmark_share_assignment.py to compare the SING and Silph runtimes of generating share assignments for circuits. This step requires a compiled version of Silph (cf. Initial Setup).

python benchmark_share_assignment.py --circuits-file paper_benchmark_c.txt

Use bechmark_mpc.py to benchmark runtimes and communication amounts of SING and Silph share assignments. The result will be written to a CSV file. This step requires a compiled version of ABY (cf. Initial Setup).

python benchmark_mpc.py --mode benchmark --hashes paper_benchmark_hashes.txt

For setting up network simulations, benchmark_mpc.py needs to be run as a user with sudo ability, i.e., the user needs to be in the wheel group.

Using the --checkpoint <PATH> flag, the evaluation can be performed on a specific model checkpoint.

Reproducing Plots and Tables

All plots and tables in our paper can be reproduced from measured data using the benchmark-results/plot.py script.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SING: Improving the efficiency of Secure Multi-Party Computation Protocol Assignment using Neural Networks

Initial Setup

Dataset

Step 1: Compile all circuits

Step 2: Clean up the dataset

Step 3: Generating alternative share assignments

Step 4: Benchmark MPC runtimes

Calculate dataset splits

Cost Prediction Model

Train

Evaluation

Share Assignment Model

Train

Evaluation

Reproducing Plots and Tables

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
ABY-vendor		ABY-vendor
analysis		analysis
benchmark-results		benchmark-results
example-gauss		example-gauss
grammar-generate		grammar-generate
llm-generate		llm-generate
pretrained		pretrained
scripts		scripts
silph		silph
sing		sing
sql		sql
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
benchmark_mpc.py		benchmark_mpc.py
benchmark_share_assignment.py		benchmark_share_assignment.py
dataset-excerpt-c.tar.gz		dataset-excerpt-c.tar.gz
dataset-excerpt.tar.gz		dataset-excerpt.tar.gz
empirical_costs.json		empirical_costs.json
empirical_wan.json		empirical_wan.json
eval.py		eval.py
eval_cost_prediction.py		eval_cost_prediction.py
exclude.txt		exclude.txt
generate_alternative_share_assignments.py		generate_alternative_share_assignments.py
generate_dataset_split.py		generate_dataset_split.py
grammar_hashes.txt		grammar_hashes.txt
inference.py		inference.py
inference_trivial.py		inference_trivial.py
network.py		network.py
ntfy.py		ntfy.py
paper_benchmark_c.txt		paper_benchmark_c.txt
paper_benchmark_hashes.txt		paper_benchmark_hashes.txt
requirements.txt		requirements.txt
silph_benchmark_c.txt		silph_benchmark_c.txt
silph_benchmark_hashes.txt		silph_benchmark_hashes.txt
train.py		train.py
train_cost_prediction.py		train_cost_prediction.py

Folders and files

Latest commit

History

Repository files navigation

SING: Improving the efficiency of Secure Multi-Party Computation Protocol Assignment using Neural Networks

Initial Setup

Dataset

Step 1: Compile all circuits

Step 2: Clean up the dataset

Step 3: Generating alternative share assignments

Step 4: Benchmark MPC runtimes

Calculate dataset splits

Cost Prediction Model

Train

Evaluation

Share Assignment Model

Train

Evaluation

Reproducing Plots and Tables

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages