WARP: Weight-Space Analysis for Recovering Training Data Portfolios (ICML'26 WSS)

Official repositry for "WARP: Weight-Space Analysis for Recovering Training Data Portfolios" which is accepted to the ICML 2026 Workshop on Weight-Space Symmetries. In our paper, we introduce WARP, a framework that recovers a fine-tuned model’s domain mixture directly from its released weights. WARP extracts geometric features and maps them to domain proportions using either a parameter-free softmax readout or a MLP projector trained on synthetic mixtures. In controlled experiments with BERT and GPT-2, WARP recovers domain mixtures with MAE as low as 0.048 and 0.117 respectively, outperforming membership inference and a variant with access to the true training trajectory, and remains accurate when recovering different training recipes.

Paper: link_to_be_put

🚀 Main Results

Experiment Pipeline

At a high level, the pipeline:

Selects a seed subset (D) of training data.
Builds a fine-tuning subset (D') with a controlled class distribution.
Fine-tunes a base model into an expert (saving intermediate checkpoints along the way).
Constructs pseudo-expert models along the base → expert path (via interpolations).
Computes a per-example alignment matrix (M) using last-layer gradients.

This repository currently contains two experiment implementations:

bert/ — BERT sequence-classification experiments
gpt2/ — GPT-2 sequence-classification experiments

📁 Repository Structure

Top-level utilities:

data.py — Dataset loading, filtering, subset selection (D and D'), and DataLoader creation

Notebooks:

experiment.ipynb — ipynb file to start running the experiments
Baselines.ipynb — ipynb file to generate the Baseline model comparisons
Visualizations.ipynb — ipynb file for generating the visualization utilities e.g alignment matrix visualizations
kfold_pipeline.ipynb — ipynb file for generating training point files, running Kfold validation and saving the results

Model-specific pipelines:

`bert/`

bert_domain_distribution.py — Main runner: loads config, prepares data, fine-tunes model, computes alignment matrices
bert_finetuning.py — Fine-tunes BERT base → expert with intermediate checkpoint saving and saving converged and overtrained checkpoints
bert_models.py — Pseudo-expert creation via linear/quadratic interpolation and mergekit-based methods (SLERP/TIES/DELLA)
bert_alignment.py — Computes alignment matrix using per-example last-layer gradients (not for the converged and overtrained)

`gpt2/`

gpt2_domain_distribution.py — Main runner for GPT-2 (same pipeline as BERT)
gpt2_finetuning.py — Fine-tunes GPT-2 base → expert with intermediate checkpoint saving and saving converged and overtrained checkpoints
gpt2_domain_distribution_converged.py - Computes alignment matrix using per-example last-layer gradients for checkpoints for converged model
gpt2_models.py — Pseudo-expert creation via linear/quadratic interpolation and mergekit-based methods (SLERP/TIES/DELLA)
gpt2_alignment.py — Computes alignment matrix using last-layer score gradients (not for the converged and overtrained)

Quickstart

Install dependencies

Install Miniconda

wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh
source ~/.bashrc

Use conda

conda update -n base -c defaults conda
conda env create -f environment.yml
conda activate warp

Running an experiment

Use the experiment.ipynb to generate the configs easily
You can also generate custom configs
Then open a tmux session and run the experiments in it using a jupyter browser

Alternatively, if you want to just execute one experiment-:

BERT

python bert/bert_domain_distribution.py path/to/config.json

GPT-2

python gpt2/gpt2_domain_distribution.py path/to/config.json

Outputs are written to:

output_dir = config.experiment_name

Typical artifacts:

{experiment_name}/dataset_info.json — indices for (D) and (D')
{experiment_name}/theta_base_model.pt — base weights
{experiment_name}/theta_exp_model.pt — expert weights
{experiment_name}/converged_model.pt - converged model weights (optional)
{experiment_name}overtrained_checkpoint.pt- overtrained model weights (optional)
{dataset}_{interpolation}_{proportionArr}/alignment_matrix_<interpolation>.npy — alignment matrices
{dataset}_{interpolation}_{proportionArr}/lambda_statistics.json — per-λ summary stats

Generating the results

mkdir -p results_datainfo/{model}/{dataset_name}
mkdir -p results_align_matrix/{model}/{dataset_name}

ensure to use the dataset name that is comptible with hugging face's methods
Move the {experiment_name} directories to the appropriate subdirectory in results_datainfo
Move the {dataset}{interpolation}{proportionArr} directories to the appropriate subdirectory in results_align_matrix
Run the baselines.ipynb , Visualizations.ipynb and kfold_pipeline.ipynb accordingly. There is no dependency of running before the other

Example config (template)

{
  "experiment_name": "my_experiment",
  "dataset": ["ag_news"],
  "model_name": ["bert-base-uncased"],
  "num_labels": 4,

  "batch_size": 16,
  "max_length": 256,
  "learning_rate": 2e-5,
  "num_epochs": 4,
  "optimizer": ["Adam"],

  "n_total": 5000,
  "n_finetune": 2500,
  "finetuning_source": ["original"],
  "proportionArr": [0.25, 0.25, 0.25, 0.25],

  "K": 15,
  "lambda_min": 0.05,
  "lambda_max": 0.95,

  "interpolations": ["linear", "quadratic", "slerp", "ties", "della","model_baseline"]
}

Parameters

Core Parameters

Parameter	Type	Description	Supported Values
`experiment_name`	`string`	Output directory name for all artifacts
`dataset`	`string`	HuggingFace dataset identifier
`model_name`	`string`	Pre-trained model from HuggingFace	`"bert-base-uncased"`, `"gpt2"`
`num_labels`	`int`	Number of classification classes

Training Hyperparameters

Parameter	Type	Description	Supported Values
`batch_size`	`int`	Training batch size
`max_length`	`int`	Maximum sequence length (tokens)
`learning_rate`	`float`	Optimizer learning rate
`num_epochs`	`int`	Number of fine-tuning epochs
`optimizer`	`string`	Optimization algorithm	`"Adam"`, `"SGD"`

Pseudo-Expert Generation

Parameter	Type	Description	Range
`K`	`int`	Number of pseudo-experts (interpolation points)	Typically `10` – `20`
`lambda_min`	`float`	Starting interpolation value (close to base model θ₀)	`0.0` – `0.1`
`lambda_max`	`float`	Ending interpolation value (close to expert model θₑ)	`0.9` – `1.0`
`interpolations`	`array<string>`	Methods for creating pseudo-experts (see below)	See table below

Available `interpolations` Methods

Method	Description
`"linear"`	Linear weight interpolation: θ(λ) = (1-λ)θ₀ + λθₑ
`"quadratic"`	Quadratic trajectory in weight space
`"model_baseline"`	Uses intermediate checkpoints saved during training
`"slerp"`	Spherical linear interpolation (geodesic path)
`"ties"`	TIES: Task Arithmetic via weight merging
`"della"`	DELLA: Adaptive weight averaging

Data Sampling Configuration

Parameter	Type	Description	Range/Values
`n_total`	`int`	Size of seed subset (D) sampled from full training set
`n_finetune`	`int`	Size of fine-tuning subset (D')	≤ `n_total`
`finetuning_source`	`string`	Controls how (D') is constructed (see below)	`"original"`, `"select"`
`proportionArr`	`array<float>`	Target class distribution for (D') (must sum to 1.0)

Understanding `finetuning_source`

This parameter determines the sampling strategy for constructing the fine-tuning subset (D'):

Value	Description
`"original"`	Sample (D') directly from the full training dataset with target `proportionArr` distribution
`"select"`	Sample (D') from the seed subset (D) with target `proportionArr` distribution

Visual Comparison:

Full Training Dataset (100k samples)
    │
    ├─→ [finetuning_source = "original"]
    │       │
    │       ├─→ Seed Subset D (5000 samples)      ← for alignment computation
    │       └─→ Fine-tuning D' (2500 samples)     ← sampled from full 100k
    │
    └─→ [finetuning_source = "select"]
            │
            └─→ Seed Subset D (5000 samples)
                    │
                    ├─→ Fine-tuning D' (2500 samples)  ← sampled from D

🧐 Todo

We welcome contributions and suggestions to the list!

Complete ReadMe
Convert ipynb files to python scripts
Restructure the project directory to seprate the ipynb files
Integrate the complete converged and overtrained experiment code into one central file

📰 News

[2025/05] Our paper is submitted to ICML workshop 2026: Weight-Space Symmetries!
[2025/05] Our paper is accpeted to ICML workshop 2026: Weight-Space Symmetries!

📑 Citation

If you use the codes, please cite the following paper:

@inproceedings{
huang2026warp,
title={{WARP}: Weight-Space Analysis for Recovering Training Data Portfolios},
author={Tzu-Heng Huang and Aditya Goyal and John Cooper and Frederic Sala},
booktitle={ICML 2026 Workshop on Weight-Space Symmetries: from Foundations to Practical Applications},
year={2026},
url={https://openreview.net/forum?id=5GJDmHFNUY}
}

Notes

This is a research-grade codebase, and some parts may still be mid-refactor. If you run into issues that are hard to resolve, please reach out for further assistance.
If a dataset is not natively supported in the repo, make sure to use the dataset name in the format expected by Hugging Face’s load_dataset
By default, the repo uses the "text" column to generate tokens. If your dataset uses a different text column, update it accordingly in data.py lines 246-252
to integrate another optimzier, pls change lines 321-332 for bert or lines 377-387 for gpt2
advanced interpolation methods (SLERP/TIES/DELLA) rely on mergekit. The repo’s requirements.txt includes a mergekit editable install; if mergekit import fails, those methods will be unavailable.
uncomment the code for converged and overtrained model in lines 251-282 for bert and lines 260-291 for gpt2 if you want to save those checkpoints
for converged and overtrained checkpoint, they are generated normally as we execute experiment.ipynb.
To generate the alignment matrix for those converged checkpoints, we should use gpt2_domain_distribution_converged.py. Only small chnages are required to modify this file to work with overtrained and with bert and with other interpolations. Additonally, you would need to uncomment lines 297-301 to change the name of the alignment matrix. During generating results, you would need to ensure that the methods are working with the converged or overtrained alignment matrix and would have to make small changes in that regard.

License

This project is licensed under the MIT License.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WARP: Weight-Space Analysis for Recovering Training Data Portfolios (ICML'26 WSS)

🚀 Main Results

Experiment Pipeline

📁 Repository Structure

`bert/`

`gpt2/`

Quickstart

Install dependencies

Running an experiment

BERT

GPT-2

Generating the results

Example config (template)

Parameters

Core Parameters

Training Hyperparameters

Pseudo-Expert Generation

Available `interpolations` Methods

Data Sampling Configuration

Understanding `finetuning_source`

🧐 Todo

📰 News

📑 Citation

Notes

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 128 Commits
bert		bert
final_results		final_results
gpt2		gpt2
.gitignore		.gitignore
Baselines.ipynb		Baselines.ipynb
LICENSE.md		LICENSE.md
README.md		README.md
Visualizations.ipynb		Visualizations.ipynb
data.py		data.py
environment.yml		environment.yml
experiment.ipynb		experiment.ipynb
kfold_pipeline.ipynb		kfold_pipeline.ipynb
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

WARP: Weight-Space Analysis for Recovering Training Data Portfolios (ICML'26 WSS)

🚀 Main Results

Experiment Pipeline

📁 Repository Structure

bert/

gpt2/

Quickstart

Install dependencies

Running an experiment

BERT

GPT-2

Generating the results

Example config (template)

Parameters

Core Parameters

Training Hyperparameters

Pseudo-Expert Generation

Available interpolations Methods

Data Sampling Configuration

Understanding finetuning_source

🧐 Todo

📰 News

📑 Citation

Notes

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

`bert/`

`gpt2/`

Available `interpolations` Methods

Understanding `finetuning_source`

Packages