Title: Linear Assignment on Tile-Centric Accelerators: Redesigning Hungarian Algorithm on IPUs

This is the README for the paper "Accelerating the Hungarian Algorithm: Computing Linear Assignments on CPUs, GPUs, and IPUs".

Hardware Platform:

IPU: We run our algorithm on the 1.325GHz Mk2 GC200 IPU.

GPU: All other algorithms run on an NVIDIA A100 GPU with 40 GB of memory.

Software Platform:

Poplar: We run our algorithm using the Poplar SDK 3.2.0 (https://www.graphcore.ai/downloads). Poplar is the programming framework for communicating directly with the IPU.
PopVision: We use PopVision to profile our algorithm (https://www.graphcore.ai/developer/popvision-tools#downloads).

Datasets:

This repo provides the following datasets:

(1) A sparse test dataset with a matrix size of 1024 (test_data/sparse/new-sparse_1024.txt).

(2) The real-world datasets for graph alignment (real-world/). We generate the similarity matrices with the Grampa algorithm, which is included in this repo (Grampa.py).

(3) The remaining datasets are hosted on Google Drive due to space limits and can be accessed at https://drive.google.com/drive/folders/1It5Uq9_u6Gvft41BQRKglFrjArH_h21s?usp=sharing

Running script

We include a run.sh script to run our algorithm. It contains the following commands.

rm main 
g++ --std=c++11 HunIPU.cpp -lpoplar -lpopops -lpoputil -lpoplin -o main
./main 1024 test_data/sparse/new-sparse_1024.txt

The first line removes the previously compiled program (if any).

The second line compiles the program.

The third line runs the program. It has the following format.

./program matrix-size data-source

For example, after compiling the program into main, to test a matrix of size 1024 with the data in test_data/sparse/new-sparse_1024.txt, run the following command.

./main 1024 test_data/sparse/new-sparse_1024.txt

After running the algorithm, the program will output the running time.
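Because the program takes the matrix size and the data source as positional arguments, it can help to validate both before launching a run. The following is a minimal sketch, not part of run.sh: check_args is a hypothetical helper name, and the checks (a positive integer size, a readable file) are assumptions about what counts as valid input.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repo): validate the two
# positional arguments expected by ./main before starting a run.
check_args() {
    size="$1"
    data="$2"

    # The matrix size must consist of digits only.
    case "$size" in
        ''|*[!0-9]*)
            echo "error: matrix size must be a positive integer" >&2
            return 1
            ;;
    esac

    # Reject a size of zero.
    if [ "$size" -le 0 ]; then
        echo "error: matrix size must be greater than zero" >&2
        return 1
    fi

    # The data source must be a readable file.
    if [ ! -r "$data" ]; then
        echo "error: cannot read data source: $data" >&2
        return 1
    fi

    return 0
}
```

A possible use is `check_args 1024 test_data/sparse/new-sparse_1024.txt && ./main 1024 test_data/sparse/new-sparse_1024.txt`, so that the program only starts when both arguments look sensible.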

To generate a profile for analyzing the program, add POPLAR_ENGINE_OPTIONS='{"autoReport.all":"true","autoReport.directory":"./report"}' before the command that runs the algorithm.

POPLAR_ENGINE_OPTIONS='{"autoReport.all":"true","autoReport.directory":"./report"}' ./main 1024 test_data/sparse/new-sparse_1024.txt

After generating the profile, we can use PopVision to analyze the algorithm's execution.
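When profiling several runs, writing each report to its own directory keeps the profiles from overwriting one another. The sketch below builds the POPLAR_ENGINE_OPTIONS value for a given report directory. The two option keys are taken from the command above; profile_opts is a hypothetical helper name, and the sketch assumes the directory name contains no quotes or backslashes, since it does no JSON escaping.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repo): print the
# POPLAR_ENGINE_OPTIONS JSON string for the report directory in $1.
# Assumes $1 contains no characters that need JSON escaping.
profile_opts() {
    printf '{"autoReport.all":"true","autoReport.directory":"%s"}' "$1"
}
```

For example, `POPLAR_ENGINE_OPTIONS="$(profile_opts ./report_1024)" ./main 1024 test_data/sparse/new-sparse_1024.txt` would place the profile in ./report_1024, a directory name chosen here only for illustration.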

Baseline:

We compare our algorithm with the following baseline algorithms; their GitHub repos are listed below.

FastHA: https://github.com/paclopes/HungarianGPU
CuLAP: https://github.com/tianluoabc/CuLAP
