Welcome to use Assemblage’s documentation!

Assemblage is a distributed binary corpus discovery, generation, and archival tool. It is built to provide high-quality labeled metadata for the purposes of building training data for machine learning applications of binary analysis and other applications.

You can find our paper at https://arxiv.org/abs/2405.03991, and our deployment/dataset docs at this website.

Documentation Section

Contact us:

To contact us about datasets access, deployment, or any other questions, please email current maintainers by:

Kristopher Micinski: kkmicins@syr.edu
Chang Liu: cliu57@syr.edu

Here are the email addresses of all contributors to this project by last name:

Naveen Ashok: nashok@syr.edu
Alex Duly: apduly@syr.edu
Maya Fuchs: fuchs_maya@bah.com
James Holt: holt@lps.umd.edu
Mia Kerchen: mhkerche@syr.edu
Chang Liu: cliu57@syr.edu
Kristopher Micinski: kkmicins@syr.edu
Townsend Southard Pantano: tgsoutha@syr.edu
Rebecca Saul: Saul_Rebecca@bah.com
Yihao Sun: ysun67@syr.edu