This repo is an experiment in distllation of the Caduceus DNA model.
Due to mamba_ssm setup.py shenanigans, uv init is a little more complicated:
uv sync
uv sync --extra mambaWarning
You can't install mamba (mamba-ssm) package on a macbook.
First download the data:
./bin/fetch_dataExample of a distillation experiment:
uv run distill \
--batch-size=1 \
--accumulate-grad-batches=2 \
--temperature=1 \
--lr=0.001 \
--max-train-batches=32768 \
--max-val-batches=32 \
--max-final-val-batches=1024 \
--val-check-interval=128 \
--no-wandbWarning
The command above doesn't publish to W&B (last flag). For a real experiment, you most definitely want to publish to W&B, see --wandb-project-name flag.
To see available options:
uv run distill --help