Open-Athena/caduceus-distill

⚕ Caduceus Distill

This repo is an experiment in distillation of the Caduceus DNA model.

Dependencies

Due to mamba_ssm setup.py shenanigans, installing the dependencies with uv takes two steps:

uv sync
uv sync --extra mamba

Warning

You can't install the mamba (mamba-ssm) package on a MacBook.
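
Because the mamba extra is installed separately and is not available on every machine, it can help to check that the package is importable before launching a run. This is a minimal sketch, assuming the import name is `mamba_ssm` (the name used in the setup.py remark above); `has_module` is a hypothetical helper and is not part of this repo.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if the top-level module `name` can be located (it is not imported)."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # Assumption: the mamba-ssm package is imported as `mamba_ssm`.
    if has_module("mamba_ssm"):
        print("mamba_ssm is available")
    else:
        print("mamba_ssm is missing; try `uv sync --extra mamba`")
```

Run it inside the project environment (for example via `uv run python`) so that it inspects the same packages the experiment will use.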

Results

Report

Run experiment

First download the data:

./bin/fetch_data

Example of a distillation experiment:

uv run distill \
  --batch-size=1 \
  --accumulate-grad-batches=2 \
  --temperature=1 \
  --lr=0.001 \
  --max-train-batches=32768 \
  --max-val-batches=32 \
  --max-final-val-batches=1024 \
  --val-check-interval=128 \
  --no-wandb

Warning

The command above doesn't publish to W&B (the last flag, --no-wandb). For a real experiment, you most definitely want to publish to W&B; see the --wandb-project-name flag.
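
For readers new to distillation, the role of a temperature such as the one passed via `--temperature` can be illustrated with the common soft-target objective: both teacher and student logits are divided by a temperature T before the softmax, and the student is trained to match the softened teacher distribution. The sketch below is an assumption about the general technique, not this repo's actual loss. The names `log_softmax` and `distill_loss`, the T² scaling, and the four-way example logits are all hypothetical.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Log of the temperature-scaled softmax, computed stably."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max before exponentiating to avoid overflow
    log_z = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_z for s in scaled]


def distill_loss(student_logits, teacher_logits, temperature=1.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    The T^2 factor is a common convention for keeping gradient magnitudes
    comparable across temperatures; it is an assumption, not taken from this repo.
    """
    log_p = log_softmax(teacher_logits, temperature)  # teacher
    log_q = log_softmax(student_logits, temperature)  # student
    kl = sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
    return kl * temperature ** 2


if __name__ == "__main__":
    # Hypothetical logits over four classes (for example, one per nucleotide).
    teacher = [2.0, 0.5, -1.0, 0.0]
    student = [1.0, 1.0, 0.0, 0.0]
    for t in (1.0, 2.0, 4.0):
        print(f"T={t}: loss={distill_loss(student, teacher, t):.4f}")
```

With T=1, as in the example command, the softmax is the ordinary one; larger values flatten both distributions so that low-probability classes contribute more to the loss.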

To see available options:

uv run distill --help
