ordvec

Training-free ordinal & sign quantization for vector retrieval.

ordvec is a small, dependency-light Rust crate for compressed nearest-neighbour search over high-dimensional embeddings.

What's different

Compressed-retrieval libraries usually either fit a codebook to your data (product / scalar quantization) or wrap vectors in a graph (HNSW). ordvec does neither — it quantizes the ordinal structure of each vector on its own:

Training-free, data-oblivious. No codebook, no learned rotation, no fit step. Encoding is a per-vector rank (or sign) transform — index the very first vector with no prior data, and never refit when the corpus drifts.
Zero system dependencies. Pure Rust — no BLAS / LAPACK / ndarray / faer. Builds and cross-compiles cleanly, including to aarch64 and wasm32.
Ordinal + sign quantization. Compresses the rank order of coordinates (1/2/4 bits each) and their signs — a different lever from the product / scalar / binary quantization most crates use.
Predictable footprint. Exactly dim * bits / 8 bytes per document — known before you see any data (256 B at dim = 1024, 2-bit), with bits ∈ {1, 2, 4} the size/recall knob.
Two-stage retrieval, built in. A cheap bitmap / sign-popcount prefilter feeds an exact rerank — the coarse→fine pipeline ships as library primitives.

ordvec is a compressed flat-scan substrate (optionally two-stage): small codes scored by fast SIMD — AVX-512/AVX2 runtime-dispatched on x86_64, baseline NEON on aarch64, and simd128 on wasm32. It is the code-and-scan layer, not a navigable-graph index — but the codes are small and index-agnostic, so they compose under an ANN or sharding layer for large-scale serving rather than competing with one.

Ordinal index family

Rank — full-precision rank vectors (u16 per coordinate).
RankQuant — ranks bucketed into 1 << bits equal-width bins, bits bits per coordinate (dim * bits / 8 bytes/doc). Both a symmetric (Spearman) and asymmetric (float-query LUT) scorer.
Bitmap — a top-bucket bitmap per document (one bit per coordinate); scoring is popcount(Q AND D), a coarsened rank overlap.
SignBitmap — a sign bitmap per document for sign-cosine candidate generation, feeding an exact rerank stage.

Two further paths, for callers who need them:

RankQuantFastscan (#[doc(hidden)] — reachable as ordvec::RankQuantFastscan, but the API is not yet stable) — an optional b=2 FastScan kernel (block-32 PQ-LUT) for absolute-minimum scan latency, at 2× the RankQuant b=2 footprint (dim/2 bytes/doc). Surfaced here so latency-critical callers know it exists.
MultiBucketBitmap (behind --features experimental) — the multi-bucket bilinear-overlap probe behind the research-side decomposition; an algebraic scaffold, not the top-bucket theorem surface or a production path.

The bitmap prefilter has a checked finite model

The Bitmap prefilter scores candidates by popcount(Q AND D) over each document's fixed-size top-bucket set. In the idealized uniform constant-weight null, two unrelated n_top-active bitmaps in dim coordinates overlap hypergeometrically, H(dim, n_top, n_top), with expected overlap n_top² / dim (e.g. 16 at dim = 256, n_top = 64). That makes the null selectivity of an overlap cutoff closed-form.

The current proof story is stronger than a closed-form null alone. Two pieces are machine-checked in Lean 4, both sorry-free on Lean's standard axiom base (propext, Classical.choice, Quot.sound):

the ordinal invariance on which the rank transform rests — that a vector's sorting permutation is unchanged by any strictly monotone reparametrisation of its coordinates — in takens-formalization (theorem isOrdinalPatternOf_comp_strictMono); and
the finite constant-weight bitmap admission model — symmetry makes literal overlap the canonical query-preserving invariant, quotient sufficiency reduces the decision to that evidence, a finite overlap-tilt signal model makes an overlap-count threshold Bayes-optimal among deterministic admission rules, and the uniform constant-weight bitmap null assigns that same threshold event exactly the hypergeometric upper tail — in ordvec-formalization (theorem exists_uniformBitmapOverlapTail_finiteBayesRisk_le_and_hypergeomTail).

This is an in-model result. It proves the rule shape and the idealized finite null under explicit quotient, symmetry, and monotone-overlap assumptions. It does not prove that real encoders satisfy those assumptions, that the textbook hypergeometric is every deployment corpus's null, or that ordinal quotients are representation-complete. Whether true neighbours clear a cutoff remains an empirical contract to measure.

Details in docs/RANK_MODES.md.

Quickstart

[dependencies]
ordvec = "0.4"

# Or, to track unreleased `main`, use a git dependency instead:
# ordvec = { git = "https://github.com/Fieldnote-Echo/ordvec" }

use ordvec::RankQuant;

let dim = 1024;
let mut index = RankQuant::new(dim, 2);   // 2 bits/coord → 256 bytes/doc

// `add` takes a flat, row-major buffer of `dim * n_docs` f32s.
index.add(&doc_embeddings);               // &[f32], len = dim * n_docs

// Asymmetric scan: full-precision queries vs bucketed docs (recommended).
let results = index.search_asymmetric(&query_embeddings, 10); // len = dim * n_queries

let top_ids = results.indices_for_query(0);     // top-10 doc ids for query 0
let top_scores = results.scores_for_query(0);

For the two-stage compressed-scan path (Bitmap / SignBitmap candidate generation → RankQuant rerank) and the full mode comparison, see docs/RANK_MODES.md.

Python

The same Rank / RankQuant / Bitmap / SignBitmap API is available from Python — the bindings ship to PyPI as ordvec:

pip install ordvec

Wheels target CPython 3.10+ (abi3); to build from source instead, see ordvec-python/. The runtime dependency floor is numpy>=2.2.

Threading / concurrency

ordvec supports concurrent read-only/search use. Mutation is exclusive.

Python search, candidate-generation, and scoring methods release the GIL and read NumPy inputs in place. Callers must not mutate query, corpus, candidate, or scoring input arrays passed to those methods until the call returns.

The C ABI allows concurrent search and info calls on one loaded handle. ordvec_index_free must not race with any other call on the same handle.

The Go wrapper serializes Close against Search and Info; after Close, Search and Info return ErrClosed. Callers must not mutate query or candidate slices passed to Search until the call returns.

Documentation

Design deep-dive & reproducible benchmark tables: docs/RANK_MODES.md
Design alternatives evaluated and cut: docs/ALTERNATIVES_CONSIDERED.md
Index-file trust model: docs/INDEX_PROVENANCE.md, docs/determinism.md, THREAT_MODEL.md
Manifest verifier, C ABI, and Go wrapper: ordvec-manifest is versioned and published in lockstep with the core crate through its own package gate; use the GitHub checkout for ordvec-ffi/, ordvec-go/, and docs/c-api.md.
Pre-1.0 compatibility policy: docs/compatibility-policy.md defines the stable, experimental, repo-local sidecar, persisted-format, examples/docs, MSRV, and release-note review surfaces.
Formal proof spine: ordvec-formalization, including its proof-spine, theorem-map, and reviewer brief.
API docs: https://docs.rs/ordvec, https://docs.rs/ordvec-manifest
Paper (OrdVec / RankQuant): link TBD — see Research collaboration.

Benchmarks

Real-embedding retrieval

The current paper-harness run is a real-embedding source-recovery task, not the in-repo synthetic stress test: 207,695 arXiv paper embeddings, 7,200 queries across title / first-sentence / middle-sentence / paraphrase query sets, 1024-D sentence-transformer embeddings, and nDCG@10 / hit@10 / MRR@10 against the source paper id.

The baseline rows use FAISS over L2-normalized FP32 embeddings: IndexFlatIP for dense exact search and IndexHNSWFlat(M=32, efSearch=128) for the tested HNSW configuration. The ordinal rows remove stored dense coordinate magnitudes:

ordinal rank-cosine stores mean-centered, L2-normalized argsort(argsort(.)) rank vectors and queries with the same rank-cosine representation; and
RankQuant b=2 asym stores 2-bit ordinal document codes (256 bytes/vector at dim=1024) and scores FP32 queries with RankQuant::search_asymmetric.

Mode	bytes/vec	nDCG@10	hit@10	MRR@10
FAISS dense exact	4096	0.7817	0.8604	0.7566
ordinal rank-cosine	4096	0.7796	0.8596	0.7542
FAISS HNSW	~4352	0.7756	0.8528	0.7509
RankQuant b=2 asym	256	0.7754	0.8536	0.7506

Paired bootstrap over all 7,200 queries:

ordinal rank-cosine minus FAISS HNSW: +0.00406 nDCG@10, 95% CI [+0.00133, +0.00687]
ordinal rank-cosine minus FAISS dense exact: -0.00205 nDCG@10, 95% CI [-0.00429, +0.00019]
RankQuant b=2 asym minus FAISS HNSW: -0.00014 nDCG@10, 95% CI [-0.00318, +0.00292]

Read narrowly: on this real retrieval task, ordinal structure retains nearly all of the dense retrieval signal, and the 2-bit deployed path matches the tested FAISS HNSW configuration within bootstrap noise at 1/16 the FP32 vector payload. The arXiv artifact set is not shipped in this crate; the self-contained clean-checkout benchmark below is the reproducible stress test.

Synthetic stress test

The head-to-head benchmark generates a seeded synthetic corpus in-process, so the quality numbers (R@10, candidate-recall, bytes/vec) are deterministic and regenerable from a clean checkout with no external corpus file:

cargo run --release --example bench_rank

A few operating points from the committed run (benchmarks/rank_modes_results.txt):

Mode	bytes/vec	p50 (ms)	Mdocs/s	R@10
`Rank` asym (full-precision reference)	512	3.71	8	0.845
`RankQuant` b=4 asym	128	0.31	96	0.806
`RankQuant` b=2 asym	64	0.24	126	0.572
`RankQuant` b=2 FastScan	128	0.09	333	0.570
Two-stage b=2 (M=500, CR=1.000)	96	0.11	275	0.572

One representative run on a synthetic corpus (dim=256, n=30k, seed=1), AMD Ryzen 9 9950X (AVX-512), 32 threads, single-thread scan. R@10 is deterministic run-to-run; throughput/latency vary with hardware and run. R@10 is measured against FP32 brute-force cosine on this generated corpus. Treat it as a small, self-contained kernel and stress-test fixture; the real-embedding retrieval task above is the better guide to retrieval-relevant behaviour.

Scope

ordvec is a library and substrate, not a turnkey service: small ordinal/sign codes, fast SIMD scoring, and a built-in two-stage prefilter — the code-and-scan layer of a retrieval system. It is not a navigable-graph index (HNSW) on its own — yet — and not a serving tier at all: ordvec is the substrate other systems build on, so its small, index-agnostic codes slot under an ANN or sharding layer for large-scale serving rather than replacing it. Encoding is training-free and data-oblivious by design — no codebook fit — so you index the first vector with no prior data and never refit as the corpus grows.

Quality evidence now has two layers: the real-embedding retrieval table above for the paper claim, and the reproducible synthetic stress test for a clean-checkout kernel sanity check.

Security: index-file trust

The on-disk formats (.tvr / .tvrq / .tvbm / .tvsb) carry no built-in checksum, MAC, or signature — by design. The loaders validate structure (magic, version, bounds, exact-length payload) but not origin: a structurally valid file can still be untrusted. If an index file crosses a trust boundary (network transfer, shared storage), verify it before loading. ordvec-manifest binds an index file to a JSON manifest by SHA-256, header metadata, row identity, named auxiliary sidecars, and attestation shape checks. It does not sign artifacts, manage keys, or decide deployment trust policy. No in-format crypto is shipped because it would add key management the library can't own. See docs/PERSISTED_FORMAT.md, docs/INDEX_PROVENANCE.md, and THREAT_MODEL.md in the full repository.

Provenance

ordvec was developed within turbovec, factored out into this standalone, zero-system-dependency crate. turbovec (MIT, by Ryan Codrai) is credited as the project it grew within, with thanks; ordvec's development history is in this repository's git log.

Acknowledgements

Thanks to Todd Baur (@toadkicker) for the sign-cosine intuition and engineering polish.

Thanks to Mike Singleton (@singleton2787) for mathematical assistance and mentorship.

Research collaboration

ordvec is the reference implementation for an in-progress paper on ordinal retrieval — using the rank and sign structure of embeddings, rather than their floating-point magnitudes, as the retrieval signal. The repository is open specifically to grow a group of collaborators, including potential named co-authorship where contributions meet the paper's authorship bar — a different invitation than "send a PR." Collaboration we're actively seeking:

Real-corpus evaluation — running the modes against public corpora (GloVe, MTEB / BEIR, OpenAI embedding dumps) beyond the synthetic benchmark.
Theory — extending and independently auditing the sorry-free Lean formalization, especially the finite bitmap proof spine, rank-cosine invariants, and empirical diagnostics for when real encoders meet or violate the model assumptions.
Independent reproduction — re-running the benchmark on other hardware and reporting the numbers.

If that's your area, see GOVERNANCE.md and open an issue or a discussion.

Contributing

Contributions to the code, the docs, and the paper are all welcome — see CONTRIBUTING.md.

Minimum supported Rust version

ordvec's MSRV is Rust 1.89 — the release that stabilized the specific AVX-512 intrinsics the SIMD kernels compile against (it also clears the 1.87 floor from is_multiple_of). Because the kernels are built against those intrinsics, this is a hard compile floor, not just a convenience pin: a toolchain below 1.89 won't build the crate. Raising the MSRV is treated as a minor-version change under the compatibility policy.

License

Licensed under either of

MIT License (LICENSE-MIT)
Apache License, Version 2.0 (LICENSE-APACHE-2.0)

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Name		Name	Last commit message	Last commit date
Latest commit History 492 Commits
.github		.github
benchmarks		benchmarks
docs		docs
examples		examples
fuzz		fuzz
ordvec-ffi		ordvec-ffi
ordvec-go		ordvec-go
ordvec-manifest-python		ordvec-manifest-python
ordvec-manifest		ordvec-manifest
ordvec-python		ordvec-python
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
DCO		DCO
GOVERNANCE.md		GOVERNANCE.md
LICENSE-APACHE-2.0		LICENSE-APACHE-2.0
LICENSE-MIT		LICENSE-MIT
README.md		README.md
RELEASING.md		RELEASING.md
ROADMAP.md		ROADMAP.md
SECURITY.md		SECURITY.md
THREAT_MODEL.md		THREAT_MODEL.md
cliff.toml		cliff.toml
codecov.yml		codecov.yml
deny.toml		deny.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ordvec

What's different

Ordinal index family

The bitmap prefilter has a checked finite model

Quickstart

Python

Threading / concurrency

Documentation

Benchmarks

Real-embedding retrieval

Synthetic stress test

Scope

Security: index-file trust

Provenance

Acknowledgements

Research collaboration

Contributing

Minimum supported Rust version

License

Contribution

About

Licenses found

Uh oh!

Releases 3

Sponsor this project

Uh oh!

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

ordvec

What's different

Ordinal index family

The bitmap prefilter has a checked finite model

Quickstart

Python

Threading / concurrency

Documentation

Benchmarks

Real-embedding retrieval

Synthetic stress test

Scope

Security: index-file trust

Provenance

Acknowledgements

Research collaboration

Contributing

Minimum supported Rust version

License

Contribution

About

Topics

Resources

License

Licenses found

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 3

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages