docs: add FastembedColbertRanker to fastembed integration page#442

Merged
anakin87 merged 3 commits into deepset-ai:main from dina-deifallah:docs/add-fastembed-colbert-ranker
Apr 16, 2026
Conversation

@dina-deifallah
Contributor

Summary

Updates the fastembed integration page to include the new FastembedColbertRanker component from deepset-ai/haystack-core-integrations#3135.

  • Added FastembedColbertRanker to the components list alongside FastembedRanker
  • Added a new "Example with ColBERT ranker" section with a working pipeline code snippet
  • Added a note on unnormalized ColBERT scores

Related

🤖 Generated with Claude Code

Updates the fastembed integration page to include the new
FastembedColbertRanker component from PR #3135, which adds
ColBERT late-interaction reranking support via fastembed.

- Added FastembedColbertRanker to the components list
- Added a usage example with ColBERT ranker in a pipeline

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@dina-deifallah dina-deifallah requested a review from a team as a code owner April 13, 2026 11:49
@kacperlukawski kacperlukawski self-requested a review April 13, 2026 13:11
Member

@kacperlukawski kacperlukawski left a comment


Hey, thanks for contributing @dina-deifallah! I have some minor comments. One question: do we allow choosing a metric used to calculate the maxsim score in the implementation? If so, it would be great to mention the parameter in the example.

Comment thread: integrations/fastembed.md (Outdated)

### Example with ColBERT ranker

`FastembedColbertRanker` uses ColBERT late-interaction scoring: the query and documents are encoded independently into token-level embeddings, and a MaxSim score is computed for each document. This offers stronger ranking quality than cross-encoders on many tasks while remaining efficient.
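For illustration, here is a minimal pure-Python sketch of the MaxSim scoring described above. The token embeddings are toy values (not fastembed output), and the helper names are hypothetical; it only shows the per-query-token max followed by a sum over query tokens:

```python
# Toy illustration of ColBERT-style MaxSim scoring.
# Each text is represented as a list of token-level embeddings;
# all vectors below are made-up values, not real model output.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def maxsim(query_tokens, doc_tokens):
    # For each query token, take the similarity to its best-matching
    # document token, then sum these maxima over all query tokens.
    return sum(max(dot(q, d) for d in doc_tokens) for q in query_tokens)

query = [[1.0, 0.0], [0.0, 1.0]]   # two query token embeddings
doc_a = [[0.9, 0.1], [0.2, 0.8]]   # tokens align well with the query
doc_b = [[0.1, 0.1], [0.0, 0.2]]   # weaker alignment

scores = {"doc_a": maxsim(query, doc_a), "doc_b": maxsim(query, doc_b)}
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked)  # doc_a ranks above doc_b
```

Because each document is encoded independently of the query, document embeddings can be precomputed, which is what makes late interaction cheaper than a cross-encoder at query time.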
Member


According to the benchmarks, cross-encoders are stronger than late-interaction models. Models such as ColBERT are used as a middle ground due to their scalability and performance.

Contributor Author


Thanks for the feedback Kacper. That makes sense. I committed your suggested changes.

Contributor Author


I took a look at the ColBERTv2 paper to check about the metric. The current implementation doesn't expose a metric parameter. It uses dot product similarity via np.matmul, which is the standard ColBERT approach. Since fastembed's LateInteractionTextEmbedding returns L2-normalized embeddings (as described in the ColBERTv2 paper, Santhanam et al., 2021 — https://arxiv.org/abs/2112.01488), dot product and cosine similarity are mathematically equivalent for these models, so the ranking order is the same regardless of metric. For that reason, no metric parameter is needed for the current supported models.
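The equivalence claimed here is easy to check numerically: for L2-normalized vectors, cosine similarity and dot product coincide, so any ranking derived from either is identical. A small pure-Python sketch with toy vectors (not fastembed output):

```python
import math

def normalize(v):
    # Scale a vector to unit L2 norm.
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cosine(u, v):
    return dot(u, v) / (math.sqrt(dot(u, u)) * math.sqrt(dot(v, v)))

a = normalize([3.0, 4.0])
b = normalize([1.0, 2.0])

# For unit-length vectors the denominator in cosine() is 1,
# so the two metrics return the same value.
same = abs(dot(a, b) - cosine(a, b)) < 1e-12
print(same)  # True
```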

Comment thread: integrations/fastembed.md (Outdated)
Co-authored-by: Kacper Łukawski <kacperlukawski@users.noreply.github.com>
Member

@kacperlukawski kacperlukawski left a comment


LGTM, but I assume a corresponding PR in the integration itself has to be merged first

@anakin87
Member

@dina-deifallah @kacperlukawski

This component has been renamed to FastembedLateInteractionRanker. So I updated the current PR in 9745bcb.

In the meantime, we also released a new version of the integration, containing this new component: https://pypi.org/project/fastembed-haystack/2.2.0/

Merging this PR now

