Conversation
e0e7441 to
a0bb9fc
Compare
| github: superlinked | ||
| linkedin: superlinked | ||
| pypi: https://pypi.org/project/sie-haystack/ | ||
| repo: https://github.com/superlinked/sie |
There was a problem hiding this comment.
It is supposed to point to the integration package itself, not the main project.
| from haystack import Document, Pipeline | ||
| from haystack.components.retrievers.in_memory import InMemoryEmbeddingRetriever | ||
| from haystack.document_stores.in_memory import InMemoryDocumentStore | ||
| from sie_haystack import SIEDocumentEmbedder, SIETextEmbedder |
There was a problem hiding this comment.
It's a bit more related to the integration itself, but a common convention is to put all the embedding integrations under haystack_integrations.components.embedders.INTEGRATION_NAME. It might be easier to navigate for Haystack users, as they are used to that structure.
| `sie-haystack` exposes Haystack-native components for every SIE capability: | ||
|
|
||
| **Dense embedding** | ||
| - `SIETextEmbedder` - embed a query string | ||
| - `SIEDocumentEmbedder` - embed a list of `Document`s (supports `meta_fields_to_embed`) | ||
|
|
||
| **Sparse embedding** (for hybrid search) | ||
| - `SIESparseTextEmbedder` | ||
| - `SIESparseDocumentEmbedder` | ||
|
|
||
| **Multivector / ColBERT** | ||
| - `SIEMultivectorTextEmbedder` | ||
| - `SIEMultivectorDocumentEmbedder` | ||
|
|
||
| **Multimodal** | ||
| - `SIEImageEmbedder` - CLIP, SigLIP, ColPali | ||
|
|
||
| **Reranking** | ||
| - `SIERanker` - reranking with cross-encoder models (e.g. `jinaai/jina-reranker-v2-base-multilingual`) or ColBERT / late-interaction models (e.g. `jinaai/jina-colbert-v2`) | ||
|
|
||
| **Extraction** | ||
| - `SIEExtractor` - zero-shot entity extraction (GLiNER), relation extraction (GLiREL), classification (GLiClass), and object detection (GroundingDINO, OWL-v2) |
There was a problem hiding this comment.
I can't find the sources of the package, so I couldn't see how the other components would fit Haystack's pipelines/agents. It would be great to have a complete documentation using all the components provided.
There was a problem hiding this comment.
as in you mean for these to lead to Superlinked's docs where the integrations are described?
c28ccc8 to
d25ce66
Compare
Adds an integration page for Superlinked (SIE), a self-hosted inference engine for embeddings, reranking, and extraction. The sie-haystack package provides Haystack 2.0 components for 85+ embedding models (dense, sparse, multivector/ColBERT, multimodal), cross-encoder and late-interaction reranking, and zero-shot entity/relation/classification extraction.
d25ce66 to
727cdde
Compare
Summary
Adds an integration page for Superlinked (SIE), a self-hosted inference engine for embeddings, reranking, and extraction. The
sie-haystackpackage exposes Haystack 2.0 components for 85+ embedding models (dense, sparse, multivector/ColBERT, multimodal), cross-encoder reranking, and zero-shot entity/relation/classification extraction — all backed by a single SIE endpoint.Files added
integrations/superlinked.md— integration page with install, usage example, and component referencelogos/superlinked.png— logo assetType
Model Provider— matches the convention used by Cohere, Jina, Voyage, and Fastembed integrations, which similarly wrap an external embedding/reranking backend behind Haystack embedder and ranker components.Links
Opened as a draft for maintainer review of placement,
type:choice, and frontmatter. Happy to adjust.