bi-encoder

Here are 17 public repositories matching this topic...

svjack / Sbert-ChineseExample

Sentence-Transformers Information Retrieval example on Chinese

elasticsearch pandas pytorch indexer chinese cos bm25 sentence-embeddings faiss sentence-transformers bi-encoder cross-encoder

Updated Feb 18, 2024
Python

lemuria-wchen / CFC

Code and created datasets for our ACL 2022 paper: "Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain Conversations"

knowledge-distillation retrieval-chatbot bi-encoder dense-retrieval

Updated Jun 20, 2022
Python

ntphuc149 / ViIR

ViIR: A unified framework for fine-tuning Vietnamese information retrieval models with various tuning strategies.

information-retrieval vietnamese baseline fine-tuning hard-negative-mining plms bi-encoder positive-pair

Updated May 25, 2025
Python

rasyosef / amharic-neural-ir

Official codebase for the ACL 2026 MeLLM Workshop paper "The Multilingual Curse at the Retrieval Layer: Evidence from Amharic"

natural-language-processing information-retrieval roberta amharic-nlp colbert low-resource-nlp bi-encoder cross-encoder splade

Updated Jun 11, 2026
Jupyter Notebook

hsq-end / job_classification

High-accuracy job classification system using Sentence Transformers. Maps job titles & descriptions to 1,016 O*NET-SOC categories. 100% Top-1 accuracy on real job postings. Fast CPU inference (<100ms). 126K+ training samples from 8 O*NET data sources.

nlp machine-learning pytorch semantic-similarity hr-tech sentence-transformers bi-encoder job-classification

Updated May 8, 2026
Python

Madhvansh / Neural-E-Commerce-Search

Two-stage retrieve-and-rank neural product search on Amazon ESCI: a dense bi-encoder retriever with hard-negative mining + a DeBERTa cross-encoder reranker over Exact/Substitute/Complement/Irrelevant labels. NDCG@10 0.71 (+16% vs BM25), 0.74 micro-F1.

information-retrieval transformers pytorch e-commerce learning-to-rank semantic-search ndcg reranking faiss fastapi product-search neural-search sentence-transformers deberta bi-encoder cross-encoder amazon-esci

Updated Jun 14, 2026
Python
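
The two-stage retrieve-and-rank pattern described in the entry above can be sketched roughly as follows. This is a minimal illustration and not the repository's code: `embed` and `joint_score` are hypothetical stand-ins for a trained bi-encoder and cross-encoder, and the toy documents are made up. The point is only the control flow: a cheap first stage scores independently encoded texts, and a costlier second stage rescoring each (query, document) pair reorders the shortlist.

```python
from collections import Counter


def embed(text):
    """Hypothetical stand-in for a bi-encoder: each text is encoded on its own."""
    return Counter(text.lower().split())


def dot_score(q_vec, d_vec):
    """First-stage score: dot product of two independently computed vectors."""
    return sum(q_vec[t] * d_vec[t] for t in q_vec)


def joint_score(query, doc):
    """Hypothetical stand-in for a cross-encoder: looks at the pair together.

    Score = fraction of query tokens found in the document, plus a bonus
    when the whole query occurs as a phrase.
    """
    q_tokens = query.lower().split()
    d_tokens = set(doc.lower().split())
    overlap = sum(1 for t in q_tokens if t in d_tokens) / len(q_tokens)
    phrase_bonus = 1.0 if query.lower() in doc.lower() else 0.0
    return overlap + phrase_bonus


def retrieve_and_rerank(query, docs, retrieve_k=3, final_k=2):
    q_vec = embed(query)
    doc_vecs = [embed(d) for d in docs]  # in practice precomputed offline
    # Stage 1: shortlist by the cheap, independently encoded score.
    candidates = sorted(
        range(len(docs)),
        key=lambda i: dot_score(q_vec, doc_vecs[i]),
        reverse=True,
    )[:retrieve_k]
    # Stage 2: rerank only the shortlist with the pairwise score.
    reranked = sorted(
        candidates, key=lambda i: joint_score(query, docs[i]), reverse=True
    )
    return [docs[i] for i in reranked[:final_k]]


docs = [
    "charger charger charger for laptop and phone",
    "wireless phone charger pad",
    "phone case with card holder",
    "garden hose reel",
]
top = retrieve_and_rerank("phone charger", docs)
print(top)
```

In this toy run the first document wins stage one on raw term counts, while the second stage promotes the document that actually contains the query phrase.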

jhondados / neural-search-engine

Neural search engine with bi-encoder + cross-encoder re-ranking, BM25 hybrid search, query expansion, typo correction and multi-language support — processes 10M queries/day

python elasticsearch transformers bm25 neural-search hybrid-search bi-encoder cross-encoder

Updated Jun 10, 2026
HTML

ericphann / search-for-movie-plots

Baseline models for searching for movie plots from Wikipedia articles. Techniques include BM25 (lexical search), bi/cross-encoding (semantic search), and retrieval-augmented generation (RAG) using Mistral 7B through Fireworks.ai.

tf-idf semantic-search fireworks bm25 mistral keyword-search rag bi-encoder cross-encoder lexical-search retrieval-augmented-generation mistral-7b

Updated Jul 22, 2024
Jupyter Notebook

stefanmzeidler / Medical-Journal-Article-Summarizer

Proof of concept for large language model summarization of medical journal articles for different reading levels

natural-language-processing information-retrieval medicine transformers embeddings gemini database-management gemini-api sbert bi-encoder cross-encoder large-language-models generative-ai retrieval-augmented-generation

Updated Jun 7, 2026
Python

stormtroober / GliNerBioMed-Label-SoftPrompt

Comparative study of parameter-efficient fine-tuning (PEFT) strategies for biomedical NER on top of GLiNER — including soft prompt tuning, embedding injection, and a custom in-place embedding extension that matches full fine-tuning performance at 13% of trainable parameters.

transformers named-entity-recognition optuna bi-encoder cross-encoder prompt-tuning biomedical-nlp parameter-efficient-fine-tuning gliner soft-prompting

Updated Mar 18, 2026
Python

MohamedNassih / Evaluation-Pertinence-Juridique-ML

Relevance evaluation (question ↔ legal article) in French. Full pipeline (preparation → models → submission) with CamemBERT as a calibrated bi-encoder (MSE/Spearman), plus cross-encoder variants.

nlp machine-learning deep-learning transformers pytorch relevance tokenization sentence-embeddings document-ranking camembert bi-encoder cross-encoder information-retrieval

Updated Nov 5, 2025
Python

santicam06 / Semantic-Search-Engine

Powered by a catalog of 190+ products, this engine delivers high-precision results using semantic embeddings and vector similarity principles. By mapping product data into high-dimensional space and calculating the cosine similarity between search queries and items, it identifies matches based on intent and meaning rather than just keywords.

search-engine embeddings cosine-similarity bi-encoder

Updated Jun 6, 2026
Python
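
As a rough illustration of the cosine-similarity matching described in the entry above, the sketch below ranks catalog items against a query vector. It is not taken from the repository: the three-dimensional vectors and product names are made-up placeholders for real embeddings, and `rank_items` is a hypothetical helper name.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_items(query_vec, item_vecs, top_k=3):
    """Return the top_k (name, score) pairs, highest similarity first."""
    scored = [
        (name, cosine_similarity(query_vec, vec))
        for name, vec in item_vecs.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Made-up embeddings; a real system would obtain these from an encoder model.
catalog = {
    "running shoes": [0.9, 0.1, 0.0],
    "trail sneakers": [0.8, 0.3, 0.1],
    "coffee maker": [0.0, 0.2, 0.9],
}
query = [1.0, 0.2, 0.0]
results = rank_items(query, catalog, top_k=2)
print([name for name, _ in results])
```

Because cosine similarity ignores vector length, items are matched on direction in the embedding space, which is what lets two differently worded texts with similar meaning score highly.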

nking / recommender_systems

Recommendation systems overview and an MLOps TFX-pipeline implementation

monitoring dnn mlops tfxpipelines bi-encoder custom-beam-components

Updated Jun 16, 2026
Python

smqd19 / RAG_AI_Approach

InsureLLM RAG Challenge — Two-stage retrieval pipeline (Bi-Encoder + Cross-Encoder) with context compression

rag bi-encoder cross-encoder llm langchain retrieval-augmented-generation

Updated Oct 30, 2025
Python

mellowfarm / tiny-search

Tiny semantic search engine with bi-encoder retrieval and cross-encoder reranking — built to understand how production search works under the hood.

search bi-encoder cross-encoder

Updated Jun 1, 2026
HTML

StefanHeng / Zeroshot-Text-Classification

Exploring fast & accurate zero-shot text classification

natural-language-processing text-classification transformer zero-shot-learning transformer-encoder sentence-transformers zero-shot-classification bi-encoder

Updated May 17, 2023
Python

MohanKrishnaGR / bert-bi-encoder-depth-ablation

Controlled depth ablation of a BERT bi-encoder across training budgets and seeds on three BEIR tasks (nfcorpus, scifact, fiqa). L3–L12 is flat within seed noise at 20K steps; 80K training degrades every depth on zero-shot transfer (−45% NDCG@10 on fiqa for L12).

nlp information-retrieval pytorch embeddings reproducibility bert neural-ir ablation-studies sentence-transformers zero-shot-retrieval bi-encoder dense-retrieval beir ms-marco depth-ablation

Updated Apr 24, 2026
Python
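
For readers unfamiliar with the NDCG@10 metric quoted in several entries above, here is a minimal sketch of one common formulation. It assumes linear gain (some evaluations use 2^rel - 1 instead) and computes the ideal ordering from the labels of the ranked list itself, which is only exact when every relevant document appears in that list. It is not the evaluation code of any repository listed here.

```python
import math


def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the first k positions (linear gain)."""
    return sum(
        rel / math.log2(rank + 2)  # rank is 0-based, so position 1 divides by log2(2) = 1
        for rank, rel in enumerate(relevances[:k])
    )


def ndcg_at_k(relevances, k=10):
    """DCG normalized by the DCG of the best possible ordering of the same labels."""
    ideal = sorted(relevances, reverse=True)
    ideal_dcg = dcg_at_k(ideal, k)
    if ideal_dcg == 0.0:
        return 0.0  # no relevant items: define NDCG as 0
    return dcg_at_k(relevances, k) / ideal_dcg


# Relevance labels in the order the system returned the documents.
perfect = ndcg_at_k([1, 0, 0])
swapped = ndcg_at_k([0, 1])
print(perfect, swapped)
```

A ranking that places the only relevant document first scores 1.0, and pushing it to the second position divides its gain by log2(3).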
