-
Notifications
You must be signed in to change notification settings - Fork 400
Pull requests: huggingface/text-embeddings-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix
max_position_embeddings handling in NomicBertConfig and FlashNomicBertConfig
#876
opened Jun 18, 2026 by
alvarobartt
Member
Loading…
1 of 5 tasks
fix(docker): detect CUDA version from
CUDA UMD Version header on driver 6xx (#870)
#871
opened Jun 3, 2026 by
Anai-Guo
Loading…
refactor: streamline tensor initialization and rearrange struct fields for clarity
#869
opened May 26, 2026 by
Unmesh100
Loading…
chore: enable Dependabot weekly GitHub Actions bumps
dependabot
#868
opened May 26, 2026 by
hf-dependantbot-rollout
Bot
Loading…
Support modular Sentence Transformers cross-encoder rerankers (e.g. ettin-reranker)
#867
opened May 25, 2026 by
hotchpotch
Loading…
3 of 5 tasks
feat: ROCm flash-attn varlen, triton layer norm, and AMD Dockerfile
#860
opened Apr 9, 2026 by
Abdennacer-Badaoui
Member
Loading…
Add repository cloning step for local installation
#781
opened Dec 19, 2025 by
smedegaard
Loading…
1 of 5 tasks
feat: add varlen attention on cpu
#777
opened Dec 17, 2025 by
michaelfeil
Contributor
•
Draft
5 tasks
candle: health check by queuing on cuda
#775
opened Dec 17, 2025 by
michaelfeil
Contributor
Loading…
5 tasks
Add Support for XProvence Sentence-Level Context Pruning (naver/xprovence-reranker-bgem3-v1)
#770
opened Dec 4, 2025 by
sigridjineth
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.