fix: pull onnx models from huggingface instead of GCS (WIP) by nleroy917 · Pull Request #36 · Anush008/fastembed-js

nleroy917 · 2025-12-17T01:09:40Z

This prioritizes Hugging Face weights over the ones stored in Google Cloud Storage (GCS). The reason is twofold: 1) the Python fastembed implementation does this, and 2) we shouldn't point at GCS. When people use sentence-transformers and fastembed, they expect the embeddings to be the same. We have zero control over what weights are published for all-MiniLM-L6-v2 or bge-small-en-v1.5, so we should point to Hugging Face as the source of truth.

This addresses some of the issues in #30.
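
Since #30 concerns embeddings that differ from the Python version, one way to check parity is to compare a vector produced here against a reference vector from Python. The sketch below is only an illustration: `cosineSimilarity`, `embeddingsMatch`, and the `1e-4` tolerance are hypothetical names and values, not part of fastembed-js or this PR.

```typescript
// Hypothetical helper (not a fastembed-js API): cosine similarity of two embeddings.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error(`dimension mismatch: ${a.length} vs ${b.length}`);
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // A zero vector yields NaN here, which embeddingsMatch treats as a mismatch.
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Hypothetical check: true when the two vectors point in (almost) the same direction.
// The default tolerance is an assumed value; tune it for the model and runtime in use.
function embeddingsMatch(a: number[], b: number[], tolerance = 1e-4): boolean {
  return 1 - cosineSimilarity(a, b) <= tolerance;
}
```

In practice `a` would be the embedding produced by this module and `b` the one produced by the Python implementation for the same input text.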

However, some of the models this module supports (bge-small-zh, bge-small-en) don't actually have ONNX weights on Hugging Face, so that could be a problem.
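
A minimal sketch of the source ordering described above, assuming a registry entry that may carry a Hugging Face repo, a GCS URL, or both. The `ModelSource` shape, the `candidateUrls` helper, the `model.onnx` default, and the example repo and bucket names are all hypothetical and are not the schema used in this PR. Only the `https://huggingface.co/<repo>/resolve/<revision>/<file>` URL layout is Hugging Face's standard download path.

```typescript
// Hypothetical registry entry; field names are assumptions, not the PR's actual schema.
interface ModelSource {
  hfRepo?: string; // Hugging Face repo id ("org/model"); omitted when no ONNX weights exist there
  hfFile?: string; // path of the ONNX file inside the repo
  gcsUrl?: string; // legacy GCS archive URL, kept as a fallback
}

// Returns download URLs in priority order: Hugging Face first, then GCS.
function candidateUrls(source: ModelSource, revision = "main"): string[] {
  const urls: string[] = [];
  if (source.hfRepo) {
    // "model.onnx" is an assumed default; real repos may store the file elsewhere.
    const file = source.hfFile ?? "model.onnx";
    urls.push(`https://huggingface.co/${source.hfRepo}/resolve/${revision}/${file}`);
  }
  if (source.gcsUrl) {
    urls.push(source.gcsUrl);
  }
  return urls;
}

// Example with placeholder names: a model available from both sources.
const urls = candidateUrls({
  hfRepo: "example-org/example-model",
  hfFile: "onnx/model.onnx",
  gcsUrl: "https://storage.googleapis.com/example-bucket/example-model.tar.gz",
});
```

A model with no ONNX weights on Hugging Face would simply omit `hfRepo`, so the GCS URL becomes the only candidate. Whether to keep such a fallback at all is a design decision for the PR.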

nleroy917 added 7 commits December 16, 2025 19:39

add a dense model registry file

b444c86

more updates to migrate the models to hf

6725abe

custom model work

00fe832

overloaded init function update

00c1fe1

bump version

4a6c0c6

add custom hf test

4bab647

update canonical values

d78f5c0

nleroy917 changed the title ~~Pull onnx models from HF~~ fix: pull onnx models from huggingface instead of GCS (WIP) Dec 17, 2025

nleroy917 mentioned this pull request Dec 17, 2025

Different embedding vectors results than with Python version #30

Open

Anush008 marked this pull request as draft December 17, 2025 12:11

nleroy917 wants to merge 7 commits into Anush008:main from nleroy917:main
