Instructions to use korben99/bne-float-384 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use korben99/bne-float-384 with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("korben99/bne-float-384") sentences = [ "That is a happy person", "That is a happy dog", "That is a very happy person", "Today is a sunny day" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Notebooks
- Google Colab
- Kaggle
| language: en | |
| license: mit | |
| tags: | |
| - sentence-transformers | |
| - embeddings | |
| - bert | |
| pipeline_tag: sentence-similarity | |
| # bne-float-384 | |
| Float32 baseline for the **Binary Native Embeddings** project. | |
| - Backbone: `prajjwal1/bert-mini` (4L × 256d, ~11M params) | |
| - Output: 384-dim float32 via Linear(256→384) + mean pooling | |
| - Training: MultipleNegativesRankingLoss on NLI 550k pairs, 3 epochs | |
| | STS-B Spearman | Recall@10 (SciFact) | Memory / 1k vecs | | |
| |---|---|---| | |
| | 0.7355 | 0.3131 | 1.46 MB | | |
| Part of [binary-native-embeddings-for-CPU-Retrieval](https://github.com/korben99/binary-native-embeddings-for-CPU-Retrieval) · [Discussion](https://discuss.huggingface.co/t/native-binary-embeddings-experiment-curious-about-your-thoughts/177107) | |
| ## Usage | |
| ```python | |
| import torch | |
| from transformers import BertTokenizer | |
| from huggingface_hub import hf_hub_download | |
| from models.float_embedder import FloatEmbedder | |
| tokenizer = BertTokenizer.from_pretrained("prajjwal1/bert-mini") | |
| model = FloatEmbedder(output_dim=384) | |
| weights = hf_hub_download("korben99/bne-float-384", "float_embedder.pt") | |
| model.load_state_dict(torch.load(weights, map_location="cpu")) | |
| model.eval() | |
| vecs = model.encode(["hello world"], tokenizer) # (1, 384) float32 | |
| ``` | |