|
library_name: transformers
tags:
- sentence-transformers
datasets:
- codefuse-ai/F2LLM-v2
---
# F2LLM-v2-8B-Preview

**F2LLM-v2-8B-Preview** is a multilingual embedding model trained from Qwen3-8B on a corpus of **27 million samples**, spanning **over 100 natural and programming languages**. It is a "preview" version trained without instructions, intended to serve as a foundation for downstream embedding tasks and further fine-tuning.

F2LLM-v2 is fully open: we release base models in 5 sizes, instruct models in 8 sizes, the training data, the training code, and intermediate checkpoints. The three smallest instruct models are pruned and trained from the 0.6B base model.

| Model | Base | Instruct |
| ----- | ---- | -------- |
| 80M | | [🤗F2LLM-v2-80M](https://huggingface.co/codefuse-ai/F2LLM-v2-80M) |
| 160M | | [🤗F2LLM-v2-160M](https://huggingface.co/codefuse-ai/F2LLM-v2-160M) |
| 330M | | [🤗F2LLM-v2-330M](https://huggingface.co/codefuse-ai/F2LLM-v2-330M) |
| 0.6B | [🤗F2LLM-v2-0.6B-Preview](https://huggingface.co/codefuse-ai/F2LLM-v2-0.6B-Preview) | [🤗F2LLM-v2-0.6B](https://huggingface.co/codefuse-ai/F2LLM-v2-0.6B) |
| 1.7B | [🤗F2LLM-v2-1.7B-Preview](https://huggingface.co/codefuse-ai/F2LLM-v2-1.7B-Preview) | [🤗F2LLM-v2-1.7B](https://huggingface.co/codefuse-ai/F2LLM-v2-1.7B) |
| 4B | [🤗F2LLM-v2-4B-Preview](https://huggingface.co/codefuse-ai/F2LLM-v2-4B-Preview) | [🤗F2LLM-v2-4B](https://huggingface.co/codefuse-ai/F2LLM-v2-4B) |
| 8B | [🤗F2LLM-v2-8B-Preview](https://huggingface.co/codefuse-ai/F2LLM-v2-8B-Preview) | [🤗F2LLM-v2-8B](https://huggingface.co/codefuse-ai/F2LLM-v2-8B) |
| 14B | [🤗F2LLM-v2-14B-Preview](https://huggingface.co/codefuse-ai/F2LLM-v2-14B-Preview) | [🤗F2LLM-v2-14B](https://huggingface.co/codefuse-ai/F2LLM-v2-14B) |
## Usage

### With Sentence Transformers
## Intermediate Checkpoints

To facilitate future research, we release intermediate checkpoints in the `intermediate_checkpoints` branch.
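A short sketch of how to discover the checkpoint branch and load from it (standard `huggingface_hub` / Sentence Transformers `revision` mechanics; the branch name is from this card, everything else is generic):

```python
from huggingface_hub import HfApi

# List the branches of this repo; the intermediate checkpoints
# live on the `intermediate_checkpoints` branch rather than `main`.
api = HfApi()
refs = api.list_repo_refs("codefuse-ai/F2LLM-v2-8B-Preview")
branch_names = [branch.name for branch in refs.branches]
print(branch_names)

# To load from that branch instead of `main`, pass it as `revision`:
#   SentenceTransformer("codefuse-ai/F2LLM-v2-8B-Preview",
#                       revision="intermediate_checkpoints")
```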