Improve model card

#4 by nielsr (HF Staff)

Files changed (1):
  1. README.md +8 -3

README.md CHANGED
@@ -1,8 +1,14 @@
+ ---
+ library_name: transformers
+ pipeline_tag: text-generation
+ license: apache-2.0
+ ---
+
  # Introduction
 
  [OpenBMB Technical Blog Series](https://openbmb.vercel.app/)
 
- The MiniCPM-MoE-8x2B is a decoder-only transformer-based generative language model.
+ The MiniCPM-MoE-8x2B is a decoder-only transformer-based generative language model. For more details, please refer to our [github repo](https://github.com/OpenBMB/MiniCPM). Also check out the [project page](https://openbmb.vercel.app/?category=Chinese+Blog). The model is described in the paper [MiniCPM4: Ultra-Efficient LLMs on End Devices](https://huggingface.co/papers/2506.07900).
 
  The MiniCPM-MoE-8x2B adopt a Mixture-of-Experts(MoE) architecture, which has 8 experts per layer and activates 2 of 8 experts for each token.
 
@@ -30,5 +36,4 @@ print(responds)
  1. As a language model, MiniCPM-MoE-8x2B generates content by learning from a vast amount of text.
  2. However, it does not possess the ability to comprehend or express personal opinions or value judgments.
  3. Any content generated by MiniCPM-MoE-8x2B does not represent the viewpoints or positions of the model developers.
- 4. Therefore, when using content generated by MiniCPM-MoE-8x2B, users should take full responsibility for evaluating and verifying it on their own.
-
+ 4. Therefore, when using content generated by MiniCPM-MoE-8x2B, users should take full responsibility for evaluating and verifying it on their own.
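As context for the architecture line left unchanged above ("8 experts per layer and activates 2 of 8 experts for each token"), here is a minimal, hypothetical sketch of top-2 expert routing. The layer sizes and module names are illustrative only and are not taken from the MiniCPM-MoE-8x2B implementation:

```python
import torch
import torch.nn.functional as F

# Illustrative top-2 routing as the card describes it: 8 experts per layer,
# 2 of 8 activated per token. Sizes and names are hypothetical, not the
# actual MiniCPM-MoE-8x2B code.
num_experts, top_k, hidden = 8, 2, 512

experts = torch.nn.ModuleList(
    [torch.nn.Linear(hidden, hidden) for _ in range(num_experts)]
)
router = torch.nn.Linear(hidden, num_experts)

def moe_layer(x: torch.Tensor) -> torch.Tensor:
    """x: (num_tokens, hidden) -> (num_tokens, hidden)."""
    weights, idx = router(x).topk(top_k, dim=-1)  # choose 2 of 8 experts per token
    weights = F.softmax(weights, dim=-1)          # renormalize over the chosen 2
    out = torch.zeros_like(x)
    for slot in range(top_k):
        for e in range(num_experts):
            mask = idx[:, slot] == e              # tokens routed to expert e in this slot
            if mask.any():
                out[mask] += weights[mask, slot].unsqueeze(-1) * experts[e](x[mask])
    return out

print(moe_layer(torch.randn(4, hidden)).shape)    # torch.Size([4, 512])
```

Only the two selected expert MLPs run for each token, so per-token compute scales with the two active experts rather than all eight.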
 
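The added `pipeline_tag: text-generation` metadata matches the usage snippet already in the card (its `print(responds)` line anchors the second hunk). A minimal sketch of that flow, assuming the `openbmb/MiniCPM-MoE-8x2B` repo id and the custom `chat` helper shipped with the model's remote code; both are assumptions from the MiniCPM family, not part of this diff:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Repo id and chat() signature assumed from the MiniCPM family README; the
# model ships custom modeling code, hence trust_remote_code=True.
path = "openbmb/MiniCPM-MoE-8x2B"
tokenizer = AutoTokenizer.from_pretrained(path)
model = AutoModelForCausalLM.from_pretrained(
    path, torch_dtype=torch.bfloat16, device_map="auto", trust_remote_code=True
)

responds, history = model.chat(tokenizer, "What is a Mixture-of-Experts model?")
print(responds)
```

The rest of the new front matter only affects Hub indexing: `library_name` selects the transformers code-snippet widget and `license` surfaces Apache-2.0 on the model page.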