Hugging Face
MiniLLM
community
https://github.com/microsoft/LMOps/tree/main/minillm
t1101675
AI & ML interests
Training efficient language models (MiniLLM, MiniPLM)
Team members
1
MiniLLM's models (50)
Sort: Recently updated
MiniLLM/MiniLLM-gpt2-340M • Text Generation • Updated Apr 11, 2025 • 376 • 6
MiniLLM/SFT-gpt2-120M • Text Generation • 0.1B • Updated Mar 25, 2025 • 398
MiniLLM/SFT-gpt2-760M • Text Generation • 0.8B • Updated Mar 25, 2025 • 5
MiniLLM/MiniPLM-Qwen-500M • Text Generation • 0.5B • Updated Mar 25, 2025 • 14 • 7
MiniLLM/MiniPLM-llama3.1-212M • Text Generation • 0.2B • Updated Mar 25, 2025 • 14 • 6
MiniLLM/MiniPLM-Mamba-130M • Text Generation • 0.1B • Updated Mar 25, 2025 • 14 • 4
MiniLLM/MiniPLM-Qwen-1.2B • Text Generation • 1B • Updated Mar 25, 2025 • 9 • 4
MiniLLM/Ref-Pretrain-Qwen-104M • Text Generation • 0.1B • Updated Mar 25, 2025 • 14 • 2
MiniLLM/Pretrain-Qwen-1.2B • Text Generation • 1B • Updated Mar 25, 2025 • 10
MiniLLM/Pretrain-Qwen-500M • Text Generation • 0.5B • Updated Mar 25, 2025 • 8
MiniLLM/Pretrain-Qwen-200M • Text Generation • 0.2B • Updated Mar 25, 2025 • 5
MiniLLM/VanillaKD-Pretrain-Qwen-200M • Text Generation • 0.2B • Updated Mar 25, 2025 • 3
MiniLLM/VanillaKD-Pretrain-Qwen-500M • Text Generation • 0.5B • Updated Mar 25, 2025 • 16
MiniLLM/VanillaKD-Pretrain-Qwen-1.2B • Text Generation • 1B • Updated Mar 25, 2025 • 4
MiniLLM/init-gpt2-120M • Text Generation • 0.1B • Updated Nov 13, 2024 • 395 • 1
MiniLLM/teacher-Llama-13B • Text Generation • Updated Oct 30, 2024 • 5
MiniLLM/MiniLLM-Llama-7B • Text Generation • Updated Oct 30, 2024 • 8 • 3
MiniLLM/MiniPLM-Qwen-200M • Text Generation • 0.2B • Updated Oct 27, 2024 • 79 • 9
MiniLLM/init-Llama-7B • Text Generation • Updated Sep 26, 2024 • 5
MiniLLM/teacher-OPT-13B • Text Generation • Updated Sep 26, 2024 • 7
MiniLLM/SeqKD-Llama-7B • Text Generation • Updated Sep 26, 2024 • 6
MiniLLM/KD-Llama-7B • Text Generation • Updated Sep 26, 2024 • 6
MiniLLM/SFT-Llama-7B • Text Generation • Updated Sep 26, 2024 • 4
MiniLLM/init-OPT-6.7B • Text Generation • Updated Sep 26, 2024 • 5
MiniLLM/init-OPT-2.7B • Text Generation • Updated Sep 26, 2024 • 5
MiniLLM/init-OPT-1.3B • Text Generation • Updated Sep 26, 2024 • 5
MiniLLM/SeqKD-OPT-6.7B • Text Generation • Updated Sep 26, 2024 • 5
MiniLLM/SeqKD-OPT-2.7B • Text Generation • Updated Sep 26, 2024 • 5
MiniLLM/SeqKD-OPT-1.3B • Text Generation • Updated Sep 26, 2024 • 13
MiniLLM/KD-OPT-6.7B • Text Generation • Updated Sep 26, 2024 • 5