Hugging Face
JustinLeee/GrandLine_LLM
Tags: Question Answering, Chinese, English, chatbot, LLM, Pretrain, SFT, Distill, GRPO, CoT, Pytorch, Deepseek-MoE, Qwen3-Dense
License: mit
Branch: main
Path: GrandLine_LLM/images (5.89 MB)
1 contributor. History: 1 commit.
Latest commit: JustinLeee, "Update images via Git LFS" (5158eb1), 15 days ago
Files (each stored via xet; last commit for every file: "Update images via Git LFS", 15 days ago):

CoT_sample_1.jpg                      305 kB
Distil_Loss.jpg                       29 kB
GRPO_Format.jpg                       87.7 kB
GRPO_KL_Length.jpg                    71.5 kB
GRPO_LLM_Judge.jpg                    74 kB
GRPO_Mean.jpg                         80.8 kB
GRPO_Sample_1.jpg                     118 kB
GRPO_Solve_All_None.jpg               68.4 kB
GranLine_Dense.jpg                    1.34 MB
GranLine_MoE.jpg                      1.51 MB
Pretrain_Benchmark.jpg                54.1 kB
Pretrain_Loss.jpg                     25 kB
Pretrain_sample_1.jpg                 37.8 kB
SFT_Factuality.jpg                    52.6 kB
SFT_Fluency.jpg                       49.6 kB
SFT_Instruction_Following.jpg         56.3 kB
SFT_Loss.jpg                          35.8 kB
SFT_Mean.jpg                          50.3 kB
SFT_sample_1.jpg                      155 kB
Training_Pipeline.jpg                 1.24 MB
data_distribution_professional.png    443 kB