·
AI & ML interests
None yet
Organizations
None yet
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-fhm600-batch32-epoch3-8192
Text Generation
•
3B
•
Updated
Lansechen/Qwen2.5-3B-Distill-ot114k-batch32-epoch3-8192
Text Generation
•
3B
•
Updated
•
1
Lansechen/Qwen2.5-3B-Distill-bs17k-batch32-epoch3-8192
Text Generation
•
3B
•
Updated
•
1
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-addthinktoken-new
Text Generation
•
3B
•
Updated
•
1
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-addthinktoken
Text Generation
•
3B
•
Updated
•
1
Lansechen/OLMoE-1B-7B-012-Distill-or-math220k-batch32-epoch3-8192
Text Generation
•
7B
•
Updated
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192
Text Generation
•
3B
•
Updated
•
6
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch16-epoch3-8192
Updated
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-16384
Updated
Lansechen/OLMoE-1B-7B-0125-Distill-ot114k-batch32-epoch3-8192
Text Generation
•
7B
•
Updated
•
1
Lansechen/OLMoE-1B-7B-0125-Distill-or-math220k-batch32-epoch1-8192
Text Generation
•
7B
•
Updated
•
1
Lansechen/OLMoE-1B-7B-0125-Distill-ot114k-batch32-epoch1-8192
Text Generation
•
7B
•
Updated
Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch5-8192
Text Generation
•
7B
•
Updated
•
1
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-or-math220k-batch32
Text Generation
•
7B
•
Updated
•
1
Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch1-8192
Text Generation
•
7B
•
Updated
•
3
•
1
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-bs17k-batch32-epoch5-8192
Text Generation
•
7B
•
Updated
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-bs17k-batch32-epoch1-8192
Text Generation
•
7B
•
Updated
•
1
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-ot114k-batch32-epoch2
Updated
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-ot114k-batch32
Text Generation
•
7B
•
Updated
•
2
•
1
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-bs17k-batch32
Text Generation
•
16B
•
Updated
•
5
•
1
Lansechen/Qwen2.5-3B-Instruct-Distill-om220k-batch32
Text Generation
•
3B
•
Updated
Lansechen/Qwen2.5-3B-Instruct-Distill-ot114k-batch32
Text Generation
•
3B
•
Updated
•
1
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-bs17k-batch32-epoch5
Text Generation
•
7B
•
Updated
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-batch16-lora-numinamath
Text Generation
•
Updated
•
3
•
1
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-batch8-numinamath
Text Generation
•
16B
•
Updated
•
3
•
1
Lansechen/Qwen2.5-7B-Open-R1-Distill
Text Generation
•
8B
•
Updated
•
4
Lansechen/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
1
•
1