hug
joggyback
AI & ML interests
text to speech
Recent Activity
updated a collection 3 days ago
explanableModel
new activity 9 days ago
Qwen/Qwen3-14B: How to run SFT training with qwen3-14B, without LoRA
updated a collection 10 days ago
gemma4
Organizations
None yet
How to run SFT training with qwen3-14B, without LoRA
2
#15 opened 10 months ago
by
chlyzzo
Plans for the future
2
#7 opened 29 days ago
by
Akicou
Does it support a sequence length of 1024?
1
#3 opened 20 days ago
by
joggyback
Will there be QAT models?
🤝👍 12
4
#49 opened 3 months ago
by
Regrin
124B, pretty please?
👍 29
9
#10 opened 3 months ago
by
vody-am
The Gemma 4 model is great. But...
👍 4
7
#43 opened 3 months ago
by
suitup91
Reviews of Gemma 4
4
#92 opened 2 months ago
by
Juanoto2012
Gemma 4:124b
👍🚀 90
18
#1 opened 29 days ago
by
seamon67
Gemma 4 124b
🚀 20
7
#17 opened 28 days ago
by
FusionCow
Congrats on 1 MILLION downloads
🔥 7
4
#28 opened about 1 month ago
by
plz12345
When will this be available on llama.cpp?
❤️🔥 16
3
#10 opened about 2 months ago
by
Kendolph
What is it for?
👍 5
5
#3 opened about 2 months ago
by
Tikhonum
Hardware requirement
👍👀 3
15
#52 opened 3 months ago
by
Charan01
High First Token Latency Issue with AWQ-4bit Model Deployment Using vLLM
👍 2
3
#2 opened 3 months ago
by
Jeanxx
vLLM on 24gb gpu
👍 2
1
#2 opened over 1 year ago
by
roadtoagi