Install & run this model easily using llmpm
#25 opened about 16 hours ago
by
sarthak-saxena
Add MMLU-Pro evaluation result
β€οΈ 1
#24 opened about 1 month ago
by
burtenshaw
Add GPQA evaluation result
β€οΈ 1
#22 opened about 1 month ago
by
burtenshaw
注ζοΌζ€ζ¨‘εδ» ζ―ζιζθ樑εΌοΌδΈδΌε¨ε ΆθΎεΊδΈηζ <think></think> εγ
#21 opened about 1 month ago
by
Jay-v2
Safety Audit: GAE Score 40.13% (FAIL)
#20 opened 3 months ago
by
GAE-Auditor
Discrepancy in benchmark score (BFCL-v3)
1
#18 opened 4 months ago
by
mmrbulbul
The model doesn't know about itself
2
#17 opened 4 months ago
by
sakazakiMGJ
vRAM needed ?
1
#15 opened 5 months ago
by
Ashish18110
Please support vietnamese more and more in the models.
π 1
#14 opened 5 months ago
by
DuongLeVan
Adding mention of Tinker and TRL support
π₯ 2
#13 opened 5 months ago
by
clem
<tool_call> generated even with no tools or asked for
π 4
6
#12 opened 5 months ago
by
dipta007
Release training token stats
#11 opened 6 months ago
by
jquessada
Remove `<think></think>` blocks from chat template
#10 opened 6 months ago
by
mamousavi
No other 2507 models
π 3
#9 opened 7 months ago
by
SipOfSpike
Sampling parameters to tau2-bench?
#8 opened 7 months ago
by
lewtun
1.7b 2507?
π 7
#7 opened 7 months ago
by
CHNtentes
Why is <think></think> required in history messages?
#4 opened 7 months ago
by
giangndm
Terrible instruction following
π 1
4
#3 opened 7 months ago
by
denisalpino
4b model with an 84.2 MMLU-Redux score?
π€ 3
1
#2 opened 7 months ago
by
phil111
when 32B?
π 8
2
#1 opened 7 months ago
by
AaronFeng753