Resources

Install & run this model easily using llmpm

#25 opened about 16 hours ago by

sarthak-saxena

Add MMLU-Pro evaluation result

❤️ 1

#24 opened about 1 month ago by

burtenshaw

Add GPQA evaluation result

❤️ 1

#22 opened about 1 month ago by

burtenshaw

注意：此模型仅支持非思考模式，不会在其输出中生成 <think></think> 块。

#21 opened about 1 month ago by

Jay-v2

Safety Audit: GAE Score 40.13% (FAIL)

#20 opened 3 months ago by

GAE-Auditor

Discrepancy in benchmark score (BFCL-v3)

#18 opened 4 months ago by

mmrbulbul

The model doesn't know about itself

#17 opened 4 months ago by

sakazakiMGJ

vRAM needed ?

#15 opened 5 months ago by

Ashish18110

Please support vietnamese more and more in the models.

👍 1

#14 opened 5 months ago by

DuongLeVan

Adding mention of Tinker and TRL support

🔥 2

#13 opened 5 months ago by

clem

<tool_call> generated even with no tools or asked for

👀 4

#12 opened 5 months ago by

dipta007

Release training token stats

#11 opened 6 months ago by

jquessada

Remove `<think></think>` blocks from chat template

#10 opened 6 months ago by

mamousavi

No other 2507 models

👍 3

#9 opened 7 months ago by

SipOfSpike

Sampling parameters to tau2-bench?

#8 opened 7 months ago by

lewtun

1.7b 2507?

👍 7

#7 opened 7 months ago by

CHNtentes

Why is <think></think> required in history messages?

#4 opened 7 months ago by

giangndm

Terrible instruction following

👍 1

#3 opened 7 months ago by

denisalpino

4b model with an 84.2 MMLU-Redux score?

🤝 3

#2 opened 7 months ago by

phil111

when 32B?

👍 8

#1 opened 7 months ago by

AaronFeng753