Fix: Support loading dt_bias and other trained-model parameters in modeling_nemotron_h.py
#7 opened 8 days ago
by
shiftyblock
Correction: modeling_nemotron_h.py
#6 opened 8 months ago
by
JennBing
When will Instruct models be released?
#5 opened 9 months ago
by
mariamavagyan
Setting for Throughput Experiments
#4 opened 10 months ago
by
tonymwt
What’s the Pre-training Data Strategy Behind Nemotron-H?
#3 opened 10 months ago
by
Zieksy
When is NemotronHForCausalLM going to be updated to transformers?
1
#2 opened 11 months ago
by
mjamro3
RL/ Instruct Models wen ?
#1 opened 11 months ago
by
spsbosch