UT_340M_0.01 / model

Commit History

Upload model/model/token_position_embeddings/pp_block/token_embedding/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
ee28d45
verified

ridger committed on

Upload model/model/final_layer_norm/pp_block/model_weight.safetensors with huggingface_hub
c160fdb
verified

ridger committed on

Upload model/model/decoder/9/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
cf3d29c
verified

ridger committed on

Upload model/model/decoder/9/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
6de4374
verified

ridger committed on

Upload model/model/decoder/9/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
9b105b2
verified

ridger committed on

Upload model/model/decoder/9/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
57de5f1
verified

ridger committed on

Upload model/model/decoder/8/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
1add6d5
verified

ridger committed on

Upload model/model/decoder/9/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
0942c74
verified

ridger committed on

Upload model/model/decoder/9/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
27c7ab2
verified

ridger committed on

Upload model/model/decoder/8/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
a5df9ff
verified

ridger committed on

Upload model/model/decoder/8/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
92b1b44
verified

ridger committed on

Upload model/model/decoder/8/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
3ccc8a8
verified

ridger committed on

Upload model/model/decoder/8/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
0ff9cf1
verified

ridger committed on

Upload model/model/decoder/7/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
3bf26b3
verified

ridger committed on

Upload model/model/decoder/8/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
9dc1417
verified

ridger committed on

Upload model/model/decoder/7/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
9ee05c2
verified

ridger committed on

Upload model/model/decoder/7/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
d5553f2
verified

ridger committed on

Upload model/model/decoder/7/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
8f70645
verified

ridger committed on

Upload model/model/decoder/6/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
3485e5d
verified

ridger committed on

Upload model/model/decoder/7/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
6440449
verified

ridger committed on

Upload model/model/decoder/7/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
b01851a
verified

ridger committed on

Upload model/model/decoder/6/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
13af4d6
verified

ridger committed on

Upload model/model/decoder/6/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
9b8956c
verified

ridger committed on

Upload model/model/decoder/6/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
1604489
verified

ridger committed on

Upload model/model/decoder/6/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
ffb3491
verified

ridger committed on

Upload model/model/decoder/6/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
163c0b7
verified

ridger committed on

Upload model/model/decoder/5/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
766b587
verified

ridger committed on

Upload model/model/decoder/5/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
ca63828
verified

ridger committed on

Upload model/model/decoder/5/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
fde222a
verified

ridger committed on

Upload model/model/decoder/5/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
47cb45b
verified

ridger committed on

Upload model/model/decoder/5/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
d54a30f
verified

ridger committed on

Upload model/model/decoder/4/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
1da0912
verified

ridger committed on

Upload model/model/decoder/5/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
0f9b632
verified

ridger committed on

Upload model/model/decoder/4/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
1738cb2
verified

ridger committed on

Upload model/model/decoder/4/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
c740c35
verified

ridger committed on

Upload model/model/decoder/4/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
d68b127
verified

ridger committed on

Upload model/model/decoder/4/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
1ebc668
verified

ridger committed on

Upload model/model/decoder/31/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
0868f78
verified

ridger committed on

Upload model/model/decoder/4/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
f09ea5a
verified

ridger committed on

Upload model/model/decoder/31/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
d858573
verified

ridger committed on

Upload model/model/decoder/31/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
a2d838a
verified

ridger committed on

Upload model/model/decoder/31/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
029c885
verified

ridger committed on

Upload model/model/decoder/31/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
96c972c
verified

ridger committed on

Upload model/model/decoder/30/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
1eeaeb2
verified

ridger committed on

Upload model/model/decoder/31/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
9a37d61
verified

ridger committed on

Upload model/model/decoder/30/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
e0f7ae3
verified

ridger committed on

Upload model/model/decoder/30/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
f2d2edc
verified

ridger committed on

Upload model/model/decoder/30/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
d32677a
verified

ridger committed on

Upload model/model/decoder/3/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
9a6c55f
verified

ridger committed on

Upload model/model/decoder/3/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
1367105
verified

ridger committed on