kylelovesllms/hi-hf-v2-frames-depth_withhold2_k_train100_k_eval5_eval_frame_cap500 Viewer • Updated 12 days ago • 252k • 31
kylelovesllms/hi-hf-v2-frames-depth_withhold2_k_train100_k_eval5_eval_frame_cap500 Viewer • Updated 12 days ago • 252k • 31
kylelovesllms/08_GPT2_RoPE_hi_hf_frames_heads_4_layers_4_d4_100_heldoutdepth_4 Text Generation • 809k • Updated May 28 • 2
kylelovesllms/08_GPT2_RoPE_hi_hf_frames_heads_4_layers_4_d4_100_heldoutdepth_4 Text Generation • 809k • Updated May 28 • 2
kylelovesllms/08_GPT2_RoPE_hi_hf_frames_heads_4_layers_4_random_depth_3 Text Generation • 809k • Updated May 28 • 50
kylelovesllms/08_GPT2_RoPE_hi_hf_frames_heads_4_layers_4_random_depth_3 Text Generation • 809k • Updated May 28 • 50
kylelovesllms/08_GPT2_RoPE_hi_hf_frames_heads_4_layers_4_d3_100_heldoutdepth_3 Text Generation • 809k • Updated May 28 • 8
kylelovesllms/08_GPT2_RoPE_hi_hf_frames_heads_4_layers_4_d3_100_heldoutdepth_3 Text Generation • 809k • Updated May 28 • 8
kylelovesllms/07_vaswani_RoPE_hi_hf_frames_d4_100_heldoutdepth_4 Text Generation • 1.86M • Updated May 28 • 3
kylelovesllms/07_vaswani_RoPE_hi_hf_frames_d4_100_heldoutdepth_4 Text Generation • 1.86M • Updated May 28 • 3