kylelovesllms/08_GPT2_RoPE_hi_hf_frames_heads_4_layers_4_d4_100_heldoutdepth_4 Text Generation • 809k • Updated May 28 • 2
kylelovesllms/08_GPT2_RoPE_hi_hf_frames_heads_4_layers_4_random_depth_3 Text Generation • 809k • Updated May 28 • 50
kylelovesllms/08_GPT2_RoPE_hi_hf_frames_heads_4_layers_4_d3_100_heldoutdepth_3 Text Generation • 809k • Updated May 28 • 8
kylelovesllms/07_vaswani_RoPE_hi_hf_frames_d4_100_heldoutdepth_4 Text Generation • 1.86M • Updated May 28 • 3
kylelovesllms/07_vaswani_RoPE_hi_hf_frames_heads_4_layers_4_random_depth_3 Text Generation • 1.86M • Updated May 28 • 62
kylelovesllms/07_vaswani_RoPE_hi_hf_frames_d3_100_heldoutdepth_3 Text Generation • 1.86M • Updated May 28 • 19