tdde19-llm-from-scratch/swe_512h_4l
37.1M • Updated
tdde19-llm-from-scratch/swe_64h_8l
4.39M • Updated
tdde19-llm-from-scratch/swe_64h_4l
4.29M • Updated
tdde19-llm-from-scratch/swe_64h_2l
4.24M • Updated
tdde19-llm-from-scratch/swe_512h_8l
40.6M • Updated
tdde19-llm-from-scratch/swe_512h_2l
35.3M • Updated
tdde19-llm-from-scratch/swe_256h_8l
18.7M • Updated
tdde19-llm-from-scratch/swe_256h_4l
17.8M • Updated
tdde19-llm-from-scratch/swe_256h_2l
17.3M • Updated
tdde19-llm-from-scratch/swe_128h_8l
8.98M • Updated
tdde19-llm-from-scratch/swe_128h_4l
8.68M • Updated
tdde19-llm-from-scratch/swe_128h_2l
8.54M • Updated
tdde19-llm-from-scratch/swe_1024h_8l
93.9M • Updated
tdde19-llm-from-scratch/swe_1024h_4l
80.5M • Updated
tdde19-llm-from-scratch/swe_1024h_2l
73.8M • Updated
tdde19-llm-from-scratch/eng_64h_8l
3.41M • Updated
tdde19-llm-from-scratch/eng_64h_4l
3.32M • Updated
tdde19-llm-from-scratch/eng_64h_2l
3.27M • Updated
tdde19-llm-from-scratch/eng_512h_4l
29.3M • Updated
tdde19-llm-from-scratch/eng_512h_2l
27.5M • Updated
tdde19-llm-from-scratch/eng_256h_8l
14.8M • Updated
tdde19-llm-from-scratch/eng_256h_4l
13.9M • Updated
tdde19-llm-from-scratch/eng_256h_2l
13.4M • Updated
tdde19-llm-from-scratch/eng_128h_8l
7.02M • Updated
tdde19-llm-from-scratch/eng_128h_4l
6.73M • Updated
tdde19-llm-from-scratch/eng_128h_2l
6.58M • Updated
tdde19-llm-from-scratch/eng_1024h_8l
78.2M • Updated
tdde19-llm-from-scratch/eng_1024h_4l
64.8M • Updated
tdde19-llm-from-scratch/eng_1024h_2l
58.2M • Updated
tdde19-llm-from-scratch/tinystories_swe_tokenizer
Updated