tokyotech-llm/Llama-3.1-8B-code-ablation-exp9-LR2.5e-5-WD0.1-iter0007500
8B • Updated • 1
tokyotech-llm/Llama-3.1-8B-code-ablation-exp5-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0010000
8B • Updated • 1
tokyotech-llm/Llama-3.1-8B-code-ablation-exp7-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0007500
8B • Updated • 2
tokyotech-llm/Llama-3.1-8B-code-ablation-exp6-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0007500
tokyotech-llm/Llama-3.1-8B-code-ablation-exp9-LR2.5e-5-WD0.1-iter0005000
8B • Updated • 1
tokyotech-llm/Llama-3.1-8B-code-ablation-exp5-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0007500
tokyotech-llm/Llama-3.1-8B-code-ablation-exp7-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0005000
tokyotech-llm/Llama-3.1-8B-code-ablation-exp6-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0005000
8B • Updated • 1
tokyotech-llm/Llama-3.1-8B-code-ablation-exp9-LR2.5e-5-WD0.1-iter0002500
8B • Updated • 1
tokyotech-llm/Llama-3.1-8B-code-ablation-exp5-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0005000
tokyotech-llm/Llama-3.1-8B-code-ablation-exp7-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0002500
8B • Updated • 2
tokyotech-llm/Llama-3.1-8B-code-ablation-exp6-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0002500
tokyotech-llm/Llama-3.1-8B-code-ablation-exp5-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0002500
tokyotech-llm/Llama-3.1-Swallow-8B-v0.2
Text Generation
• 8B • Updated • 24
• • 4
tokyotech-llm/Llama-3.1-Swallow-70B-v0.1
Text Generation
• 71B • Updated • 81
• • 5
tokyotech-llm/Llama-3.1-Swallow-8B-v0.1
Text Generation
• 8B • Updated • 76
• • 10
tokyotech-llm/edu-classifier
Text Classification
• Updated • 258
• 13
tokyotech-llm/Swallow-7b-NVE-hf
Text Generation
• 7B • Updated • 31
• • 2
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0012500
8B • Updated • 2
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0010000
8B • Updated • 3
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0007500
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0005000
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0002500
8B • Updated • 2
tokyotech-llm/Llama-3-Swallow-70B-v0.1
Text Generation
• Updated • 24
• • 6
tokyotech-llm/Llama-3-Swallow-8B-v0.1
Text Generation
• Updated • 1.14k
• • 12
tokyotech-llm/Llama-3-Swallow-70B-Instruct-v0.1
Text Generation
• 71B • Updated • 39
• • 7
tokyotech-llm/Llama-3-Swallow-8B-Instruct-v0.1
Text Generation
• 8B • Updated • 11.5k
• • 21
tokyotech-llm/Swallow-70b-instruct-v0.1
Text Generation
• 69B • Updated • 110
tokyotech-llm/Swallow-13b-instruct-v0.1
Text Generation
• 13B • Updated • 13
• 1
tokyotech-llm/Swallow-7b-instruct-v0.1
Text Generation
• 7B • Updated • 233
• 4