YWZBrandon/google_flan-t5-large_semantic_10_clusters_2_full_upsample1000 0.8B • Updated May 7, 2025 • 1
YWZBrandon/google_flan-t5-large_semantic_3_clusters_1_full_upsample1000 0.8B • Updated May 7, 2025 • 1
YWZBrandon/google_flan-t5-large_semantic_10_clusters_1_full_upsample1000 0.8B • Updated May 7, 2025 • 1
YWZBrandon/google_flan-t5-large_semantic_10_clusters_0_full_upsample1000 0.8B • Updated May 7, 2025 • 3
YWZBrandon/Qwen_Qwen2.5-1.5B_ds3500_upsample1000_predict_mask Text Generation • 2B • Updated May 7, 2025 • 1
YWZBrandon/Qwen_Qwen2.5-1.5B_ds1000_upsample1000_predict_mask Text Generation • 2B • Updated May 7, 2025 • 1
YWZBrandon/Qwen_Qwen2.5-1.5B_ds100_upsample1000_predict_mask Text Generation • 2B • Updated May 7, 2025 • 1
YWZBrandon/openai-gsm8k_meta-llama-Llama-3.2-3B_2e-6_500vocab Text Generation • 3B • Updated Jan 6, 2025
YWZBrandon/openai-gsm8k_meta-llama-Llama-3.2-3B_2e-5_allvocab Text Generation • 3B • Updated Jan 5, 2025 • 1
YWZBrandon/openai-gsm8k_meta-llama-Llama-3.2-3B_2e-6_allvocab Text Generation • 3B • Updated Jan 5, 2025