AI & ML interests
None yet
Organizations
None yet
gradientrouting-spar/gcd_syco_cap_math_representation_constraint_beta_kl-100.0_seed_1_seed_5
Updated
gradientrouting-spar/gcd_syco_cap_math_representation_constraint_beta_kl-100.0_seed_1
Updated
gradientrouting-spar/gcd_syco_cap_math_representation_constraint_beta_kl-20.0_seed_1_seed_5_seed_42_seed_10_seed_11
Updated
gradientrouting-spar/gcd_syco_cap_math_representation_constraint_beta_kl-20.0_seed_1_seed_5_seed_42_seed_10
Updated
gradientrouting-spar/gcd_syco_cap_math_representation_constraint_beta_kl-20.0_seed_1_seed_5_seed_42
Updated
gradientrouting-spar/gcd_syco_cap_math_representation_constraint_beta_kl-20.0_seed_1_seed_5
Updated
gradientrouting-spar/YOUR_MODEL_ID
Updated
gradientrouting-spar/gcd_syco_cap_math_representation_constraint_beta_kl-20.0_seed_1
Updated
gradientrouting-spar/gcd_syco_cap_math_st_we_train_split-0.3_is_peft-False_st_alp-1.0_seed_1
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/gcd_syco_cap_math_safe_lora_safe_lora_num_proj_layers-100_safe_lora_threshold-0.01_seed_1_seed_5
Updated
gradientrouting-spar/gcd_syco_cap_math_safe_lora_safe_lora_num_proj_layers-100_safe_lora_threshold-0.01_seed_1
Updated
gradientrouting-spar/gcd_syco_cap_math_safe_lora_safe_lora_num_proj_layers-100_safe_lora_threshold-0.0_seed_1_seed_5
Updated
gradientrouting-spar/gcd_syco_cap_math_safe_lora_safe_lora_num_proj_layers-100_safe_lora_threshold-0.0_seed_1
Updated
gradientrouting-spar/2d_1proxy_ntr1_random_seed_1_seed_2_seed_25_seed_42_20250701_103936
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_random_seed_1_seed_2_seed_25_20250701_101913
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_random_seed_1_seed_2_20250701_095822
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_random_seed_1_20250701_092732
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_objects_seed_1_seed_2_seed_25_seed_42_20250701_045641
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_objects_seed_1_seed_2_seed_25_20250701_043607
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_objects_seed_1_seed_2_20250701_041536
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_objects_seed_1_20250701_035503
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_foods_seed_1_seed_2_seed_25_seed_42_20250701_033429
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_foods_seed_1_seed_2_seed_25_20250701_031347
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_foods_seed_1_seed_2_20250701_025300
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_foods_seed_1_20250701_023221
Text Generation
•
3B
•
Updated
gradientrouting-spar/2d_1proxy_ntr1_actions_seed_1_seed_2_seed_25_seed_42_20250701_021152
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_actions_seed_1_seed_2_seed_25_20250630_234606
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_actions_seed_1_seed_2_20250630_232538
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_1proxy_ntr1_actions_seed_1_20250630_205548
Text Generation
•
3B
•
Updated
•
1
gradientrouting-spar/2d_data_color_base_seed_42_20250630_183914
Text Generation
•
3B
•
Updated
•
1