gradientrouting-spar/base_brwn_bott_s1_1_proxy_actions_ntr_25_20250612_163824
Text Generation • 3B
gradientrouting-spar/gcd_syco_capitalsdpo_limit_proxy_data_to-1_pos_prx-proxy_neg_prx-proxy_neg_ldpo-2_seed_1
gradientrouting-spar/gcd_syco_capitalsst_we_limit_proxy_data_to-1_pos_prx-proxy_neg_prx-proxy_neg_st_alpha-0.8_seed_5
gradientrouting-spar/gcd_syco_capitalsst_we_limit_proxy_data_to-1_pos_prx-proxy_neg_prx-proxy_neg_st_alpha-0.8_seed_1
gradientrouting-spar/gcd_syco_capitalsnaive_seed_42
gradientrouting-spar/gcd_syco_capitalsnaive_seed_5
gradientrouting-spar/gcd_syco_capitalsnaive_seed_1
gradientrouting-spar/gcd_syco_modkl_div_beta_kl-100_seed_42
gradientrouting-spar/gcd_syco_modkl_div_beta_kl-100_seed_5
gradientrouting-spar/gcd_syco_modkl_div_beta_kl-100_seed_1
gradientrouting-spar/gcd_syco_modkl_div_beta_kl-10_seed_42
gradientrouting-spar/gcd_syco_modkl_div_beta_kl-10_seed_5
gradientrouting-spar/gcd_syco_modkl_div_beta_kl-10_seed_1
gradientrouting-spar/gcd_syco_modkl_div_beta_kl-1_seed_42
gradientrouting-spar/gcd_syco_modkl_div_beta_kl-1_seed_5
gradientrouting-spar/gcd_syco_modkl_div_beta_kl-1_seed_1
gradientrouting-spar/gcd_syco_modst_we_pos_prx-out_neg_prx-proxy_neg_st_alpha-0.8_seed_42
gradientrouting-spar/gcd_syco_modst_we_pos_prx-out_neg_prx-proxy_neg_st_alpha-0.8_seed_5
gradientrouting-spar/gcd_syco_modst_we_pos_prx-out_neg_prx-proxy_neg_st_alpha-0.8_seed_1
gradientrouting-spar/gcd_syco_modst_we_train_split-0.3_pos_prx-proxy_neg_prx-proxy_neg_st_alpha-1.0_seed_42
gradientrouting-spar/gcd_syco_modst_we_train_split-0.3_pos_prx-proxy_neg_prx-proxy_neg_st_alpha-1.0_seed_5
gradientrouting-spar/mc9_badmed_st_we_atc-0.45_dsd-42_msd-42_pos_prx-out_neg_prx-proxy_neg_st_alp-0.6_seed_1
gradientrouting-spar/mc9_badmed_st_we_atc-0.45_dsd-42_msd-42_pos_prx-out_neg_prx-proxy_neg_st_alp-0.6_seed_1_epoch_1
gradientrouting-spar/gcd_syco_modst_we_train_split-0.3_pos_prx-proxy_neg_prx-proxy_neg_st_alpha-1.0_seed_1
gradientrouting-spar/gcd_syco_moddpo_train_split-0.3_pos_prx-proxy_neg_prx-proxy_neg_ldpo-6_seed_42
gradientrouting-spar/base_brown_bottom_2_20250612_151228
Text Generation • 3B • 1
gradientrouting-spar/gcd_syco_moddpo_train_split-0.3_pos_prx-proxy_neg_prx-proxy_neg_ldpo-6_seed_5
gradientrouting-spar/gcd_syco_moddpo_train_split-0.3_pos_prx-proxy_neg_prx-proxy_neg_ldpo-6_seed_1
gradientrouting-spar/gcd_syco_moddpo_train_split-0.3_pos_prx-proxy_neg_prx-proxy_neg_ldpo-4_seed_42
gradientrouting-spar/gcd_syco_moddpo_train_split-0.3_pos_prx-proxy_neg_prx-proxy_neg_ldpo-4_seed_5