ML Foundations Development

non-profit

https://github.com/mlfoundations

Recent Activity

wenwenD submitted a paper about 1 month ago

Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents

liangyuch authored a paper about 1 month ago

UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

liangyuch submitted a paper about 1 month ago

UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

mlfoundations-dev's models (2,134)

mlfoundations-dev/instruction_filtering_fast_text_classifier_seed_science_pos_scp

Updated Mar 19, 2025

mlfoundations-dev/instruction_filtering_fast_text_classifier_seed_science_pos_wikipedia_biology

Updated Mar 19, 2025

mlfoundations-dev/instruction_filtering_fast_text_classifier_seed_science_pos_fineweb

Updated Mar 19, 2025

mlfoundations-dev/instruction_filtering_fast_text_classifier_seed_science_pos_expertqa

Updated Mar 19, 2025

mlfoundations-dev/instruction_filtering_fast_text_classifier_seed_science_pos_anderson_chemistry

Updated Mar 19, 2025

mlfoundations-dev/DCFT-filter_math_neg_s1_1_gemini-etash

8B • Updated Mar 19, 2025 • 1

mlfoundations-dev/DCFT-swe_gym_annotate_with_patch-etash

8B • Updated Mar 19, 2025 • 1

mlfoundations-dev/Qwen2.5-7B-stratos_verified_mix-megatron

8B • Updated Mar 19, 2025

mlfoundations-dev/global_batchsize_512_lradjusted8_warmup05

Text Generation • 8B • Updated Mar 19, 2025 • 1

mlfoundations-dev/DCFT-science_verification_scale_up_random_fix-etash

8B • Updated Mar 18, 2025 • 1

mlfoundations-dev/DCFT-filter_code_pos_majority-etash

8B • Updated Mar 18, 2025 • 1

mlfoundations-dev/DCFT-filter_code_pos_codeforces_gpt-etash

8B • Updated Mar 18, 2025 • 1

mlfoundations-dev/DCFT-filter_code_baseline-etash

8B • Updated Mar 18, 2025 • 1

mlfoundations-dev/weight_decay_10

Text Generation • 8B • Updated Mar 18, 2025

mlfoundations-dev/weight_decay_05

Text Generation • 8B • Updated Mar 18, 2025

mlfoundations-dev/weight_decay_02

Text Generation • 8B • Updated Mar 18, 2025

mlfoundations-dev/weight_decay_15

Text Generation • 8B • Updated Mar 18, 2025

mlfoundations-dev/DCFT-ot-114k_general-thought-feb-25-etash

8B • Updated Mar 18, 2025 • 1

mlfoundations-dev/hero_run_2_fix_conversations_32b

Updated Mar 17, 2025

mlfoundations-dev/Bespoke-Stratos-17k

Text Generation • 8B • Updated Mar 17, 2025 • 3

mlfoundations-dev/DCFT-general-thought-feb-25-etash

8B • Updated Mar 17, 2025

mlfoundations-dev/DCFT-science_verification_scale_up_random-etash

8B • Updated Mar 17, 2025

mlfoundations-dev/DCFT-swe_gym_annotate-etash

8B • Updated Mar 17, 2025

mlfoundations-dev/DCFT-science_verification_scale_up_majority_consensus-etash

8B • Updated Mar 17, 2025

mlfoundations-dev/DCFT-science_verification_scale_up_gpt_verification-etash

8B • Updated Mar 17, 2025 • 1

mlfoundations-dev/DCFT-science_verification_scale_up_all-etash

8B • Updated Mar 17, 2025

mlfoundations-dev/qwen_lawma_filtered_deepseek-2k-5x

Text Generation • 8B • Updated Mar 17, 2025 • 2

mlfoundations-dev/hero_run_2_fix_conversations

Text Generation • 8B • Updated Mar 17, 2025

mlfoundations-dev/DCFT-seed_code_r1_glaive-etash

8B • Updated Mar 17, 2025 • 1

mlfoundations-dev/qwen2-5_multiple_samples_ground_truth_openr1_llm_verifier_clean

Text Generation • 0.5B • Updated Mar 16, 2025 • 6