Models and datasets for Elastic Reset (NeurIPS 2023), code at https://github.com/mnoukhov/elastic-reset
Michael N
mnoukhov
AI & ML interests
Representation learning for functional language
Recent Activity
updated a model about 11 hours ago
mnoukhov/nuevamol-360M-6Btok-wd8 updated a model about 11 hours ago
mnoukhov/nuevamol-360M-6Btok-wd5 updated a model about 19 hours ago
mnoukhov/nuevamol-135m-reinvent-sftOrganizations
models 56
mnoukhov/nuevamol-360M-6Btok-wd8
Text Generation • 0.3B • Updated
mnoukhov/nuevamol-360M-6Btok-wd5
Text Generation • 0.3B • Updated
mnoukhov/nuevamol-135m-reinvent-sft
Text Generation • 0.1B • Updated • 620
mnoukhov/nuevamol-46M-sft
Text Generation • 46.2M • Updated • 12
mnoukhov/nuevamol-46M-6Btok-wd0.5
Text Generation • 46.2M • Updated • 14
mnoukhov/nuevamol-46M-6Btok-wd1
Text Generation • 46.2M • Updated • 14
mnoukhov/nuevamol-360m-init
0.4B • Updated • 35
mnoukhov/nuevamol-135M-wsd-6Btok-wd2.0
Text Generation • 0.1B • Updated • 19
mnoukhov/nuevamol-135m-6B-wd3
Text Generation • 0.1B • Updated • 135
mnoukhov/nuevamol-80m-reinvent-sft
Text Generation • 78.1M • Updated • 343
datasets 102
mnoukhov/chembl_filtered
Viewer • Updated • 1.18M • 61
mnoukhov/brumo-2025-openinstruct-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 60 • 8
mnoukhov/brumo-2025-openinstruct-qwen3-4b-base-32samples
Viewer • Updated • 30 • 7
mnoukhov/aime-2025-openinstruct-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 60 • 10
mnoukhov/aime-2025-openinstruct-qwen3-4b-base-32samples
Viewer • Updated • 30 • 8
mnoukhov/dapo-math-17k-processed-filtered-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 25.3k • 86
mnoukhov/dapo-math-17k-processed-filtered-qwen3-4b-base-32samples
Viewer • Updated • 12.6k • 25
mnoukhov/gsm8k-train-harder-quartiles
Viewer • Updated • 11.2k • 8
mnoukhov/manufactoria-qwen3-4b-instruct-warmup650-pass128
Viewer • Updated • 874 • 8
mnoukhov/manufactoria-qwen3-4b-instruct-warmup650-pass128-completions
Viewer • Updated • 874 • 40