MononitoGoswami (Mononito Goswami)

liked a Space 2 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

115

Explore on-policy distillation visualization for any model

liked a dataset 3 months ago

allenai/Dolci-Think-SFT-32B

Viewer • Updated Mar 2 • 2.25M • 1.24k • 29

liked 2 Spaces 3 months ago

The Smol Training Playbook

📚

3.22k

The secrets to building world-class LLMs

Evaluation Guidebook

📝

330

Explore LLM benchmark scores over time

liked a Space 4 months ago

The Ultra-Scale Playbook

🌌

3.91k

The ultimate guide to training LLM on large GPU Clusters

liked 2 models 11 months ago

autogluon/mitra-regressor

Tabular Regression • Updated Nov 19, 2025 • 240k • 31

autogluon/mitra-classifier

Tabular Classification • Updated Nov 19, 2025 • 163k • 39

liked a Space over 1 year ago

Tech Tree Blog

🌳

5

Visualize and track machine learning research progress using a tech tree

liked 2 models over 1 year ago

AutonLab/MOMENT-1-small

Time Series Forecasting • 37.9M • Updated Mar 26, 2025 • 711k • 6

AutonLab/MOMENT-1-large

Time Series Forecasting • 0.3B • Updated Mar 26, 2025 • 48.4k • 98

liked a dataset over 1 year ago

AutonLab/TimeSeriesExam1

Viewer • Updated Mar 13, 2025 • 746 • 634 • 7

liked a model over 1 year ago

AutonLab/MOMENT-1-base

Time Series Forecasting • 0.1B • Updated Mar 26, 2025 • 23.1k • 6

liked a dataset over 1 year ago

mlfoundations/tabula-8b-eval-suite

Viewer • Updated Dec 13, 2024 • 19.7k • 66 • 6

liked 2 models over 1 year ago

google/gemma-2b-it

Text Generation • 3B • Updated Sep 27, 2024 • 89.9k • • 921

google/gemma-7b

Text Generation • 9B • Updated Jun 27, 2024 • 26.5k • • 3.37k

liked a dataset almost 2 years ago

mapitanywhere/mapitanywhere

Updated Jun 27, 2024 • 1 • 1

liked 2 models almost 2 years ago

mapitanywhere/mapper

Updated Jun 29, 2024 • 1

sentence-transformers/sentence-t5-base

liked a dataset about 2 years ago

inria-soda/tabular-benchmark

Viewer • Updated Sep 4, 2023 • 17.2M • 4.18k • 48

liked a model about 2 years ago

google/timesfm-1.0-200m

Time Series Forecasting • Updated May 17, 2024 • 253 • 826

Mononito Goswami

AI & ML interests

Organizations

Unlocking On-Policy Distillation for Any Model Family

allenai/Dolci-Think-SFT-32B

The Smol Training Playbook

Evaluation Guidebook

The Ultra-Scale Playbook

autogluon/mitra-regressor

autogluon/mitra-classifier

Tech Tree Blog

AutonLab/MOMENT-1-small

AutonLab/MOMENT-1-large

AutonLab/TimeSeriesExam1

AutonLab/MOMENT-1-base

mlfoundations/tabula-8b-eval-suite

google/gemma-2b-it

google/gemma-7b

mapitanywhere/mapitanywhere

mapitanywhere/mapper

sentence-transformers/sentence-t5-base

inria-soda/tabular-benchmark

google/timesfm-1.0-200m

Mononito Goswami

AI & ML interests

Organizations

MononitoGoswami's activity

Unlocking On-Policy Distillation for Any Model Family

The Smol Training Playbook

Evaluation Guidebook

The Ultra-Scale Playbook

Tech Tree Blog