Vaibhavi Lokegaonkar

vlokegaonkar

vlokegaonkar's collections (8)

RL in Diffusion

SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs

Paper • 2602.06040 • Published Feb 5 • 10
RISE-Video: Can Video Generators Decode Implicit World Rules?

Paper • 2602.05986 • Published Feb 5 • 27

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 637k • 2.79k
google/smol

Viewer • Updated 13 days ago • 842k • 2.86k • 108

dataset-ui-understanding

smirki/UI_Reasoning_Dataset

Viewer • Updated Feb 19, 2025 • 421 • 58 • 44

Language Models

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 3.63M • 13.3k
facebook/opt-125m

Text Generation • Updated Sep 15, 2023 • 8.62M • 251
meta-llama/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Dec 21, 2024 • 851k • 2.76k
meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 9.39M • 5.81k

Diffusion models

Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers

Paper • 2602.03510 • Published Feb 3 • 27
Context Forcing: Consistent Autoregressive Video Generation with Long Context

Paper • 2602.06028 • Published Feb 5 • 36
Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published Feb 5 • 52
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better

Paper • 2602.05393 • Published Feb 5 • 8

dataset-math-reasoning

bethgelab/CuratedThoughts

Viewer • Updated Feb 26, 2025 • 222k • 674 • 44
open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18, 2025 • 450k • 22.8k • 743
facebook/natural_reasoning

Viewer • Updated Feb 21, 2025 • 1.15M • 2k • 568
open-thoughts/OpenThoughts-114k

Viewer • Updated Aug 31, 2025 • 228k • 127k • 838

old-language-models

openai-community/gpt2

Text Generation • 0.1B • Updated Feb 19, 2024 • 15.9M • 3.24k

Code Language Models

Models generating code or performing code completion

refactai/Refact-1_6B-fim

Text Generation • Updated Nov 9, 2023 • 8.2k • 141
bigcode/starcoder2-3b

Text Generation • 3B • Updated Mar 4, 2024 • 99.3k • 219
Kwaipilot/KwaiCoder-DS-V2-Lite-Base

Text Generation • 16B • Updated Jan 6, 2025 • 872 • 7
