Anirudh Thatipelli

Anirudh25

https://anirudh257.github.io/

Anirudh257

AI & ML interests

None yet

Recent Activity

liked a model 30 days ago

openmmlab/upernet-convnext-large

liked a dataset 9 months ago

sayakpaul/coco-30-val-2014

liked a model 9 months ago

openai/clip-vit-large-patch14-336

Organizations

None yet

upvoted an article about 1 year ago

Article

Introduction to State Space Models (SSM)

lbourdois • Jul 19, 2024 • 232

upvoted a collection about 1 year ago

Qwen2.5

Qwen2.5 language models: pretrained and instruction-tuned models in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B). • 43 items • Updated Mar 2 • 728

upvoted 3 papers about 1 year ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30, 2025 • 98

Time Blindness: Why Video-Language Models Can't See What Humans Can?

Paper • 2505.24867 • Published May 30, 2025 • 82

upvoted a collection about 1 year ago

LaViDa-1.0

LArge VIsion-language Diffusion moDel with mAsking • 10 items • Updated Mar 2 • 8

upvoted a paper about 1 year ago

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Paper • 2505.08617 • Published May 13, 2025 • 42

upvoted an article about 1 year ago

Article

What is test-time compute and how to scale it?

Kseniase • Feb 6, 2025 • 123

upvoted a paper about 1 year ago

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published Apr 8, 2025 • 87

upvoted an article about 1 year ago

Article

You could have designed state of the art positional encoding

FL33TW00D-HF • Nov 25, 2024 • 488

upvoted a collection over 1 year ago

Meta's Llama 3.3 models & evals

2 items • Updated Dec 13, 2024 • 114

upvoted an article over 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

loubnabnl, anton-l, eliebak, +1 • Jul 16, 2024 • 460

upvoted 3 papers over 1 year ago

GAEA: A Geolocation Aware Conversational Model

Paper • 2503.16423 • Published Mar 20, 2025 • 6

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Paper • 2503.06749 • Published Mar 9, 2025 • 31

Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers?

Paper • 2503.10632 • Published Mar 13, 2025 • 16

upvoted an article over 1 year ago

Article

SmolVLM - small yet mighty Vision Language Model

andito, merve, mfarre, eliebak, pcuenq, +3 • Nov 26, 2024 • 420

upvoted a paper over 1 year ago

WebArena: A Realistic Web Environment for Building Autonomous Agents

Paper • 2307.13854 • Published Jul 25, 2023 • 27

upvoted a collection over 1 year ago

Qwen2-VL

Vision-language model series based on Qwen2 • 15 items • Updated Mar 2 • 233

upvoted 2 papers over 1 year ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 67

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Paper • 2402.10210 • Published Feb 15, 2024 • 35