SHIVAM KUMAR

Shivam3002

AI & ML interests

LLM, AI agent, NLP

Recent Activity

liked a model 9 days ago

Jiunsong/supergemma4-26b-uncensored-gguf-v2

Text Generation • 25B • Updated Apr 12 • 244k • 646

liked a model 26 days ago

Shivam3002/ppo-Huggy

Reinforcement Learning • Updated 26 days ago • 101 • 1

updated a model 26 days ago

Shivam3002/ppo-Huggy

Reinforcement Learning • Updated 26 days ago • 101 • 1

published a model 26 days ago

Shivam3002/ppo-Huggy

Reinforcement Learning • Updated 26 days ago • 101 • 1

liked a model 26 days ago

Shivam3002/ppo-LunarLander-v3

Reinforcement Learning • Updated 26 days ago • 73 • 1

updated a model 26 days ago

Shivam3002/ppo-LunarLander-v3

Reinforcement Learning • Updated 26 days ago • 73 • 1

published a model 26 days ago

Shivam3002/ppo-LunarLander-v3

Reinforcement Learning • Updated 26 days ago • 73 • 1

liked a Space about 2 months ago

Robot Learning: A Tutorial

📝

406

Read Robot Learning tutorial with interactive TOC

upvoted an article about 2 months ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain • Jan 30, 2025 • 334

updated a Space about 1 year ago

Reader

⚡

Extract serial number and meter reading from an image

published a Space about 1 year ago

Reader

⚡

Extract serial number and meter reading from an image

upvoted an article about 1 year ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.13k

upvoted a collection about 1 year ago

TxAgent

Collection

6 items • Updated Sep 30, 2025 • 20

updated a collection about 1 year ago

ai_paper

Collection

upvoted a paper about 1 year ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 67

updated a Space about 1 year ago

Alfredagent

💬

published a Space about 1 year ago

Alfredagent

💬

updated a Space about 1 year ago

First Agent Template

⚡

Fetch the local time in any timezone

liked 2 models about 1 year ago

meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 2.21M • 2.14k

Shivam3002/test

Updated Mar 5, 2025 • 1
