Shaleen Bhartiya's picture

Shaleen Bhartiya

Shaleen123

·

https://brainwaveml.ai

ShaleenBhartiya

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago

WhitzardAgent/CyberSecurity-1M

liked a dataset 3 days ago

hcnote/Cybersecurity-bigDataset

liked a dataset 3 days ago

sarahwei/cyber_MITRE_attack_tactics-and-techniques

View all activity

Organizations

upvoted a collection 3 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.69k

upvoted a paper 5 months ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 207

upvoted a paper 6 months ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 328

upvoted 3 papers 8 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 133

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29, 2025 • 48

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22, 2025 • 117

upvoted a collection 10 months ago

BrainWave-ML

Best Models in the Game! • 10 items • Updated Sep 3, 2025 • 1

upvoted a collection about 1 year ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29, 2025 • 739

upvoted an article about 1 year ago

Article

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

prithivMLmods

•

Feb 17, 2025

• 30

upvoted an article over 1 year ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

+2

ariG23498, merve, pcuenq, reach-vb

•

Mar 12, 2025

• 497

upvoted a paper over 1 year ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 100

upvoted a paper almost 2 years ago

TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder

Paper • 2409.08248 • Published Sep 12, 2024 • 16

upvoted an article almost 2 years ago

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

sayakpaul, dacorvo

•

Jul 30, 2024

• 68

upvoted a paper almost 2 years ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 119

upvoted a collection almost 2 years ago

Gemma 2 2B Release

The 2.6B parameter version of Gemma 2. • 6 items • Updated Mar 12 • 85

upvoted an article almost 2 years ago

Article

SmolLM - blazingly fast and remarkably powerful

+1

loubnabnl, anton-l, eliebak

•

Jul 16, 2024

• 460

upvoted a paper almost 2 years ago

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Paper • 2406.16855 • Published Jun 24, 2024 • 57

upvoted a collection about 2 years ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 975

upvoted a paper about 2 years ago

From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

Paper • 2406.12824 • Published Jun 18, 2024 • 21

upvoted an article about 2 years ago

Article

Diffusers welcomes Stable Diffusion 3

+4

dn6, YiYiXu, sayakpaul, OzzyGT, kashif, multimodalart

•

Jun 12, 2024

• 99