arxiv:2601.03269
Vishesh Tripathi
vishesh-t27
AI & ML interests
Large Language Models
Generative AI
Recent Activity
upvoted a paper 2 days ago
Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention submitted a paper 2 days ago
Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention liked a model 26 days ago
FrontiersMind/Nandi-Mini-V1.1-600M-Intermediate-Checkpoint-400GT