
Marius Dinca

Puddings22

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens
upvoted a paper 16 days ago
Training Language Models via Neural Cellular Automata
commented on a paper 24 days ago
SimpleGPT: Improving GPT via A Simple Normalization Strategy

Organizations

None yet

commented on a paper 24 days ago

SimpleGPT: Improving GPT via A Simple Normalization Strategy

Paper • 2602.01212 • Published Feb 1 • 3 • 6
commented on 2 papers about 1 month ago

DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels

Paper • 2602.11715 • Published Feb 12 • 6 • 3

Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm

Paper • 2602.11543 • Published Feb 12 • 6 • 4
commented on 2 papers about 2 months ago

NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models

Paper • 2602.06694 • Published Feb 6 • 15 • 5

SimpleGPT: Improving GPT via A Simple Normalization Strategy

Paper • 2602.01212 • Published Feb 1 • 3 • 6
commented on a paper 2 months ago

OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer

Paper • 2601.14250 • Published Jan 20 • 48 • 5