Marius Dinca's picture

Marius Dinca

Puddings22

·

Puddings22

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

upvoted a paper about 1 month ago

Generative Recursive Reasoning

upvoted a paper about 1 month ago

WavFlow: Audio Generation in Waveform Space

View all activity

Organizations

upvoted 3 papers about 1 month ago

Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

Paper • 2605.17423 • Published May 17 • 34

Generative Recursive Reasoning

Paper • 2605.19376 • Published May 20 • 30

WavFlow: Audio Generation in Waveform Space

Paper • 2605.18749 • Published May 18 • 10

upvoted a paper about 2 months ago

Fast Byte Latent Transformer

Paper • 2605.08044 • Published May 8 • 12

upvoted 4 papers 2 months ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 167

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2604.12374 • Published Apr 14 • 37

Large Language Models Align with the Human Brain during Creative Thinking

Paper • 2604.03480 • Published Apr 3 • 6

Beyond the Assistant Turn: User Turn Generation as a Probe of Interaction Awareness in Language Models

Paper • 2604.02315 • Published Apr 3 • 5

upvoted a collection 3 months ago

Bonsai

1-bit Bonsai models • 7 items • Updated 25 days ago • 208

upvoted a paper 3 months ago

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 53

upvoted 9 papers 4 months ago

Training Language Models via Neural Cellular Automata

Paper • 2603.10055 • Published Mar 9 • 8

Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality

Paper • 2602.14080 • Published Feb 15 • 23

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

Paper • 2602.16849 • Published Feb 18 • 7

2Mamba2Furious: Linear in Complexity, Competitive in Accuracy

Paper • 2602.17363 • Published Feb 19 • 8

Preliminary sonification of ENSO using traditional Javanese gamelan scales

Paper • 2602.14560 • Published Feb 16 • 1

On Surprising Effectiveness of Masking Updates in Adaptive Optimizers

Paper • 2602.15322 • Published Feb 17 • 11

DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels

Paper • 2602.11715 • Published Feb 12 • 8

Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm

Paper • 2602.11543 • Published Feb 12 • 6

LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation

Paper • 2602.11451 • Published Feb 11 • 16

upvoted a paper 5 months ago

NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models

Paper • 2602.06694 • Published Feb 6 • 20