Vladimir

galqiwi

·

AI & ML interests

None yet

Organizations

upvoted a paper 5 months ago

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 63

upvoted a paper 12 months ago

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

Paper • 2507.08800 • Published Jul 11, 2025 • 81

upvoted 2 papers about 1 year ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20, 2025 • 79

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published Apr 8, 2025 • 110

upvoted an article over 1 year ago

Article

Digest of models based on YandexGPT 5 Lite

WaveCut

•

Mar 19, 2025

• 33

upvoted 2 papers over 1 year ago

HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning

Paper • 2501.02625 • Published Jan 5, 2025 • 15

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Paper • 2502.05003 • Published Feb 7, 2025 • 44

upvoted a collection almost 2 years ago

AQLM+PV

Official AQLM quantizations for "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression": https://arxiv.org/abs/2405.14852 • 26 items • Updated Feb 18 • 22