Andrei Semenov's picture

Andrei Semenov

Andron00e

·

https://andron00e.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

Enhancing LLM Training via Spectral Clipping

upvoted a paper about 2 months ago

Enhancing LLM Training via Spectral Clipping

upvoted a paper 6 months ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

View all activity

Organizations

authored a paper about 1 month ago

Enhancing LLM Training via Spectral Clipping

Paper • 2603.14315 • Published Mar 15 • 1

authored 3 papers 8 months ago

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Paper • 2509.14233 • Published Sep 17, 2025 • 18

Gradient Clipping Improves AdaGrad when the Noise Is Heavy-Tailed

Paper • 2406.04443 • Published Jun 6, 2024

Benchmarking Optimizers for Large Language Model Pretraining

Paper • 2509.01440 • Published Sep 1, 2025 • 25

authored a paper about 2 years ago

Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning

Paper • 2404.03323 • Published Apr 4, 2024 • 3