Viktor Nedov's picture

Viktor Nedov

aiwannatry

·

AI & ML interests

None yet

Organizations

upvoted a paper 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 451

upvoted a collection 3 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.65k

upvoted a paper 4 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 232

upvoted a collection 6 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.8k

upvoted an article over 1 year ago

Article

Welcome to Inference Providers on the Hub 🔥

+5

burkaygur, zeke, aton2006, hassanelmghari, sbrandeis, kramp, julien-c

•

Jan 28, 2025

• 495

upvoted a paper over 1 year ago

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Paper • 2401.04658 • Published Jan 9, 2024 • 27