Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning • Paper 2505.24726 • Published May 30, 2025 • 277 upvotes
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning • Paper 2507.01006 • Published Jul 1, 2025 • 246 upvotes
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention • Paper 2506.13585 • Published Jun 16, 2025 • 273 upvotes
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning • Paper 2505.11896 • Published May 17, 2025 • 58 upvotes
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models • Paper 2505.02686 • Published May 5, 2025 • 16 upvotes
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures • Paper 2505.09343 • Published May 14, 2025 • 72 upvotes
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning • Paper 2501.12948 • Published Jan 22, 2025 • 429 upvotes
MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper 2501.08313 • Published Jan 14, 2025 • 300 upvotes
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models • Paper 2505.04921 • Published May 8, 2025 • 185 upvotes
Absolute Zero: Reinforced Self-play Reasoning with Zero Data • Paper 2505.03335 • Published May 6, 2025 • 188 upvotes
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters • Paper 2504.08791 • Published Apr 7, 2025 • 137 upvotes
A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency • Paper 2505.01658 • Published May 3, 2025 • 39 upvotes