1 25 16

dma2077 PRO

dma2077

AI & ML interests

None yet

Recent Activity

liked a dataset 18 days ago

m-a-p/OProofs

liked a dataset 29 days ago

nvidia/Nemotron-CC-Math-v1

upvoted a collection about 1 month ago

OProver

View all activity

Organizations

upvoted a collection about 1 month ago

OProver

Collection

9 items • Updated May 19 • 3

upvoted a paper about 1 month ago

OProver: A Unified Framework for Agentic Formal Theorem Proving

Paper • 2605.17283 • Published May 17 • 31

upvoted a paper 3 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 312

upvoted a paper 4 months ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 201

upvoted 2 papers 5 months ago

sangkuriang: A pseudo-spectral Python library for Korteweg-de Vries soliton simulation

Paper • 2601.12029 • Published Jan 17 • 2

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 196

upvoted 2 papers 7 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 107

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 306

upvoted a paper 8 months ago

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published Oct 27, 2025 • 87

upvoted a paper 9 months ago

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning

Paper • 2509.13160 • Published Sep 16, 2025 • 31

upvoted a paper 10 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 128

upvoted 3 papers 12 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9, 2025 • 24

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published Jul 8, 2025 • 60

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 133

upvoted 3 papers about 1 year ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15, 2025 • 64

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21, 2025 • 53

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8, 2025 • 187

upvoted an article about 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889

upvoted 2 papers about 1 year ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published Apr 21, 2025 • 23

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7, 2025 • 44

dma2077 PRO

AI & ML interests

Recent Activity

Organizations

dma2077's activity

Open-R1: a fully open reproduction of DeepSeek-R1