5 26

Zijun

TranSirius

TranSirius

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery

upvoted a paper 29 days ago

Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders

upvoted a paper 5 months ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

View all activity

Organizations

upvoted a paper 13 days ago

EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery

Paper • 2606.13662 • Published 14 days ago • 27

upvoted a paper 29 days ago

Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders

Paper • 2605.27354 • Published about 1 month ago • 15

upvoted a paper 5 months ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 48

upvoted 2 papers 9 months ago

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2, 2025 • 57

SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression

Paper • 2509.25176 • Published Sep 29, 2025 • 14

upvoted 4 papers about 1 year ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23, 2025 • 57

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Paper • 2506.04180 • Published Jun 4, 2025 • 35

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Paper • 2505.11594 • Published May 16, 2025 • 77

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19, 2025 • 83

commented a paper about 1 year ago

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes

Paper • 2504.15270 • Published Apr 21, 2025 • 9 •

upvoted a paper about 1 year ago

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes

Paper • 2504.15270 • Published Apr 21, 2025 • 9

upvoted 2 papers over 1 year ago

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Paper • 2502.19328 • Published Feb 26, 2025 • 23

NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization

Paper • 2502.14638 • Published Feb 20, 2025 • 11

published 2 models over 1 year ago

THU-KEG/OpenSAE-LLaMA-3.1-Layer_05-shift_back

2B • Updated Jan 28, 2025 • 3

THU-KEG/OpenSAE-LLaMA-3.1-Layer_04-shift_back

2B • Updated Jan 28, 2025 • 2

updated a collection over 1 year ago

OpenSAE-LLaMA-3.1-8B

Collection

OpenSAE checkpoints for LLaMA 3.1 8B base model • 38 items • Updated Jan 29, 2025 • 5

updated 3 models over 1 year ago

Zijun

AI & ML interests

Recent Activity

Organizations

TranSirius's activity