Honglin Guo's picture

🔄 In a Training Loop

Honglin Guo

KYLN24

·

KYLN24

AI & ML interests

None yet

Recent Activity

authored a paper about 3 hours ago

AGORA: An Archive-Grounded Benchmark for Agentic Workplace Document Reasoning

submitted a paper 1 day ago

AGORA: An Archive-Grounded Benchmark for Agentic Workplace Document Reasoning

new activity 14 days ago

nex-agi/Nex-N2-Pro:Add WildClawBench evaluation result

View all activity

Organizations

authored a paper about 3 hours ago

AGORA: An Archive-Grounded Benchmark for Agentic Workplace Document Reasoning

Paper • 2606.24526 • Published 3 days ago

submitted a paper to Daily Papers 1 day ago

AGORA: An Archive-Grounded Benchmark for Agentic Workplace Document Reasoning

Paper • 2606.24526 • Published 3 days ago

New activity in nex-agi/Nex-N2-Pro 14 days ago

Add WildClawBench evaluation result

#1 opened 21 days ago by

liked a model 14 days ago

nex-agi/Nex-N2-mini

Text Generation • 35B • Updated 14 days ago • 15.2k • 256

liked a model 21 days ago

nex-agi/Nex-N2-Pro

Text Generation • 397B • Updated 14 days ago • 8.12k • 351

reacted to danieldk's post with 🔥 5 months ago

Post

2842

kernels 0.12 is out! 🎉

Changes:

* Support for kernel version branches to gracefully roll out kernel API changes.
* Support for PyTorch 2.10.
* kernel-builder is now merged into the kernels repo.
* Initial support for standardized kernel benchmarks.

https://github.com/huggingface/kernels/releases/tag/v0.12.0

upvoted a paper 5 months ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Paper • 2601.16480 • Published Jan 23 • 50

authored a paper 5 months ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published Jan 16 • 67

upvoted 2 papers 5 months ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published Jan 16 • 67

OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment

Paper • 2601.01576 • Published Jan 4 • 19

authored a paper 5 months ago

OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding

Paper • 2601.10343 • Published Jan 15 • 2

liked a dataset 6 months ago

nex-agi/agent-sft

Preview • Updated Dec 9, 2025 • 1.17k • 116

authored a paper 6 months ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 159

upvoted a paper 6 months ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 159

New activity in nex-agi/agent-sft 7 months ago

Improve dataset card: Add paper, code, project page links and tags

#3 opened 7 months ago by

Is it normal that there is no thinking process in the content?

#4 opened 7 months ago by

liked a dataset 7 months ago

allenai/Dolci-Think-SFT-Python

Viewer • Updated Jan 5 • 1.09M • 127 • 6

authored 3 papers 7 months ago

Better Process Supervision with Bi-directional Rewarding Signals

Paper • 2503.04618 • Published Mar 6, 2025

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 85