31 31

mannaandpoem

manna-ai

https://github.com/mannaandpoem

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

upvoted a paper about 8 hours ago

PatchWorld: Gradient-Free Optimization of Executable World Models

upvoted a paper 25 days ago

PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models

View all activity

Organizations

upvoted 2 papers about 8 hours ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Paper • 2606.19236 • Published 9 days ago • 13

PatchWorld: Gradient-Free Optimization of Executable World Models

Paper • 2605.30880 • Published 28 days ago • 12

upvoted a paper 25 days ago

PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models

Paper • 2605.20873 • Published May 20 • 44

liked a dataset about 1 month ago

rl-research/dr-tulu-sft-data

Viewer • Updated Nov 25, 2025 • 13.1k • 197 • 29

liked a model 2 months ago

tencent/Hy3-preview

Text Generation • 299B • Updated Apr 23 • 85.4k • 281

liked a dataset 3 months ago

OpenResearcher/OpenResearcher-Dataset

Viewer • Updated Mar 25 • 97.6k • 8.22k • 129

upvoted 2 papers 4 months ago

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

Paper • 2603.01571 • Published Mar 2 • 34

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published Mar 2 • 64

liked a model 4 months ago

zai-org/GLM-4.7-Flash

Text Generation • 31B • Updated Jan 29 • 2.16M • • 1.76k

upvoted 4 papers 5 months ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published Jan 14 • 128

liked a dataset 6 months ago

CognitiveKernel/WebAggregatorQA

Updated Oct 17, 2025 • 484 • 4

upvoted a paper 7 months ago

AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning

Paper • 2511.19304 • Published Nov 24, 2025 • 92

upvoted 2 papers 8 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 233

ReCode: Unify Plan and Action for Universal Granularity Control

Paper • 2510.23564 • Published Oct 27, 2025 • 123

upvoted an article 8 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

A-Mahla, merve, sergiopaniego, reach-vb, lewtun

•

Sep 23, 2025

• 138

upvoted a collection 12 months ago

Seed-Coder

Collection

4 items • Updated May 13, 2025 • 27

liked a dataset about 1 year ago

hjshah/bfcl_v3

Viewer • Updated May 7, 2025 • 4.44k • 16 • 1

mannaandpoem

AI & ML interests

Recent Activity

Organizations

manna-ai's activity

Smol2Operator: Post-Training GUI Agents for Computer Use