mannaandpoem

manna-ai

https://github.com/mannaandpoem

AI & ML interests

None yet

Recent Activity

upvoted 2 papers 4 days ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Paper • 2606.19236 • Published 13 days ago • 13

PatchWorld: Gradient-Free Optimization of Executable World Models

Paper • 2605.30880 • Published May 29 • 12

upvoted a paper 29 days ago

PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models

Paper • 2605.20873 • Published May 20 • 44

upvoted 2 papers 4 months ago

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

Paper • 2603.01571 • Published Mar 2 • 34

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published Mar 2 • 64

upvoted 4 papers 5 months ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published Jan 14 • 128

upvoted a paper 7 months ago

AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning

Paper • 2511.19304 • Published Nov 24, 2025 • 92

upvoted 2 papers 8 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 233

ReCode: Unify Plan and Action for Universal Granularity Control

Paper • 2510.23564 • Published Oct 27, 2025 • 123

upvoted an article 8 months ago

Smol2Operator: Post-Training GUI Agents for Computer Use

Article • A-Mahla, merve, sergiopaniego, reach-vb, lewtun • Sep 23, 2025 • 138

upvoted a collection 12 months ago

Seed-Coder

Collection • 4 items • Updated May 13, 2025 • 27

upvoted 3 papers about 1 year ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26, 2025 • 104

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 185

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31, 2025 • 305

upvoted a collection about 1 year ago

GLM-4-0414

Collection • GLM-4-0414 series model • 6 items • Updated Mar 2 • 135

upvoted a paper about 1 year ago

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Paper • 2504.02605 • Published Apr 3, 2025 • 49

upvoted a paper about 2 years ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22, 2024 • 126
