EvoClaw-Bench

community

https://evo-claw.com/

AI & ML interests

Evaluating AI Agents on Continuous Tasks

Recent Activity

hyd2apse updated a dataset 14 days ago

EvoClaw-Bench/EvoClaw-log

hyd2apse published a dataset 20 days ago

EvoClaw-Bench/EvoClaw-log

hyd2apse updated a Space about 2 months ago

EvoClaw-Bench/README

updated a dataset 14 days ago

EvoClaw-Bench/EvoClaw-log

Updated 14 days ago • 232 • 1

published a Space about 2 months ago

README

authored 2 papers about 2 months ago

HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned Messaging

Paper • 2102.00824 • Published Jan 18, 2021

CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning

Paper • 2306.11128 • Published Jun 19, 2023

authored a paper 2 months ago

EvoClaw: Evaluating AI Agents on Continuous Software Evolution

Paper • 2603.13428 • Published Mar 13 • 21

authored a paper 2 months ago

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Paper • 2603.10160 • Published Mar 10 • 26

authored a paper 4 months ago

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40

authored a paper about 1 year ago

S'MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning

Paper • 2504.06426 • Published Apr 8, 2025 • 2

authored a paper over 1 year ago

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published Dec 30, 2024 • 26

authored 5 papers over 1 year ago

LLM-Rec: Personalized Recommendation via Prompting Large Language Models

Paper • 2307.15780 • Published Jul 24, 2023 • 28

Language Models are Graph Learners

Paper • 2410.02296 • Published Oct 3, 2024

Mixture of Weak & Strong Experts on Graphs

Paper • 2311.05185 • Published Nov 9, 2023 • 1

Decoupling the Depth and Scope of Graph Neural Networks

Paper • 2201.07858 • Published Jan 19, 2022 • 1

GraphSAINT: Graph Sampling Based Inductive Learning Method

Paper • 1907.04931 • Published Jul 10, 2019

authored 4 papers almost 2 years ago

Advancing LLM Reasoning Generalists with Preference Trees

Paper • 2404.02078 • Published Apr 2, 2024 • 46

Text-Based Reasoning About Vector Graphics

Paper • 2404.06479 • Published Apr 9, 2024

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

Paper • 2405.20974 • Published May 31, 2024

A Single Transformer for Scalable Vision-Language Modeling

Paper • 2407.06438 • Published Jul 8, 2024 • 1