era-temporary

Recent Activity

FlippyDora authored a paper 16 days ago

EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents

FlippyDora authored a paper 16 days ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

FlippyDora authored a paper 16 days ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

authored 5 papers 16 days ago

EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents

Paper • 2412.13549 • Published Dec 18, 2024

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published Oct 13, 2025 • 26

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published Oct 14, 2025 • 28

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

Paper • 2603.13985 • Published Mar 14 • 10

AgentSPEX: An Agent SPecification and EXecution Language

Paper • 2604.13346 • Published 25 days ago • 162

submitted a paper to Daily Papers about 2 months ago

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

Paper • 2603.13985 • Published Mar 14 • 10

authored a paper and submitted it to Daily Papers 4 months ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

Paper • 2601.10201 • Published Jan 15 • 9

published and updated a model 6 months ago

era-temporary/openvla-7b-era_dataset-b16-lr-0.0005-lora-r32-dropout-0.0

8B • Updated Nov 21, 2025 • 3

authored a paper 7 months ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published Oct 14, 2025 • 28

published and updated a model 8 months ago

era-temporary/eb_alfred_sft_best

4B • Updated Sep 24, 2025 • 1

published and updated a model 8 months ago

era-temporary/eb_man_sft_best

4B • Updated Sep 24, 2025 • 1

published and updated a model 8 months ago

era-temporary/eb_alfred_sft_stage1_grounding_action_full_planning_randomized

4B • Updated Sep 18, 2025 • 3

published and updated a model 8 months ago

era-temporary/eb-alfred-external-know-env-anchored-lr1e-5-full-e1-bs-16

4B • Updated Sep 16, 2025 • 3

updated a model 8 months ago

era-temporary/eb_alfred_sft_openo1_1w

4B • Updated Sep 16, 2025 • 1