HiCRISP: A Hierarchical Closed-Loop Robotic Intelligent Self-Correction Planner Paper • 2309.12089 • Published Sep 21, 2023
CAMBranch: Contrastive Learning with Augmented MILPs for Branching Paper • 2402.03647 • Published Feb 6, 2024
Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models Paper • 2308.14149 • Published Aug 27, 2023 • 1
Panacea: A foundation model for clinical trial search, summarization, design, and recruitment Paper • 2407.11007 • Published Jun 25, 2024
Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval Paper • 2411.16454 • Published Nov 25, 2024
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning Paper • 2503.24289 • Published Mar 31 • 1
DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning Paper • 2503.00223 • Published Feb 28 • 1
Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning Paper • 2507.17842 • Published Jul 23
s3: You Don't Need That Much Data to Train a Search Agent via RL Paper • 2505.14146 • Published May 20 • 19
TrialPanorama: Database and Benchmark for Systematic Review and Design of Clinical Trials Paper • 2505.16097 • Published May 22
SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs Paper • 2509.20758 • Published Sep 25 • 1