2 4 1

Jiacheng Lin

linjc16

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago

OpenResearcher/OpenResearcher

upvoted a paper about 2 months ago

SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs

upvoted a paper about 2 months ago

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

View all activity

Organizations

liked a Space about 1 month ago

OpenResearcher

🏃

Answer questions using web searches and citations

upvoted 2 papers about 2 months ago

SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs

Paper • 2509.20758 • Published Sep 25, 2025 • 2

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Paper • 2602.01058 • Published Feb 1 • 42

upvoted a paper 2 months ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 202

updated a model 3 months ago

TrialPanorama/Qwen-3-8B-TP

Text Generation • Updated Dec 25, 2025 • 16

published a model 3 months ago

TrialPanorama/Qwen-3-8B-TP

Text Generation • Updated Dec 25, 2025 • 16

updated a model 3 months ago

TrialPanorama/LLaMA-3-8B-TP

Text Generation • Updated Dec 25, 2025 • 3

published a model 3 months ago

TrialPanorama/LLaMA-3-8B-TP

Text Generation • Updated Dec 25, 2025 • 3

authored 12 papers 3 months ago

HiCRISP: A Hierarchical Closed-Loop Robotic Intelligent Self-Correction Planner

Paper • 2309.12089 • Published Sep 21, 2023

CAMBranch: Contrastive Learning with Augmented MILPs for Branching

Paper • 2402.03647 • Published Feb 6, 2024

Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models

Paper • 2308.14149 • Published Aug 27, 2023 • 1

Panacea: A foundation model for clinical trial search, summarization, design, and recruitment

Paper • 2407.11007 • Published Jun 25, 2024

Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval

Paper • 2411.16454 • Published Nov 25, 2024

Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning

Paper • 2503.24289 • Published Mar 31, 2025 • 1

DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning

Paper • 2503.00223 • Published Feb 28, 2025 • 1

Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning

Paper • 2507.17842 • Published Jul 23, 2025

s3: You Don't Need That Much Data to Train a Search Agent via RL

Paper • 2505.14146 • Published May 20, 2025 • 20

TrialPanorama: Database and Benchmark for Systematic Review and Design of Clinical Trials

Paper • 2505.16097 • Published May 22, 2025

SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs

Paper • 2509.20758 • Published Sep 25, 2025 • 2

Adaptation of Agentic AI

Paper • 2512.16301 • Published Dec 18, 2025 • 108

Jiacheng Lin

AI & ML interests

Recent Activity

Organizations

linjc16's activity

OpenResearcher