-
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Paper • 2503.19470 • Published • 19 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 40 -
A Survey on Large Language Model Benchmarks
Paper • 2508.15361 • Published • 19 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 105
yangdechuan
yangdechuan
·
AI & ML interests
None yet
Organizations
None yet
LLM
-
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Paper • 2503.19470 • Published • 19 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 40 -
A Survey on Large Language Model Benchmarks
Paper • 2508.15361 • Published • 19 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 105
models 6
yangdechuan/mt5-small-finetuned-amazon-en-es
Summarization • 0.3B • Updated • 4
yangdechuan/bert-base-uncased
Feature Extraction • 0.1B • Updated • 7
yangdechuan/bert-base-cased
Feature Extraction • 0.1B • Updated • 8
yangdechuan/mt5-small-finetuned-amazon-en-es-accelerate
Updated • 4
yangdechuan/codeparrot-ds
Text Generation • Updated • 1
yangdechuan/bert-base-cased-finetuned-mrpc
Text Classification • Updated • 3