DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Paper
• 2401.02954
• Published • 53
Perspectives on the State and Future of Deep Learning - 2023
Paper
• 2312.09323
• Published • 8
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to
the Edge of Generalization
Paper
• 2405.15071
• Published • 42
Sibyl: Simple yet Effective Agent Framework for Complex Real-world
Reasoning
Paper
• 2407.10718
• Published • 19
LAB-Bench: Measuring Capabilities of Language Models for Biology
Research
Paper
• 2407.10362
• Published • 7
SciCode: A Research Coding Benchmark Curated by Scientists
Paper
• 2407.13168
• Published • 17
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Paper
• 2407.20183
• Published • 43
Building and better understanding vision-language models: insights and
future directions
Paper
• 2408.12637
• Published • 133
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like
Language Models
Paper
• 2409.11136
• Published • 23
Recursive Language Models
Paper
• 2512.24601
• Published • 94
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation
Paper
• 2601.09688
• Published • 127
MAXS: Meta-Adaptive Exploration with LLM Agents
Paper
• 2601.09259
• Published • 96