Models and datasets of "A Controlled Study on Long Context Extension and Generalization in LLMs"
Luyi
lulululuyi
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation upvoted a paper 3 months ago
AI Can Learn Scientific Taste updated a collection 4 months ago
R-HORIZON ModelsOrganizations
None yet
TDAR-Evaluation
R-HORIZON Models
models of R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
Long Context Controlled Study
Models and datasets of "A Controlled Study on Long Context Extension and Generalization in LLMs"
TDAR-8B-Thinking
TDAR-Evaluation
R-HORZION Datasets
Training and evaluation datasets of R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
R-HORIZON Models
models of R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?