tencent/HY3D-Bench • Updated • 16 • 18
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation
CL-bench: A Benchmark for Context Learning