TRL-Bench Collection TRL-Bench: cross-paradigm representation-level evaluation of tabular encoders. CTbench + Rbench + DLTE. • 4 items • Updated May 6 • 4
FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published Apr 8 • 97
GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks Paper • 2504.12764 • Published Apr 17, 2025 • 42