EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements Paper • 2506.08762 • Published Jun 10, 2025
How Can I Publish My LLM Benchmark Without Giving the True Answers Away? Paper • 2505.18102 • Published May 23, 2025
WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models Paper • 2510.22276 • Published Oct 25, 2025 • 3
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens Paper • 2509.14882 • Published Sep 18, 2025 • 1
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements Paper • 2506.08762 • Published Jun 10, 2025
llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length Paper • 2504.15544 • Published Apr 22, 2025
On the Optimal Reasoning Length for RL-Trained Language Models Paper • 2602.09591 • Published Feb 10 • 5
DroPE Collection Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding (https://www.arxiv.org/abs/2512.12167) • 1 item • Updated Jan 11 • 2