Same Claim, Different Judgment: Benchmarking Scenario-Induced Bias in Multilingual Financial Misinformation Detection Paper β’ 2601.05403 β’ Published Jan 8 β’ 10
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning Paper β’ 2509.22576 β’ Published Sep 26, 2025 β’ 137
MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation Paper β’ 2506.14028 β’ Published Jun 16, 2025 β’ 93