RNGBench Collection Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games • 2 items • Updated 11 days ago • 1
CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning Paper • 2606.09393 • Published 26 days ago
Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games Paper • 2606.19338 • Published 17 days ago • 49
OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs Paper • 2606.03890 • Published Jun 2 • 31
SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction Paper • 2605.20110 • Published May 19 • 4