OmniGAIA Towards Native Omni-Modal AI Agents Running 4 OmniGAIA Leaderboard 🏆 4 Benchmarking Native Omni-Modal AI Agents RUC-NLPIR/OmniGAIA Viewer • Updated 6 days ago • 360 • 2.15k • 6 RUC-NLPIR/Omnimodal-Agent-SFT-2K Viewer • Updated 6 days ago • 2.16k • 4.85k • 7 RUC-NLPIR/OmniAtlas-Qwen3-30B-A3B Image-Text-to-Text • 32B • Updated 6 days ago • 79 • 3
DeepImageSearch Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published 30 days ago • 55 RUC-NLPIR/DISBench Updated 9 days ago • 297 • 2 Running 2 DISBench Leaderboard 🏆 2 Explore and submit multimodal image retrieval benchmark results
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published 30 days ago • 55
GISA Running 2 GISA Leaderboard 🏆 2 Submit model predictions and view GISA leaderboard scores GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26 RUC-NLPIR/GISA Preview • Updated 28 days ago • 844 • 3
GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26
OmniEval An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Sleeping 7 OmniEval 🥇 7 Official Leaderboard for OmniEval OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41 RUC-NLPIR/OmniEval-KnowledgeCorpus Updated Dec 19, 2024 • 3.31k • 5 RUC-NLPIR/OmniEval-AutoGen-Dataset Updated Dec 19, 2024 • 46 • 6
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41
OmniGAIA Towards Native Omni-Modal AI Agents Running 4 OmniGAIA Leaderboard 🏆 4 Benchmarking Native Omni-Modal AI Agents RUC-NLPIR/OmniGAIA Viewer • Updated 6 days ago • 360 • 2.15k • 6 RUC-NLPIR/Omnimodal-Agent-SFT-2K Viewer • Updated 6 days ago • 2.16k • 4.85k • 7 RUC-NLPIR/OmniAtlas-Qwen3-30B-A3B Image-Text-to-Text • 32B • Updated 6 days ago • 79 • 3
GISA Running 2 GISA Leaderboard 🏆 2 Submit model predictions and view GISA leaderboard scores GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26 RUC-NLPIR/GISA Preview • Updated 28 days ago • 844 • 3
GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26
DeepImageSearch Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published 30 days ago • 55 RUC-NLPIR/DISBench Updated 9 days ago • 297 • 2 Running 2 DISBench Leaderboard 🏆 2 Explore and submit multimodal image retrieval benchmark results
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published 30 days ago • 55
OmniEval An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Sleeping 7 OmniEval 🥇 7 Official Leaderboard for OmniEval OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41 RUC-NLPIR/OmniEval-KnowledgeCorpus Updated Dec 19, 2024 • 3.31k • 5 RUC-NLPIR/OmniEval-AutoGen-Dataset Updated Dec 19, 2024 • 46 • 6
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41