Spaces:

VideoReason
/

README

Running

App Files Files Community

juyil commited on Dec 3, 2025

Commit

c656565

verified ·

1 Parent(s): 79257b6

Update README.md

Browse files

Files changed (1) hide show

README.md +69 -1

README.md CHANGED Viewed

@@ -7,4 +7,72 @@ sdk: static
 pinned: false
 ---
-Edit this `README.md` markdown file to author your organization card.

 pinned: false
 ---
+# VMEvalKit 🎥🧠
+<div align="center">
+[![results](https://img.shields.io/badge/Result-A42C2?style=for-the-badge&logo=googledisplayandvideo360&logoColor=white)](https://grow-ai-like-a-child.com/video-reason/)
+[![Paper](https://img.shields.io/badge/Paper-A42C25?style=for-the-badge&logo=arxiv&logoColor=white)](https://github.com/hokindeng/VMEvalKit/paper/video-models-start-to-solve/Video_Model_Start_to_Solve.pdf)
+[![Hugging Face](https://img.shields.io/badge/hf-fcd022?style=for-the-badge&logo=huggingface&logoColor=white)](https://huggingface.co/VideoReason)
+[![WeChat](https://img.shields.io/badge/WeChat-07C160?style=for-the-badge&logo=wechat&logoColor=white)](https://github.com/hokindeng/VMEvalKit/issues/132)
+</div>
+A framework to evaluate reasoning capabilities in video generation models at scale.
+<p align="center">
+</p>
+![VMEvalKit Framework](https://github.com/hokindeng/VMEvalKit/paper/video-models-start-to-solve/assets/draft_1.jpg)
+## 🎬 Supported Models
+VMEvalKit provides unified access to **40 video generation models** across **11 provider families**:
+For commercial APIs, we support Luma Dream Machine, Google Veo, Google Veo 3.1, WaveSpeed WAN 2.1, WaveSpeed WAN 2.2, Runway ML, OpenAI Sora. For open-source models, we support HunyuanVideo, VideoCrafter, DynamiCrafter, Stable Video Diffusion, Morphic, LTX-Video, and so on. See [here](docs/models/README.md) for details.
+## Invitation to Collaborate 🤝
+VMEvalKit is meant to be a permissively open-source **shared playground** for everyone. If you’re interested in machine cognition, video models, evaluation, or anything anything 🦄✨, we’d love to build with you:
+* 🧪 Add new reasoning tasks (planning, causality, social, physical, etc.)
+* 🎥 Plug in new video models (APIs or open-source)
+* 📊 Experiment with better evaluation metrics and protocols
+* 🧱 Improve infrastructure, logging, and the web dashboard
+* 📚 Use VMEvalKit in your own research and share back configs/scripts
+* 🌟🎉 Or Anything anything 🦄✨
+💬 **Join us on Slack** to ask questions, propose ideas, or start a collab:
+[Slack Invite](https://join.slack.com/t/growingailikeachild/shared_invite/zt-309yqd0sl-W8xzOkdBPha1Jh5rnee78A) 🚀
+## Research
+Here we keep track of papers spinned off from this code infrastructure and some works in progress.
+- [**"Video Models Start to Solve Chess, Maze, Sudoku, Mental Rotation, and Raven's Matrices"**](paper/video-models-start-to-solve/Video_Model_Start_to_Solve.pdf)
+This paper implements our experimental framework and demonstrates that leading video generation models (Sora-2 etc) can perform visual reasoning tasks with >60% success rates. See [**results**](https://grow-ai-like-a-child.com/video-reason/).
+## License
+Apache 2.0
+## Citation
+If you find VMEvalKit useful in your research, please cite:
+```bibtex
+@misc{VMEvalKit,
+  author       = {VMEvalKit Team},
+  title        = {VMEvalKit: A framework for evaluating reasoning abilities in foundational video models},
+  year         = {2025},
+  howpublished = {\url{https://github.com/Video-Reason/VMEvalKit}}
+}
+```