metadata
license: mit
task_categories:
- video-classification
- reinforcement-learning
- robotics
language:
- en
tags:
- Chain-of-Frames
- Video-Reasoning
- Visual-Planning
- Maze
- Wan
size_categories:
- 10K<n<100K
base_model:
- Wan-AI/Wan2.2-TI2V-5B
pipeline_tag: image-to-video
🎯 Wan‑R1: A Reasoning‑via‑Video Maze‑Solving Model 🎯
🧠 Models
| Model | Download | Description |
|---|---|---|
| Wan_R1_3d_maze_5B | 🤗 HuggingFace | Fine-tuned LoRA for Maze3D tasks (easy, medium, and hard) from the base model Wan2.2-TI2V-5B. |
| Wan_R1_irregular_maze_5B | 🤗 HuggingFace | Fine-tuned LoRA for PathFinder tasks (easy, medium, and hard) from base model Wan2.2-TI2V-5B. |
| Wan_R1_regular_maze_5B | 🤗 HuggingFace | Fine-tuned LoRA for Maze tasks (easy, medium, and hard) from base model Wan2.2-TI2V-5B. |
| Wan_R1_sokoban_5B | 🤗 HuggingFace | Fine-tuned LoRA for Sokoban tasks (easy, medium, and hard) from base model Wan2.2-TI2V-5B. |
| Wan_R1_trapfield_5B | 🤗 HuggingFace | Fine-tuned LoRA for TrapField tasks (easy, medium, and hard) from base model Wan2.2-TI2V-5B. |