Vision Coder OpenEnv
RL environment for screenshot-to-HTML generation
Real-time video captioning powered by FastVLM
Generate Python code from a brief project description
An agent that lets you chat with selected parts of a video
CLIP embeddings for both text and image similarity search