PolaroidVL Installer
install PolaroidVL Model locally on your devices
install PolaroidVL Model locally on your devices
In-browser speech recognition w/ word-level timestamps
Generate Talking avatars from Text-to-Speech
LLM for long context
Clean your room with the help of AI
Enhance image resolution to 16 megapixels
Upscale images using various models
a mini vision-language ai model
Generate images by repairing and modifying masked areas
Enhance and upscale images with HDR and tile control
Transform text into engaging podcast dialogue
Personalised Podcasts For All - Available in 13 Languages
Video Dubbing with Open Source Projects
Explore and filter research papers, then chat with them
Generate spokenβstyle scripts from documents
Upscale an image by 4x using FLUX
Upscale images with AI-powered high-resolution enhancement
Remove image backgrounds and get transparent PNGs
Transcribe or translate audio and YouTube videos to text
Transcribe audio files and YouTube videos into text
Paper Whisperer
Modify images using text guidance
Remove/Change background of video.
Efficient T2V generation
Demo for DocLayout-YOLO
Apache Licensed Advanced Video Generation Model
Restore and enhance images with text prompts
Generate detailed pony images from text prompts
A unified multimodal understanding and generation model.
OpenAI's Deep Research, but open
Generate 3D models from images
In-browser unified multimodal understanding and generation.
A curated collection of AI tools for journalists & creators
Integrate Hugging Face models into Google Sheets
Chatting with scientific papers made easy
Chat with Eagle2-VL to generate text based on text and images
Transcribe audio files into timestamped text and subtitles
Inpaint videos by adding masks and removing unwanted objects