Arena Leaderboard
View the LMArena leaderboard in fullβscreen
A collection of tools as various HF Spaces on LLMs.
View the LMArena leaderboard in fullβscreen
Track, rank and evaluate open LLMs and chatbots
Generate speech in a cloned voice from a short audio clip
Launch a Streamlit web app interface
View and submit LLM evaluations
Pick a text splitter => visualize chunks. Great for RAG.
Create a model card for Hugging Face Hub
Compare LLM hardware performance and find the best model
Search, filter and submit LLM benchmark evaluations
Fine-tuning large language model with Gradio UI
Replace objects in images using prompts or reference images
High-fidelity Text-To-Speech
Identify named entities in text
Generate AI responses to any text prompt
Chat with a visual AI that answers questions about images
Annotate and describe images with text prompts
Extract custom entities from any text
Perform multiple NLP tasks like NER, QA, and summarization
Convert PDFs to a Hugging Face dataset
Generate instruction-response pairs from text
Quantize Hugging Face models to GGUF format instantly
Generate artist-style 3D meshes from any input model
Display a web page
Run Gemini Nano locally in your browser with Transformers.js
Answer questions about images using text prompts
Experiment with and compare different tokenizers
Generate speakerβlabeled transcripts from audio files
Compare two faces to verify identity
Segment images with prompts or automatic masks
All paper summaries read by Merve
Generate images from text prompts instantly
Generate images from text prompts
Display a React app with TypeScript
Summarize text from a PDF URL
Generate chat responses using FalconMamba-7b model
LLM for long context
Chat with Llama3.1 using spoken audio or synthesize speech
Generate text based on prompts
Travel through the model latent space
Upload a paper to get reviews and vote on quality
Find datasets and models using semantic search
Convert models to Safetensors and open a PR
Convert text to natural-sounding speech audio
Generate audioβready script from documents
Chat with a language model
Transcribe audio files into readable text
Transcribe audio or YouTube videos into text
RAG with source links inserted using LXT library.
Extract and format text from images with advanced OCR
Ask questions and get detailed answers
Deduplicate HuggingFace datasets in seconds
Generate text from audio recordings
Run code and get instant results with Qwen Code Interpreter
Refine your prompts
Interact with the Aya family of models.
Compare Open LLM Leaderboard results
Display a loading screen with a spinner
Convert Hugging Face models to MLX format and upload
Add vectors to Hub datasets and do in memory vector search.
Talk to Fixie.ai's Ultravox with WebRTC β‘οΈ
An analysis of LFS files on the Hub.
Demo for DocLayout-YOLO
Compress prompts while preserving key tokens
diffusion-based Image Restoration model
Prompt with Images in flux[dev]
Generate structured GitHub issues
PaliGemma2 LoRA finetuned on VQAv2
Interact with multiple chatbots simultaneously
Fantasy story generator
Generate executable Jupyter notebooks from natural language prompts
QwQ-32B-Preview
Generate and preview app code from a text description
Search, load and play with transformer pipelines
Generate 3D models from images
Aligns the tokens of two sentences
Upgraded to v1.0!
Small and powerful reasoning LLM that runs in your browser
Chat with an AI assistant (text and images)
Quickest way to test naive RAG run with AutoRAG.
Next-generation reasoning model that runs locally in-browser
Generate descriptions from images and text prompts
In-browser unified multimodal understanding and generation.
Need to analyze data? Let a Llama-3.1 agent do it for you!
β¨[With v1.0.0] Accelerated TTS on Kokoro-82M
Answer questions and run tools with an AI agent
Generate text answers or segment objects from images
The ultimate guide to training LLM on large GPU Clusters
Collection of marimo notebooks from a GitHub repository
Download and run a Hugging Face app
Magma-8B model for UI Agents
Answer questions using advanced AI
Blazingly Fast and Embarrassingly Simple Song Generation
Generate radial plots comparing language model performance
Contributing to OpenStreetMap with the help of AI
Convert images and sketches into graphics programs with TikZ
A bulk labelling interface for binary text classification
Generate any application by Vibe Coding it
Generate custom evaluations from your data easily!
Generate realistic dialogue from a script, using Dia!
Chat with an AI assistant that thinks before answering
Generate modified audio from text and voice
Expressive Zeroshot TTS
Create and enrich datasets using AI and web search
Submit model evaluations and view leaderboard results
https://nanonets.com/research/nanonets-ocr-s/
Visual Audio Question Answering
Translate text instantly between many languages
FireRed / Nanonets / Monkey / Thyme / Typhoon / SmolDocling
Generate detailed captions for your images
Edit images using natural language instructions
Duplicate this leaderboard to initialize your own!
Find similar images and match them across collections
Generate code snippets with AI for web and app frameworks
Real-time video captioning powered by FastVLM
Visualize embeddings in 3D space, powered by EmbeddingGemma
Interactive timeline to explore the π€Transformers models
Convert and query documents from images with AI
Run Granite-4.0-Micro 100% locally in your browser on WebGPU
Convert document images to HTML with a single click
Fast 4 step inference with Qwen Image Edit 2509
Fast 4 step inference with Qwen Image Edit 2509
Configurable Generalist Agent, leader in AppWorld Benchmark
Generate speech from text using voice design, cloning or presets
Transcribe audio to text with timestamps and visualization
Run GPT-OSS-20B locally in your browser on WebGPU
Build, train, and run LLMs in the browser
Run LFM2.5-1.2B-Thinking directly in your browser on WebGPU
Run Olmo-Hybrid-7B 100% locally in your browser on WebGPU
Space for LuxTTS: a 150x realtime voice cloning TTS model
State-of-the-art image generation, in your browser.