AI2 WildBench Leaderboard (V2)
Display LLM performance leaderboards with customizable views
Display LLM performance leaderboards with customizable views
View the LMArena leaderboard in fullβscreen
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
Check if your GPU can run a chosen LLM model
Extract custom entities from any text
Explore various text processing tasks with TURNA
Explore and submit LLM benchmarks
Generate text from document images
Analyze document layout from images
Extract text from documents using images or PDFs
Submit model evaluation results to leaderboard
Chat with an AI about any uploaded image
Efficient quantized retrieval over Wikipedia
Explore and compare model scores on RewardBench benchmarks
Highlight objects in images based on text descriptions
Explore Vision Arena visual AI demo online
VLMEvalKit Evaluation Results Collection
Launch a Streamlit web app interface
Visualize Open vs. Proprietary LLM Progress
Upload a PDF and ask questions about its content
Submit and view GAIA model evaluation leaderboard
Explore code-generation model leaderboards and task details
Transform text files into a Hugging Face dataset
Generate natural speech in 7000+ languages
Generate captions, detections, and segmentations for any image
Display a React app with TypeScript
Video captioning/tracking
Browse and compare visual document retrieval model scores
In-browser speech recognition w/ word-level timestamps
Answer questions about chart images
Need to analyze data? Let a Llama-3.1 agent do it for you!
Display MTEB Arena interface
View and submit LLM benchmark evaluations
Detect objects in images using text prompts
VLMEvalKit Eval Results in video understanding benchmark
Extract and format text from images with advanced OCR
Generate a leaderboard for evaluating language models
remove background from any image
View and compare openβsource AI model rankings with ELO scores
What happened in open-source AI this year, and whatβs next?
View and filter LLM hallucination leaderboard
Detect human poses in images and videos
Generate executable Jupyter notebooks from natural language prompts
Ranking of LLMs for agentic tasks
OmniParser, turn your LLM into GUI agent
Enhance low-light images to improve clarity
PDF to Structured Data powered by Google DeepMind Gemini 2.0
Handwritten Signature Detection
Convert document images to structured text and data
Chat with text, audio, images, and video, get spoken replies
Detect faces in uploaded images
Convert PDFs to Markdown with open-source parsers
Remove backgrounds from images instantly
A Unified Framework for Image Customization
Dolphin Demo
Create and enrich datasets using AI and web search
View OCR model leaderboard rankings
Hand-controlled arpeggiator, drum machine, and visualizer
Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
Display OCRBench leaderboard for text recognition models
coreOCR / Camel-Doc-OCR / docscopeOCR / MonkeyOCR
FireRed / Nanonets / Monkey / Thyme / Typhoon / SmolDocling
Run GGUF directly on your browser!
Extract text from images and XML files using OCR models
AI Image Detection Demo
Kontext image editing on FLUX[dev]
Classify text with zero-shot classification
High-accuracy vision & reasoning for complex tasks
Generate and run a Jupyter notebook from your description
Demo Space for EfficientLoFTR architecture in Transformers
Convert and query documents from images with AI
Chat with AI using text, audio, images, or video
Vision-Language Models for Document Conversion
Extract structured layout and text from PDFs or images
Compare original and improved OCR text from historical documents
Chat with AI using text and images
Turkish Benchmark Leaderboard of LLM Models
In-browser tool calling with IBM Granite-4.0
Solve complex questions with stepβbyβstep AI reasoning
Let Us Detect your multilingual hallucinations!
Launch an interactive web interface
Segment objects from images using natural language prompts
Analyze table tennis videos for insights and coaching advice
Generate brainβsteered text variants
Identify plant diseases from images
Turkish Sentence Embedding Benchmark Results
Transform image viewpoint with adjustable camera angles
Examples of OCR performance for LigthOnOCR-2-1B models
Extract text and layout from images or PDF documents
Measuring how wordy LLMs are when a short answer would do
Configurable Generalist Agent, leader in AppWorld Benchmark
Compare SOTA VLMs OCR models
Pergel: A Unified Benchmark for Evaluating Turkish LLMs
Extract text and bounding boxes from images
Explore TIPSv2 features, segmentations, depth and normals