Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

JavisVerse

community
https://javisverse.github.io/
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

ChocoWu  authored a paper 6 days ago
Audio-Visual Intelligence in Large Foundation Models
scofield7419  authored a paper 9 days ago
CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models
scofield7419  authored a paper 9 days ago
Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining
View all activity

Papers

JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

View all Papers

KAI LIU's profile pictureHao Fei's profile pictureShengqiong Wu's profile pictureLi Bobo's profile pictureQIN YOU's profile picture

JavisVerse 's datasets 7

JavisVerse/AV-FineTune

Viewer • Updated Jan 3 • 1.43M • 46

JavisVerse/JavisUnd-Eval

Updated Dec 31, 2025 • 19

JavisVerse/MM-PreTrain

Viewer • Updated Dec 31, 2025 • 340k • 94

JavisVerse/JavisInst-Omni

Viewer • Updated Dec 30, 2025 • 91.4k • 68 • 1

JavisVerse/JavisBench

Viewer • Updated Sep 29, 2025 • 22.3k • 160

JavisVerse/JavisData-Audio

Viewer • Updated Jun 12, 2025 • 788k • 98

JavisVerse/TAVGBench_clean

Viewer • Updated Apr 12, 2025 • 1.58M • 19
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs