Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Ytgetahun
/
visual-narrator-llm

Image-to-Text
English
video-to-text
vision-language-model
real-time
accessibility
video-narration
cinematic-description
Model card Files Files and versions
xet
Community
visual-narrator-llm / api
22.3 kB
Ctrl+K
Ctrl+K
  • 2 contributors
History: 2 commits
Ytgetahun's picture
Ytgetahun
Implement VN-003 Rekognition object detection
ac727b6 20 days ago
  • compatible_server.py
    10.1 kB
    Add API server, engine modules, Lambda handler, and .gitignore 21 days ago
  • fastapi_server.py
    6.45 kB
    Implement VN-003 Rekognition object detection 20 days ago
  • unified_visual_narrator_engine.py
    5.73 kB
    Add API server, engine modules, Lambda handler, and .gitignore 21 days ago