OceanirAI/Oculus

Image-Text-to-Text
Safetensors
English
oceanir
oculus
vision
multimodal
vision-language
vqa
image-captioning
object-detection
research
training
Repository size: 4.78 GB
  • 1 contributor
History: 33 commits
kobiakor15
Upload LICENSE with huggingface_hub
5cac238 verified 5 days ago
  • checkpoints
    Upload folder using huggingface_hub 5 days ago
  • docs
    Upload docs/TRAINING_ROADMAP.md with huggingface_hub 5 days ago
  • logs
    Upload logs/training_v2_final.log with huggingface_hub 5 days ago
  • oculus_model
    Upload oculus_model/config.json with huggingface_hub 5 days ago
  • oculus_unified_model
    Upload oculus_unified_model/README.md with huggingface_hub 5 days ago
  • training
    Upload training/train_oculus_coco.py with huggingface_hub 5 days ago
  • .gitattributes
    1.52 kB
    initial commit 5 days ago
  • LICENSE
    2.4 kB
    Upload LICENSE with huggingface_hub 5 days ago
  • README.md
    6.11 kB
    Upload README.md with huggingface_hub 5 days ago
  • benchmark_vlm.py
    18.5 kB
    Upload benchmark_vlm.py with huggingface_hub 5 days ago
  • config.json
    1.01 kB
    Upload config.json with huggingface_hub 5 days ago
  • demo_caption_vqa.py
    13.9 kB
    Upload demo_caption_vqa.py with huggingface_hub 5 days ago
  • demo_oculus.py
    7.41 kB
    Upload demo_oculus.py with huggingface_hub 5 days ago
  • demo_oculus_unified.py
    8.65 kB
    Upload demo_oculus_unified.py with huggingface_hub 5 days ago
  • eval_benchmarks.py
    20.6 kB
    Upload eval_benchmarks.py with huggingface_hub 5 days ago
  • oculus_inference.py
    3.7 kB
    Upload oculus_inference.py with huggingface_hub 5 days ago