noamrot
/

FuseCap_Image_Captioning

image-text-to-text

image-captioning

Model card Files Files and versions

noamrot commited on May 31, 2023

Commit

2355c93

·

1 Parent(s): e2f2b2b

Update README.md

Files changed (1) hide show

README.md +28 -3

README.md CHANGED Viewed

@@ -1,3 +1,28 @@
----
-license: mit
----

+# FuseCap: Leveraging Large Language Models to Fuse Visual Data into Enriched Image Captions
+A framework designed to generate semantically rich image captions.
+## Resources
+- 💻 **Project Page**: For more details, visit the official [project page](https://rotsteinnoam.github.io/FuseCap/).
+- 📝 **Read the Paper**: You can find the paper [here](https://arxiv.org/abs/2305.17718).
+- 🚀 **Demo**: Try out our BLIP-based model [demo](https://huggingface.co/spaces/noamrot/FuseCap) trained using FuseCap, hosted on Huggingface Spaces.
+## Upcoming Updates
+The official codebase and trained models for this project will be released soon.
+## BibTeX
+``` Citation
+@misc{rotstein2023fusecap,
+      title={FuseCap: Leveraging Large Language Models to Fuse Visual Data into Enriched Image Captions},
+      author={Noam Rotstein and David Bensaid and Shaked Brody and Roy Ganz and Ron Kimmel},
+      year={2023},
+      eprint={2305.17718},
+      archivePrefix={arXiv},
+      primaryClass={cs.CV}
+}
+```