RhapsodyAI
/

MiniCPM-V-Embedding-preview

Feature Extraction

information retrieval

embedding model

visual information retrieval

Model card Files Files and versions

bokesyo commited on Jul 14, 2024

Commit

6696a1b

·

verified ·

1 Parent(s): e74da3d

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -15,6 +15,14 @@ license: apache-2.0
 The model only takes images as document-side inputs and produce vectors representing document pages. `minicpm-visual-embedding-v0` is trained with over 200k query-visual document pairs, including textual document, visual document, arxiv figures, plots, charts, industry documents, textbooks, ebooks, and openly-available PDFs, etc. The performance of `minicpm-visual-embedding-v0` is on a par with our ablation text embedding model on text-oriented documents, and an advantages on visually-intensive documents.
 ![Memex Archtechture](images/memex.png)
 # News

 The model only takes images as document-side inputs and produce vectors representing document pages. `minicpm-visual-embedding-v0` is trained with over 200k query-visual document pairs, including textual document, visual document, arxiv figures, plots, charts, industry documents, textbooks, ebooks, and openly-available PDFs, etc. The performance of `minicpm-visual-embedding-v0` is on a par with our ablation text embedding model on text-oriented documents, and an advantages on visually-intensive documents.
+Our model is capable of:
+- Help you read a long visually-intensive or text-oriented PDF document and find the pages that answer your question.
+- Help you build a personal library and retireve book pages from a large collection of books.
+- It works like human: read and comprehend with **vision** and remember **multimodal** information in hippocampus.
 ![Memex Archtechture](images/memex.png)
 # News