@prithivMLmods on Hugging Face: "GLM OCR, a multimodal OCR model for complex document understanding, built on…"

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

posted an update Feb 5

Post

908

GLM OCR, a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture. It delivers high accuracy and strong generalization with a blazing-fast inference pipeline. The demo is live . Try it now. 🤗🚀

✨ Demo: prithivMLmods/GLM-OCR-Demo
✨ Multimodal Implementations: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
✨ GitHub: https://github.com/PRITHIVSAKTHIUR/GLM-OCR-Demo

In this post

prithivMLmods Prithiv Sakthi