PyPE: Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding
For more details, please refer to Github: PyPE.
- Downloads last month
- 1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support