Waseem AlShikh
wassemgtk
AI & ML interests
Multi-modal, Palmyra LLMs, Knowledge Graph
Recent Activity
liked a model 2 days ago
wassemgtk/glm-5.2-visual-runtime updated a model 3 days ago
wassemgtk/glm-5.2-visual-runtime posted an update 3 days ago
Built GLM-5.2-visual-runtime: a training-free multimodal runtime gateway that makes GLM-5.2 work like a vision-capable model.
It keeps images as persistent visual variables, runs local visual/OCR/chart/palette tools only when needed, and sends compact structured evidence to the reasoning model instead of retraining or modifying weights.
The one-click stack includes GLM-5.2 via vLLM, Qwen3-Omni for vision/omni input, local OCR, Postgres, MinIO, and an OpenAI-compatible API.
Model repo: https://huggingface.co/wassemgtk/glm-5.2-visual-runtime