| --- |
| title: VRFAI |
| emoji: π€ |
| colorFrom: blue |
| colorTo: yellow |
| sdk: static |
| pinned: false |
| --- |
| |
| # VRFAI β Edge AI & Model Optimization |
|
|
| We optimize and deploy **LLMs, ASR, VLM and VLA (Vision-Language-Action) models** on real-world systems. |
|
|
| ## π§ What we do |
| - Optimization: quantization (INT8/INT4/FP8/NVFP4), pruning, distillation, ... |
| - Deployment: VLLM, TensorRT, ONNX Runtime, edge runtimes |
| - Systems: real-time pipelines (vision, audio, language, action) |
|
|
| ## π― Focus |
| - Edge devices (Jetson, SoCs) |
| - Robotics & VLA systems |
| - Latency, stability, deployability |
|
|
| ## β‘ Philosophy |
| Optimization = **model + runtime + system** |
|
|
| --- |
|
|
| **VRFAI β making AI models fast, efficient, and real** |