--- title: README emoji: 😻 colorFrom: pink colorTo: pink sdk: static pinned: false ---

**Nota AI bridges the gap between high-performance AI models and edge devices.**
From our automated **optimization platform** to bespoke **AI solutions**, we ensure your AI functions efficiently—everywhere it is needed. [![Website](https://img.shields.io/badge/Website-Nota%20AI-black?style=for-the-badge&logo=googlechrome&logoColor=white)](https://www.nota.ai/) [![LinkedIn](https://img.shields.io/badge/LinkedIn-Nota%20AI-0077B5?style=for-the-badge&logo=linkedin&logoColor=white)](https://www.linkedin.com/company/nota-inc/) [![NetsPresso](https://img.shields.io/badge/Platform-NetsPresso-002060?style=for-the-badge&logo=robot&logoColor=00C7E8)](https://netspresso.ai/)

# 🌟 Spotlight > ## **World Best LLM (WBL) Project** > Nota AI participates in the **'World Best LLM' (WBL)** project, a key initiative by the South Korean government (NIPA) to develop global-tier foundation models. As a core optimization partner, we focus on compressing massive LLMs for practical deployment. > > ## **🔥 New Release: [Solar-Open-100B-NotaMoEQuant-Int4](https://huggingface.co/nota-ai/Solar-Open-100B-NotaMoEQuant-Int4)** > **Quantized Model for Upstage's Solar-Open-100B** > > This model is optimized using our proprietary **NotaMoEQuant**, a specialized methodology for Mixture-of-Experts (MoE) architectures. > * **Why NotaMoEQuant:** Unlike conventional methods (e.g., AutoRound) that overlook expert routing changes during quantization, our approach directly resolves the resulting representational distortion, delivering superior benchmark accuracy. > * **Hardware Efficiency:** Reduces the GPU requirement for maximum context generation from **4x A100 (80GB) to 2x A100 (80GB)**, saving up to 50% on inference costs. > > *Also available: [Solar-Open-100B-Nota-FP8](https://huggingface.co/nota-ai/Solar-Open-100B-Nota-FP8)* # 🚀 Our Core Business

🛠️ AI Platform: NetsPresso

"We make AI lighter, faster, and ready for deployment."

NetsPresso is our proprietary platform that accelerates model optimization, enabling you to secure on-device latency and accuracy without deep hardware expertise.

Develop & Compress: Create lightweight models effortlessly using our Model Zoo and advanced Compressor (Structured Pruning).
Optimize & Convert: Maximize speed on verified hardware (NVIDIA, Arm, Qualcomm, etc.) with Graph Optimization and Graph Quantization.
Test on Real Devices: Validate performance instantly on actual devices via our Device Farm to eliminate deployment failures.

👉 Ready to optimize? Try NetsPresso Now | View Documentation

🌍 AI Solutions

"We provide end-to-end AI solutions powered by our core optimization technology."

1. Nota Vision Agent

Powered by Vision Language Models (VLM), this agent goes beyond simple detection to understand complex situational contexts.
It interprets video feeds through natural language prompts, delivering real-time insights locally without cloud dependency.

2. Edge AI Solutions

🚦 Intelligent Transportation Systems (ITS): We optimize real-time traffic control and analysis to enhance urban mobility.
🚗 Driver Monitoring System (DMS): We provide lightweight AI for driver safety, ensuring fast and accurate facial recognition inside vehicles.
🛡️ Security Surveillance: We deliver efficient video analytics designed to maintain public and industrial safety with minimal resource usage.

📚 Tech Blog

Gain insights into our engineering philosophy. We share deep dives into model compression methodologies and NPU acceleration techniques to help you stay ahead.

👉 Read our Tech Blog

🔗 Connect with Us

🏠 Official Website: nota.ai
🛠️ NetsPresso Platform: netspresso.ai
📧 Business Inquiries: contact@nota.ai

---