--- title: README emoji: ๐Ÿ˜ป colorFrom: pink colorTo: pink sdk: static pinned: false ---
Nota AI Banner

**Nota AI bridges the gap between high-performance AI models and edge devices.**
From our automated **optimization platform** to bespoke **AI solutions**, we ensure your AI functions efficientlyโ€”everywhere it is needed. [![Website](https://img.shields.io/badge/Website-Nota%20AI-black?style=for-the-badge&logo=googlechrome&logoColor=white)](https://www.nota.ai/) [![LinkedIn](https://img.shields.io/badge/LinkedIn-Nota%20AI-0077B5?style=for-the-badge&logo=linkedin&logoColor=white)](https://www.linkedin.com/company/nota-inc/) [![NetsPresso](https://img.shields.io/badge/Platform-NetsPresso-002060?style=for-the-badge&logo=robot&logoColor=00C7E8)](https://netspresso.ai/)

# ๐ŸŒŸ Spotlight > ## **World Best LLM (WBL) Project** > Nota AI participates in the **'World Best LLM' (WBL)** project, a key initiative by the South Korean government (NIPA) to develop global-tier foundation models. As a core optimization partner, we focus on compressing massive LLMs for practical deployment. > > ## **๐Ÿ”ฅ New Release: [Solar-Open-100B-NotaMoEQuant-Int4](https://huggingface.co/nota-ai/Solar-Open-100B-NotaMoEQuant-Int4)** > **Quantized Model for Upstage's Solar-Open-100B** > > This model is optimized using our proprietary **NotaMoEQuant**, a specialized methodology for Mixture-of-Experts (MoE) architectures. > * **Why NotaMoEQuant:** Unlike conventional methods (e.g., AutoRound) that overlook expert routing changes during quantization, our approach directly resolves the resulting representational distortion, delivering superior benchmark accuracy. > * **Hardware Efficiency:** Reduces the GPU requirement for maximum context generation from **4x A100 (80GB) to 2x A100 (80GB)**, saving up to 50% on inference costs. > > *Also available: [Solar-Open-100B-Nota-FP8](https://huggingface.co/nota-ai/Solar-Open-100B-Nota-FP8)* # ๐Ÿš€ Our Core Business

๐Ÿ› ๏ธ AI Platform: NetsPresso

"We make AI lighter, faster, and ready for deployment."

NetsPresso is our proprietary platform that accelerates model optimization, enabling you to secure on-device latency and accuracy without deep hardware expertise.

  • Develop & Compress: Create lightweight models effortlessly using our Model Zoo and advanced Compressor (Structured Pruning).
  • Optimize & Convert: Maximize speed on verified hardware (NVIDIA, Arm, Qualcomm, etc.) with Graph Optimization and Graph Quantization.
  • Test on Real Devices: Validate performance instantly on actual devices via our Device Farm to eliminate deployment failures.

๐Ÿ‘‰ Ready to optimize? Try NetsPresso Now | View Documentation

๐ŸŒ AI Solutions

"We provide end-to-end AI solutions powered by our core optimization technology."

1. Nota Vision Agent

Powered by Vision Language Models (VLM), this agent goes beyond simple detection to understand complex situational contexts.
It interprets video feeds through natural language prompts, delivering real-time insights locally without cloud dependency.

2. Edge AI Solutions

๐Ÿ“š Tech Blog

Gain insights into our engineering philosophy. We share deep dives into model compression methodologies and NPU acceleration techniques to help you stay ahead.

๐Ÿ‘‰ Read our Tech Blog

๐Ÿ”— Connect with Us

---
ยฉ 2026 Nota Inc. All rights reserved.