-
nota-ai/Solar-Open-100B-NotaMoEQuant-NVFP4
Text Generation β’ 59B β’ Updated β’ 116 β’ 3 -
nota-ai/Solar-Open-100B-Nota-FP8
Text Generation β’ Updated β’ 35 β’ 31 -
nota-ai/Solar-Open-100B-NotaMoEQuant-Int4
Text Generation β’ Updated β’ 2.12k β’ 44 -
nota-ai/Qwen3-30B-A3B-NotaMoEQuant-Int4
Text Generation β’ 0.6B β’ Updated β’ 5 β’ 8
AI & ML interests
Hardware-aware AI Model Optimization
Recent Activity
Nota AI bridges the gap between high-performance AI models and edge devices.
From our automated optimization platform to bespoke AI solutions, we ensure your AI functions efficientlyβeverywhere it is needed.
π Spotlight
Sovereign AI Foundation Model Project
Nota AI participates in the Sovereign AI Foundation Model Project, a key initiative by the South Korean government (NIPA) to develop global-tier foundation models. As a core optimization partner, we focus on compressing massive LLMs for practical deployment.
Optimized Solar-Open-100B Series
π Official Quantized Model: Solar-Open-100B-NotaMoEQuant-Int4
This is the official quantized version of Upstage flagship model, Solar-Open-100B. It is specifically optimized to run on a single A100 (80GB) GPU.
- Why NotaMoEQuant: Our approach directly resolves representational distortion in MoE architectures, delivering superior benchmark accuracy compared to conventional methods.
- Hardware Efficiency: While the original model required four A100 (80GB) GPUs, our optimized version reduces this requirement to just a single A100 (80GB) for standard inference. Only two A100 GPUs are needed for maximum context generation.
π₯ New Release: Solar-Open-100B-NotaMoEQuant-NVFP4
This latest release utilizes NVFP4 quantization and compressed-tensors packing to ensure backend compatibility with Hugging Face and vLLM. It is specifically designed for NVIDIA Blackwell architecture, requiring a minimum of one B100 GPU.
Also available: Solar-Open-100B-Nota-FP8
π Our Core Business
π οΈ AI Platform: NetsPresso"We make AI lighter, faster, and ready for deployment." NetsPresso is our proprietary platform that accelerates model optimization, enabling you to secure on-device latency and accuracy without deep hardware expertise.
π Ready to optimize? Try NetsPresso Now | View Documentation |
π AI Solutions"We provide end-to-end AI solutions powered by our core optimization technology." 1. Nota Vision AgentPowered by Vision Language Models (VLM), this agent goes beyond simple detection to understand complex situational contexts. 2. Edge AI Solutions
|
π Tech BlogGain insights into our engineering philosophy. We share deep dives into model compression methodologies and NPU acceleration techniques to help you stay ahead. π Read our Tech Blog |
π Connect with Us
|
-
nota-ai/Solar-Open-100B-NotaMoEQuant-NVFP4
Text Generation β’ 59B β’ Updated β’ 116 β’ 3 -
nota-ai/Solar-Open-100B-Nota-FP8
Text Generation β’ Updated β’ 35 β’ 31 -
nota-ai/Solar-Open-100B-NotaMoEQuant-Int4
Text Generation β’ Updated β’ 2.12k β’ 44 -
nota-ai/Qwen3-30B-A3B-NotaMoEQuant-Int4
Text Generation β’ 0.6B β’ Updated β’ 5 β’ 8