AI & ML interests

Hardware-aware AI Model Optimization

Recent Activity

Organization Card
Nota AI Banner

Nota AI bridges the gap between high-performance AI models and edge devices.
From our automated optimization platform to bespoke AI solutions, we ensure your AI functions efficientlyβ€”everywhere it is needed.

Website LinkedIn NetsPresso


🌟 Spotlight

Sovereign AI Foundation Model Project

Nota AI participates in the Sovereign AI Foundation Model Project, a key initiative by the South Korean government (NIPA) to develop global-tier foundation models. As a core optimization partner, we focus on compressing massive LLMs for practical deployment.

Optimized Solar-Open-100B Series

πŸ† Official Quantized Model: Solar-Open-100B-NotaMoEQuant-Int4

This is the official quantized version of Upstage flagship model, Solar-Open-100B. It is specifically optimized to run on a single A100 (80GB) GPU.

  • Why NotaMoEQuant: Our approach directly resolves representational distortion in MoE architectures, delivering superior benchmark accuracy compared to conventional methods.
  • Hardware Efficiency: While the original model required four A100 (80GB) GPUs, our optimized version reduces this requirement to just a single A100 (80GB) for standard inference. Only two A100 GPUs are needed for maximum context generation.

πŸ”₯ New Release: Solar-Open-100B-NotaMoEQuant-NVFP4

This latest release utilizes NVFP4 quantization and compressed-tensors packing to ensure backend compatibility with Hugging Face and vLLM. It is specifically designed for NVIDIA Blackwell architecture, requiring a minimum of one B100 GPU.

Also available: Solar-Open-100B-Nota-FP8

πŸš€ Our Core Business

πŸ› οΈ AI Platform: NetsPresso

"We make AI lighter, faster, and ready for deployment."

NetsPresso is our proprietary platform that accelerates model optimization, enabling you to secure on-device latency and accuracy without deep hardware expertise.

  • Develop & Compress: Create lightweight models effortlessly using our Model Zoo and advanced Compressor (Structured Pruning).
  • Optimize & Convert: Maximize speed on verified hardware (NVIDIA, Arm, Qualcomm, etc.) with Graph Optimization and Graph Quantization.
  • Test on Real Devices: Validate performance instantly on actual devices via our Device Farm to eliminate deployment failures.

πŸ‘‰ Ready to optimize? Try NetsPresso Now | View Documentation

🌍 AI Solutions

"We provide end-to-end AI solutions powered by our core optimization technology."

1. Nota Vision Agent

Powered by Vision Language Models (VLM), this agent goes beyond simple detection to understand complex situational contexts.
It interprets video feeds through natural language prompts, delivering real-time insights locally without cloud dependency.

2. Edge AI Solutions

πŸ“š Tech Blog

Gain insights into our engineering philosophy. We share deep dives into model compression methodologies and NPU acceleration techniques to help you stay ahead.

πŸ‘‰ Read our Tech Blog

πŸ”— Connect with Us


Β© 2026 Nota Inc. All rights reserved.

datasets 0

None public yet