AI & ML interests
Hardware-aware AI Model Optimization
Recent Activity
Nota AI bridges the gap between high-performance AI models and edge devices.
From our automated optimization platform to bespoke AI solutions, we ensure your AI functions efficiently—everywhere it is needed.
🌟 Spotlight
World Best LLM (WBL) Project
Nota AI participates in the 'World Best LLM' (WBL) project, a key initiative by the South Korean government (NIPA) to develop global-tier foundation models. As a core optimization partner, we focus on compressing massive LLMs for practical deployment.
🔥 New Release: Solar-Open-100B-NotaMoEQuant-Int4
Quantized Model for Upstage's Solar-Open-100B
This model is optimized using our proprietary NotaMoEQuant, a specialized methodology for Mixture-of-Experts (MoE) architectures.
- Why NotaMoEQuant: Unlike conventional methods (e.g., AutoRound) that overlook expert routing changes during quantization, our approach directly resolves the resulting representational distortion, delivering superior benchmark accuracy.
- Hardware Efficiency: Reduces the GPU requirement for maximum context generation from 4x A100 (80GB) to 2x A100 (80GB), saving up to 50% on inference costs.
Also available: Solar-Open-100B-Nota-FP8
🚀 Our Core Business
🛠️ AI Platform: NetsPresso"We make AI lighter, faster, and ready for deployment." NetsPresso is our proprietary platform that accelerates model optimization, enabling you to secure on-device latency and accuracy without deep hardware expertise.
👉 Ready to optimize? Try NetsPresso Now | View Documentation |
🌍 AI Solutions"We provide end-to-end AI solutions powered by our core optimization technology." 1. Nota Vision AgentPowered by Vision Language Models (VLM), this agent goes beyond simple detection to understand complex situational contexts. 2. Edge AI Solutions
|
📚 Tech BlogGain insights into our engineering philosophy. We share deep dives into model compression methodologies and NPU acceleration techniques to help you stay ahead. |
🔗 Connect with Us
|