LFM2-1.2B-npu / README.md
nexaml's picture
Create README.md
b9562f8 verified
---
base_model:
- LiquidAI/LFM2-1.2B
---
# LFM2-1.2B
Run **LFM2-1.2B** on Qualcomm NPU with [NexaSDK](https://sdk.nexa.ai).
## Quickstart
1. **Install NexaSDK** and create a free account at [sdk.nexa.ai](https://sdk.nexa.ai)
2. **Activate your device** with your access token:
```bash
nexa config set license '<access_token>'
```
3. Run the model locally in one line:
```bash
nexa infer NexaAI/LFM2-1.2B-npu
```
## Model Description
**LFM2-1.2B** is part of Liquid AI’s second-generation **LFM2** family, designed specifically for **on-device and edge AI deployment**.
With **1.2 billion parameters**, it strikes a balance between compact size, strong reasoning, and efficient compute utilization—ideal for running on CPUs, GPUs, or NPUs.
LFM2 introduces a **hybrid Liquid architecture** with **multiplicative gates and short convolutions**, enabling faster convergence and improved contextual reasoning.
It demonstrates up to **3× faster training** and **2× faster inference** on CPU compared to Qwen3, while maintaining superior accuracy across multilingual and instruction-following benchmarks.
## Features
-**Speed & Efficiency** – 2× faster inference and prefill].
- 🧠 **Hybrid Liquid Architecture** – Combines multiplicative gating with convolutional layers for better reasoning and token reuse.
- 🌍 **Multilingual Competence** – Supports diverse languages for global use cases.
- 🛠 **Flexible Deployment** – Runs efficiently on CPU, GPU, and NPU hardware.
- 📈 **Benchmark Performance** – Outperforms similarly-sized models in math, knowledge, and reasoning tasks.
## Use Cases
- Edge AI assistants and voice agents
- Offline reasoning and summarization on mobile or automotive devices
- Local code and text generation tools
- Lightweight multimodal or RAG pipelines
- Domain-specific fine-tuning for vertical applications (e.g., finance, robotics)
## Inputs and Outputs
**Input**
- Text prompts or structured instructions (tokenized sequences for API use).
**Output**
- Natural-language or structured text generations.
- Optionally: logits or embeddings for advanced downstream integration.
## License
This model is released under the **Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0)** license.
Non-commercial use, modification, and redistribution are permitted with attribution.
For commercial licensing, please contact **dev@nexa.ai**.