	---
	base_model:
	- LiquidAI/LFM2-1.2B
	---

	# LFM2-1.2B
	Run LFM2-1.2B on a Qualcomm NPU with [NexaSDK](https://sdk.nexa.ai).

	## Quickstart

	1. Install NexaSDK and create a free account at [sdk.nexa.ai](https://sdk.nexa.ai)
	2. Activate your device with your access token:

	```bash
	nexa config set license '<access_token>'
	```
	3. Run the model locally in one line:

	```bash
	nexa infer NexaAI/LFM2-1.2B-npu
	```
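
The steps above can be combined into a small wrapper that fails early when the CLI or the token is missing. This is a minimal sketch, not part of NexaSDK: `have_cmd`, `run_quickstart`, and the `NEXA_ACCESS_TOKEN` environment variable are names made up for this example, and only the two `nexa` commands shown above are used.

```bash
# Return 0 if the named command is available in the current shell.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# Run quickstart steps 2 and 3, assuming the token is stored in the
# hypothetical NEXA_ACCESS_TOKEN environment variable.
run_quickstart() {
  if ! have_cmd nexa; then
    echo "nexa CLI not found; install NexaSDK from https://sdk.nexa.ai first" >&2
    return 1
  fi
  if [ -z "${NEXA_ACCESS_TOKEN:-}" ]; then
    echo "NEXA_ACCESS_TOKEN is empty; set it to your access token" >&2
    return 1
  fi
  # Same commands as in steps 2 and 3; stop if activation fails.
  nexa config set license "$NEXA_ACCESS_TOKEN" &&
    nexa infer NexaAI/LFM2-1.2B-npu
}
```

After sourcing the snippet, export the token and call `run_quickstart`. Keeping the token in an environment variable avoids pasting it into shell history, though any other secret store would work as well.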

	## Model Description
	LFM2-1.2B is part of Liquid AI’s second-generation LFM2 family, designed specifically for on-device and edge AI deployment.
	With 1.2 billion parameters, it balances compact size, strong reasoning, and efficient compute use, which makes it well suited to running on CPUs, GPUs, or NPUs.
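
To put the parameter count in perspective, weight storage can be estimated as parameters × bits per weight ÷ 8. The sketch below is a back-of-the-envelope calculation only: `weights_mb` is a helper written for this example, the bit widths are illustrative assumptions (the precision used by this NPU build is not stated here), and runtime memory for the KV cache and activations is ignored.

```bash
# Estimate weight storage in megabytes (1 MB = 10^6 bytes).
# $1 = parameter count, $2 = bits per weight (assumed, not from the model card).
weights_mb() {
  echo $(( $1 * $2 / 8 / 1000000 ))
}

weights_mb 1200000000 16   # 16-bit weights -> 2400
weights_mb 1200000000 8    # 8-bit weights  -> 1200
weights_mb 1200000000 4    # 4-bit weights  -> 600
```

These figures cover the weights alone, so the real footprint on a device will be somewhat higher.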

	LFM2 introduces a hybrid Liquid architecture with multiplicative gates and short convolutions, enabling faster convergence and improved contextual reasoning.
	Liquid AI reports up to 3× faster training than the previous LFM generation and 2× faster decode and prefill on CPU compared to Qwen3, along with stronger results than similarly sized models on multilingual and instruction-following benchmarks.

	## Features
	- ⚡ Speed & Efficiency – 2× faster decode and prefill on CPU compared to Qwen3.
	- 🧠 Hybrid Liquid Architecture – Combines multiplicative gating with convolutional layers for better reasoning and token reuse.
	- 🌍 Multilingual Competence – Supports diverse languages for global use cases.
	- 🛠 Flexible Deployment – Runs efficiently on CPU, GPU, and NPU hardware.
	- 📈 Benchmark Performance – Outperforms similarly sized models in math, knowledge, and reasoning tasks.

	## Use Cases
	- Edge AI assistants and voice agents
	- Offline reasoning and summarization on mobile or automotive devices
	- Local code and text generation tools
	- Lightweight multimodal or RAG pipelines
	- Domain-specific fine-tuning for vertical applications (e.g., finance, robotics)

	## Inputs and Outputs
	Input
	- Text prompts or structured instructions (tokenized sequences for API use).

	Output
	- Natural-language or structured text generations.
	- Optionally: logits or embeddings for advanced downstream integration.

	## License
	This model is released under the Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0) license.
	Non-commercial use, modification, and redistribution are permitted with attribution.
	For commercial licensing, please contact dev@nexa.ai.