Qwen3-4B
Quickstart
Install NexaSDK
Run the model with one line of code:
nexa infer NexaAI/qwen3-4b-ane
Model Description
Qwen3-4B is a 4-billion-parameter general-purpose language model from Alibaba Cloud’s Qwen team.
It balances strong reasoning, multilingual understanding, and efficient deployment, making it suitable for on-device and server environments alike.
Trained on a large, diverse corpus, Qwen3-4B supports dialogue, analysis, and content generation with solid performance at a compact scale.
Features
- Conversational AI: context-aware dialogue and assistant-style responses.
- Content generation: articles, summaries, marketing content, lightweight code generation.
- Reasoning & analysis: step-by-step problem solving and structured explanations.
- Multilingual: robust support across major languages.
- Fine-tunable: efficient to adapt for domain-specific workloads.
Use Cases
- Chatbots and support assistants
- Multilingual content workflows
- Document summarization and extraction
- Lightweight reasoning and analysis tasks
- Custom fine-tuned models for vertical domains
Inputs and Outputs
Input
- Text prompts or chat histories (tokenized sequences when used via APIs)
Output
- Generated text (answers, explanations, creative content)
- Optional logits/probabilities for advanced use cases
License
This repo is licensed under the Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0) license, which allows use, sharing, and modification only for non-commercial purposes with proper attribution. All NPU-related models, runtimes, and code in this project are protected under this non-commercial license and cannot be used in any commercial or revenue-generating applications. Commercial licensing or enterprise usage requires a separate agreement. For inquiries, please contact dev@nexa.ai.