Qwen3-4B

Quickstart

Install NexaSDK
Run the model with one line of code:
```
nexa infer NexaAI/qwen3-4b-ane
```

Model Description

Qwen3-4B is a 4-billion-parameter general-purpose language model from Alibaba Cloud’s Qwen team.
It balances strong reasoning, multilingual understanding, and efficient deployment, making it suitable for on-device and server environments alike.

Trained on a large, diverse corpus, Qwen3-4B supports dialogue, analysis, and content generation with solid performance at a compact scale.

Features

Conversational AI: context-aware dialogue and assistant-style responses.
Content generation: articles, summaries, marketing content, lightweight code generation.
Reasoning & analysis: step-by-step problem solving and structured explanations.
Multilingual: robust support across major languages.
Fine-tunable: efficient to adapt for domain-specific workloads.

Use Cases

Chatbots and support assistants
Multilingual content workflows
Document summarization and extraction
Lightweight reasoning and analysis tasks
Custom fine-tuned models for vertical domains

Inputs and Outputs

Input

Text prompts or chat histories (tokenized sequences when used via APIs)

Output

Generated text (answers, explanations, creative content)
Optional logits/probabilities for advanced use cases

License

This repo is licensed under the Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0) license, which allows use, sharing, and modification only for non-commercial purposes with proper attribution. All NPU-related models, runtimes, and code in this project are protected under this non-commercial license and cannot be used in any commercial or revenue-generating applications. Commercial licensing or enterprise usage requires a separate agreement. For inquiries, please contact dev@nexa.ai.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including NexaAI/Qwen3-4B-ANE

Apple Neural Engine

Collection

Latest SOTA models supported on Apple Neural Engine • 7 items • Updated Dec 3, 2025 • 5