Qwen3-4B-ANE / README.md
nexaml's picture
Update README.md
759eed8 verified

Qwen3-4B

Quickstart

  1. Install NexaSDK

  2. Run the model with one line of code:

    nexa infer NexaAI/qwen3-4b-ane
    

Model Description

Qwen3-4B is a 4-billion-parameter general-purpose language model from Alibaba Cloud’s Qwen team.
It balances strong reasoning, multilingual understanding, and efficient deployment, making it suitable for on-device and server environments alike.

Trained on a large, diverse corpus, Qwen3-4B supports dialogue, analysis, and content generation with solid performance at a compact scale.

Features

  • Conversational AI: context-aware dialogue and assistant-style responses.
  • Content generation: articles, summaries, marketing content, lightweight code generation.
  • Reasoning & analysis: step-by-step problem solving and structured explanations.
  • Multilingual: robust support across major languages.
  • Fine-tunable: efficient to adapt for domain-specific workloads.

Use Cases

  • Chatbots and support assistants
  • Multilingual content workflows
  • Document summarization and extraction
  • Lightweight reasoning and analysis tasks
  • Custom fine-tuned models for vertical domains

Inputs and Outputs

Input

  • Text prompts or chat histories (tokenized sequences when used via APIs)

Output

  • Generated text (answers, explanations, creative content)
  • Optional logits/probabilities for advanced use cases

License

This repo is licensed under the Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0) license, which allows use, sharing, and modification only for non-commercial purposes with proper attribution. All NPU-related models, runtimes, and code in this project are protected under this non-commercial license and cannot be used in any commercial or revenue-generating applications. Commercial licensing or enterprise usage requires a separate agreement. For inquiries, please contact dev@nexa.ai.