NexaAI/phi3.5-mini-npu-mobile

Text Generation


nexaml committed on Jan 9 · Commit 09d188c · verified · 1 parent: e94c8f2

Create README.md

Files changed (1)

README.md +53 -0


	@@ -0,0 +1,53 @@

+---
+pipeline_tag: text-generation
+tags:
+- NPU
+---
+# Phi-3.5-Mini
+Run **Phi-3.5-Mini** optimized for **Qualcomm NPUs** with [nexaSDK](https://sdk.nexa.ai).
+## Model Description
+**Phi-3.5-Mini** is a \~3.8B-parameter instruction-tuned language model from Microsoft’s Phi family.
+It is designed to deliver strong reasoning and instruction-following quality within a compact footprint, which makes it well suited to **on-device** and **latency-sensitive** applications. This Turbo build uses Nexa’s Qualcomm NPU path for faster inference and higher throughput while preserving model quality.
+## Features
+* **Lightweight yet capable**: strong performance with small memory and compute budgets.
+* **Conversational AI**: context-aware dialogue for assistants and agents.
+* **Content generation**: drafting, completion, summarization, code comments, and more.
+* **Reasoning & analysis**: step-by-step problem solving for math and logic.
+* **Multilingual**: supports understanding and generation across multiple languages.
+* **Customizable**: fine-tune or apply adapters for domain-specific use.
+## Use Cases
+* Personal and enterprise chatbots
+* On-device AI applications and offline assistants
+* Document/report/email summarization
+* Education and tutoring tools
+* Vertical solutions (e.g., healthcare, finance, legal), with proper guardrails
+## Inputs and Outputs
+**Input**:
+* Text prompts or conversation history (tokenized input sequences).
+**Output**:
+* Generated text: responses, explanations, or creative content.
+* Optionally: raw logits/probabilities for advanced downstream tasks.
+## License
+* This model is released under the **Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0)** license.
+* Non-commercial use, modification, and redistribution are permitted with attribution.
+* For commercial licensing, please contact **dev@nexa.ai**.
+## References
+* [Microsoft – Phi Models](https://www.microsoft.com/en-us/research/project/phi-3)
+* [Hugging Face Model Card (Phi-3.5-Mini-Instruct)](https://huggingface.co/microsoft/Phi-3.5-mini-instruct)
+* [Introducing Phi-3 (Azure blog overview)](https://azure.microsoft.com/en-us/blog/introducing-phi-3)
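
The "Inputs and Outputs" section of the README mentions optional raw logits/probabilities for downstream tasks. As a minimal sketch of what that post-processing could look like, the snippet below turns a list of logits into probabilities with a numerically stable softmax and picks the most likely token ids. It is not taken from nexaSDK: the function names `softmax` and `top_k`, the `temperature` parameter, and the toy five-token logits are all assumptions for illustration, and how (or whether) the SDK exposes logits is not stated in the README.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert raw logits to probabilities (numerically stable softmax).

    `temperature` is a hypothetical knob for this sketch: values above 1
    flatten the distribution, values below 1 sharpen it.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max so exp() cannot overflow
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(probs, k=3):
    """Return the k most likely (token_id, probability) pairs, highest first."""
    ranked = sorted(enumerate(probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Toy logits over an imaginary 5-token vocabulary (not real model output).
example_logits = [2.0, 1.0, 0.1, -1.0, 3.0]
example_probs = softmax(example_logits)

for token_id, prob in top_k(example_probs, k=2):
    print(f"token {token_id}: p={prob:.3f}")
```

In a real pipeline the logits vector would have one entry per vocabulary token, and the token ids would be mapped back to text with the model's tokenizer.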