| language: | |
| - en | |
| license: other | |
| tags: | |
| - mnn | |
| - on-device | |
| - android | |
| - ios | |
| - quantization | |
| - int4 | |
| - text-generation | |
| - qwen3 | |
| pipeline_tag: text-generation | |
| library_name: mnn | |
| base_model: WhoIsShe/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small-MNN | |
| # ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small (MNN Quantized) | |
| # Original model : | |
| * **https://huggingface.co/ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small** | |
| This is a **4-bit quantized** version of the Qwen3-4B-RPG-Roleplay-V2, optimized for **on-device inference** (Android/iOS) using the [Alibaba MNN framework](https://github.com/alibaba/MNN). | |
| ## 🚀 Fast Deployment on Android | |
| ### 1. Download the App | |
| Don't build from scratch! Use the official MNN Chat Android app: | |
| * **[Download APK (GitHub)](https://github.com/alibaba/MNN/releases)** | |
| ### 2. Setup | |
| 1. Download the files from this repo (`llm.mnn`, `llm.mnn.weight`, `config.json`). | |
| 2. Create a folder on your phone: `/sdcard/MNN/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small`. | |
| 3. Copy the files into that folder. | |
| 4. Open the MNN App and select your folder. | |
| ## 💻 Technical Details | |
| * **Framework:** MNN | |
| * **Quantization:** 4-bit Asymmetric (Int4) | |
| * **Model Type:** QWEN3-4B (Uncensored) | |