--- language: - en license: other tags: - mnn - on-device - android - ios - quantization - int4 - text-generation - qwen3 pipeline_tag: text-generation library_name: mnn base_model: WhoIsShe/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small-MNN --- # ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small (MNN Quantized) # Original model : * **https://huggingface.co/ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small** This is a **4-bit quantized** version of the Qwen3-4B-RPG-Roleplay-V2, optimized for **on-device inference** (Android/iOS) using the [Alibaba MNN framework](https://github.com/alibaba/MNN). ## 🚀 Fast Deployment on Android ### 1. Download the App Don't build from scratch! Use the official MNN Chat Android app: * **[Download APK (GitHub)](https://github.com/alibaba/MNN/releases)** ### 2. Setup 1. Download the files from this repo (`llm.mnn`, `llm.mnn.weight`, `config.json`). 2. Create a folder on your phone: `/sdcard/MNN/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small`. 3. Copy the files into that folder. 4. Open the MNN App and select your folder. ## 💻 Technical Details * **Framework:** MNN * **Quantization:** 4-bit Asymmetric (Int4) * **Model Type:** QWEN3-4B (Uncensored)