---
language:
- en
license: other
tags:
- mnn
- on-device
- android
- ios
- quantization
- int4
- text-generation
- qwen3
pipeline_tag: text-generation
library_name: mnn
base_model: ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small
---

# ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small (MNN Quantized)

## Original model

* **https://huggingface.co/ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small**


This is a **4-bit quantized** version of DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small, optimized for **on-device inference** (Android/iOS) using the [Alibaba MNN framework](https://github.com/alibaba/MNN).

## 🚀 Fast Deployment on Android

### 1. Download the App
Don't build from scratch! Use the official MNN Chat Android app:
* **[Download APK (GitHub)](https://github.com/alibaba/MNN/releases)**

### 2. Setup
1. Download the files from this repo (`llm.mnn`, `llm.mnn.weight`, `config.json`).
2. Create a folder on your phone: `/sdcard/MNN/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small`.
3. Copy the files into that folder.
4. Open the MNN Chat app and select your folder.
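
If the phone is connected to a computer with USB debugging enabled, steps 2 and 3 can also be scripted with `adb`. The sketch below is a hypothetical helper, not part of MNN: it checks that a local download folder contains the three files listed above and builds the matching `adb` commands. The function names and the local folder name `model` are assumptions for illustration.

```python
from pathlib import Path

# File names and target folder are taken from the setup steps above.
REQUIRED_FILES = ("llm.mnn", "llm.mnn.weight", "config.json")
DEVICE_DIR = "/sdcard/MNN/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small"


def missing_files(local_dir):
    """Return the required file names that are not present in local_dir."""
    root = Path(local_dir)
    return [name for name in REQUIRED_FILES if not (root / name).is_file()]


def build_adb_commands(local_dir, device_dir=DEVICE_DIR):
    """Build the adb commands that create the folder and copy each file."""
    root = Path(local_dir)
    commands = [["adb", "shell", "mkdir", "-p", device_dir]]
    for name in REQUIRED_FILES:
        commands.append(["adb", "push", str(root / name), f"{device_dir}/{name}"])
    return commands
```

To run the copy, call `missing_files("model")` first and, if it returns an empty list, pass each command from `build_adb_commands("model")` to `subprocess.run(cmd, check=True)`. This assumes `adb` is on your `PATH`.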

## 💻 Technical Details
* **Framework:** MNN
* **Quantization:** 4-bit Asymmetric (Int4)
* **Model Type:** Qwen3-8B (Uncensored)
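
To give a rough idea of what 4-bit asymmetric quantization means, here is a minimal pure-Python sketch. It maps a list of floats to integer codes in the range 0 to 15 using a scale and a minimum offset, then reconstructs approximate values. This only illustrates the general technique; the function names are hypothetical, and MNN's actual converter likely works on groups of weights with its own storage layout.

```python
def quantize_int4_asymmetric(values):
    """Map floats to integer codes in [0, 15] using a scale and a minimum offset."""
    lo, hi = min(values), max(values)
    # 4 bits give 16 levels, so the range is split into 15 steps.
    scale = (hi - lo) / 15 or 1.0  # fall back to 1.0 for constant input
    codes = [round((v - lo) / scale) for v in values]
    return codes, scale, lo


def dequantize_int4(codes, scale, lo):
    """Recover approximate floats from the 4-bit codes."""
    return [c * scale + lo for c in codes]


# Illustrative values only, not real model weights.
weights = [-0.42, -0.10, 0.0, 0.07, 0.31, 0.88]
codes, scale, lo = quantize_int4_asymmetric(weights)
restored = dequantize_int4(codes, scale, lo)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Because the offset is the minimum value rather than zero, the 16 levels cover the actual range of the weights even when it is not centered on zero. The rounding error per weight is at most about half of `scale`.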