File size: 1,169 Bytes
52855b8
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
---
language:
- en
license: other
tags:
- mnn
- on-device
- android
- ios
- quantization
- int4
- text-generation
- qwen3
pipeline_tag: text-generation
library_name: mnn
base_model: WhoIsShe/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small-MNN
---

# ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small (MNN Quantized)

# Original model :

* **https://huggingface.co/ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small**


This is a **4-bit quantized** version of the Qwen3-4B-RPG-Roleplay-V2, optimized for **on-device inference** (Android/iOS) using the [Alibaba MNN framework](https://github.com/alibaba/MNN).

## 🚀 Fast Deployment on Android

### 1. Download the App
Don't build from scratch! Use the official MNN Chat Android app:
* **[Download APK (GitHub)](https://github.com/alibaba/MNN/releases)**

### 2. Setup
1. Download the files from this repo (`llm.mnn`, `llm.mnn.weight`, `config.json`).
2. Create a folder on your phone: `/sdcard/MNN/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small`.
3. Copy the files into that folder.
4. Open the MNN App and select your folder.

## 💻 Technical Details
* **Framework:** MNN
* **Quantization:** 4-bit Asymmetric (Int4)
* **Model Type:** QWEN3-4B (Uncensored)