ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small (MNN Quantized)

Original model :

This is a 4-bit quantized version of the Qwen3-4B-RPG-Roleplay-V2, optimized for on-device inference (Android/iOS) using the Alibaba MNN framework.

🚀 Fast Deployment on Android

1. Download the App

Don't build from scratch! Use the official MNN Chat Android app:

2. Setup

  1. Download the files from this repo (llm.mnn, llm.mnn.weight, config.json).
  2. Create a folder on your phone: /sdcard/MNN/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small.
  3. Copy the files into that folder.
  4. Open the MNN App and select your folder.

💻 Technical Details

  • Framework: MNN
  • Quantization: 4-bit Asymmetric (Int4)
  • Model Type: QWEN3-4B (Uncensored)
Downloads last month
8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for WhoIsShe/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small-MNN

Unable to build the model tree, the base model loops to the model itself. Learn more.