TeeZee/DarkSapling-7B-v2.0 (MNN Quantized)

Original model :

https://huggingface.co/TeeZee/DarkSapling-7B-v2.0

This is a 4-bit quantized version of the TeeZee/DarkSapling-7B-v2.0, optimized for on-device inference (Android/iOS) using the Alibaba MNN framework.

🚀 Fast Deployment on Android

1. Download the App

Don't build from scratch! Use the official MNN Chat Android app:

Download APK (GitHub)

2. Setup

Download the files from this repo (llm.mnn, llm.mnn.weight, config.json).
Create a folder on your phone: /sdcard/MNN/DarkSapling-7B-v2.0.
Copy the files into that folder.
Open the MNN App and select your folder.

💻 Technical Details

Framework: MNN
Quantization: 4-bit Asymmetric (Int4)
Model Type: LLAMA2-7B (Uncensored)

Downloads last month: 9

Model tree for WhoIsShe/DarkSapling-7B-v2.0-MNN

Unable to build the model tree, the base model loops to the model itself. Learn more.