blackcloud1199 commited on
Commit
1a4933f
·
verified ·
1 Parent(s): 7276da4

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +41 -0
README.md ADDED
@@ -0,0 +1,41 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: executorch
4
+ tags:
5
+ - android
6
+ - ios
7
+ - on-device
8
+ - pytorch
9
+ - react-native
10
+ - smollm
11
+ - llama
12
+ base_model: HuggingFaceTB/SmolLM2-1.7B-Instruct
13
+ ---
14
+
15
+ # SmolLM2-1.7B-Executorch-Q8DA4W
16
+
17
+ This repository contains the `smollm2_1_7b_q8da4w.pte` model, exported for use with [ExecuTorch](https://pytorch.org/executorch).
18
+
19
+ ## Details
20
+ - **Base Model**: [HuggingFaceTB/SmolLM2-1.7B-Instruct](https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct)
21
+ - **Format**: `.pte` (ExecuTorch)
22
+ - **Quantization**: Q8DA4W (4-bit linear weights, 8-bit dynamic activations)
23
+ - **Architecture**: llama (compatible with Llama export pipeline)
24
+ - **File Size**: ~1.7 GB
25
+
26
+ ## Features
27
+ - 🚀 Optimized for mobile/edge devices
28
+ - 📱 Compatible with `react-native-executorch`
29
+ - 💡 SmolLM2 is efficient and fast for resource-constrained environments
30
+ - 🗣️ Instruct-tuned for conversational AI
31
+
32
+ ## Usage
33
+ This model is ready to be used in mobile applications (iOS/Android) via the ExecuTorch runtime or `react-native-executorch`.
34
+
35
+ 1. Download `smollm2_1_7b_q8da4w.pte` and the tokenizer files (`tokenizer.json`, `vocab.json`, `merges.txt`).
36
+ 2. Place them in your app's asset folder.
37
+ 3. Load with ExecuTorch runtime.
38
+
39
+ ## Notes
40
+ - SmolLM2 uses **byte-level BPE tokenizer** (similar to GPT-2), not SentencePiece like Llama.
41
+ - Tokenizer files are: `tokenizer.json`, `vocab.json`, `merges.txt`