fmasterpro27 commited on
Commit
439a3cb
·
verified ·
1 Parent(s): 7b72a40

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +68 -1
README.md CHANGED
@@ -1,4 +1,71 @@
1
  ---
2
  base_model:
3
  - ibm-granite/granite-4.0-h-tiny
4
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  base_model:
3
  - ibm-granite/granite-4.0-h-tiny
4
+ license: apache-2.0
5
+ library_name: transformers
6
+ tags:
7
+ - language
8
+ - granite-4.0
9
+ ---
10
+
11
+ # Open4bits / Granite 4.0 H Tiny FP16
12
+
13
+ This repository provides the **Granite 4.0 H Tiny model converted to FP16 (float16) precision**, published by Open4bits for more efficient inference while preserving the original model behavior.
14
+
15
+ The underlying Granite model architecture and weights are **owned by IBM**. This repository contains only a precision-converted version of the original model.
16
+
17
+ The model is suitable for general-purpose text generation tasks and lightweight language model experimentation.
18
+
19
+ ---
20
+
21
+ ## Model Overview
22
+
23
+ Granite is a family of transformer-based language models developed by IBM, designed for efficient and scalable natural language processing tasks.
24
+
25
+ This release uses the **Granite 4.0 H Tiny** variant, focusing on lower memory usage and faster inference while maintaining architectural fidelity to the original model.
26
+
27
+ ---
28
+
29
+ ## Model Details
30
+
31
+ - **Architecture:** Granite 4.0 H Tiny
32
+ - **Precision:** float16 (FP16)
33
+ - **Task:** Text Generation
34
+ - **Weight tying:** Preserved
35
+ - **Compatibility:** Hugging Face Transformers, PyTorch
36
+
37
+ The FP16 conversion reduces VRAM requirements and improves inference speed compared to FP32 versions, making the model suitable for deployment on resource-constrained hardware.
38
+
39
+ ---
40
+
41
+ ## Intended Use
42
+
43
+ This model is intended for:
44
+ - Text generation and completion
45
+ - Research and experimentation
46
+ - Lightweight inference workloads
47
+ - Prototyping and benchmarking language models
48
+
49
+ ---
50
+
51
+
52
+ ## Limitations
53
+
54
+ * Inherits limitations of the Granite Tiny architecture
55
+ * Output quality may be lower than larger Granite variants
56
+ * Not fine-tuned for domain-specific or instruction-following tasks
57
+
58
+ ---
59
+
60
+ ## License
61
+
62
+ This model is released under the **Apache License 2.0**.
63
+ The original Granite model and associated intellectual property are owned by IBM.
64
+
65
+ ---
66
+
67
+ ## Support
68
+
69
+ If you find this model useful, please consider supporting the project.
70
+ Your support helps us continue releasing and maintaining high-quality open models.
71
+ Support us with a heart.