dsikka commited on
Commit
7657f94
·
verified ·
1 Parent(s): 6dd2b6d

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -0
README.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - arcee-ai/Trinity-Large-Thinking
4
+ tags:
5
+ - afmoe
6
+ - nvfp4
7
+ - vllm
8
+ - compressed-tensors
9
+ name: RedHatAI/Trinity-Large-Thinking-NVFP4
10
+ ---
11
+
12
+ # NVFP4 Quantized RedHatAI/Trinity-Large-Thinking-NVFP4
13
+
14
+ This is a preliminary version (and subject to change) of NVFP4 quantized [arcee-ai/Trinity-Large-Thinking ](https://huggingface.co/arcee-ai/Trinity-Large-Thinking/tree/main ) model.
15
+ The model has both weights and activations quantized to NVFP4 format with [vllm-project/llm-compressor](https://github.com/vllm-project/llm-compressor).
16
+
17
+ It is compatible and tested against vllm main.