cgg507 commited on
Commit
edf8855
·
verified ·
1 Parent(s): 53cd6ee

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - DavidAU/Qwen2.5-Godzilla-Coder-51B
4
+ ---
5
+
6
+ Quanted using lmdeploy - auto_awq. Going to run benchmarks to see performence.
7
+
8
+ Ran it up on 4x A4000 GPUs with 32k context. Got 30 t/s per request.