---
title: README
emoji: 🐠
colorFrom: pink
colorTo: blue
sdk: static
pinned: false
---

Welcome to the Inference Acceleration Team of the Zhejiang Innovation Research Institute. We work on efficient large-model inference on NVIDIA and domestic GPU platforms, with a focus on inference acceleration techniques such as speculative decoding and model quantization. Feel free to explore our space!