bert4torch_config / deepseek-ai /deepseek-math-7b-instruct
1.02 kB
Tongjilibo's picture
rename pre_layernorm and add glm_ocr
88b804b