You should mention that this model uses the DeepSeek architecture
#3
by CHNtentes - opened
Add a special thanks or something.
Thanks a lot for your suggestion. We have added THIRD_PARTY_NOTICES.md to clarify this. Please see https://huggingface.co/moonshotai/Kimi-K2-Instruct/blob/main/THIRD_PARTY_NOTICES.md
lsw825 changed discussion status to closed