Where can I find more architectural details (QKV size, vocabulary size etc) for this model?
https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
· Sign up or log in to comment