v2ray
/

GPT4chan-8B

@@ -10,6 +10,8 @@ pipeline_tag: text-generation
 library_name: transformers
 ---
 # GPT4chan 8B
 This model is [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B) merged with [v2ray/GPT4chan-8B-QLoRA](https://huggingface.co/v2ray/GPT4chan-8B-QLoRA).
 Trained using 8x H100 with global batch size 64, using 2e-4 learning rate, for 4000 steps, which is approximately 5 epochs.

 library_name: transformers
 ---
 # GPT4chan 8B
+![GPT4chan Banner](https://huggingface.co/v2ray/GPT4chan-24B-QLoRA/resolve/main/images/banner.avif)
 This model is [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B) merged with [v2ray/GPT4chan-8B-QLoRA](https://huggingface.co/v2ray/GPT4chan-8B-QLoRA).
 Trained using 8x H100 with global batch size 64, using 2e-4 learning rate, for 4000 steps, which is approximately 5 epochs.