Text Generation
Safetensors
qwen2
conversational
Zeta-3 / README.md
DiamondGotCat's picture
Update README.md
ad06a45 verified
metadata
license: mit
language:
  - zho
  - eng
  - fra
  - spa
  - por
  - deu
  - ita
  - rus
  - jpn
  - kor
  - vie
  - tha
  - ara
pipeline_tag: text-generation
datasets:
  - DiamondGotCat/Zeta-2-Dataset
new_version: Zeta-LLM/Zeta-4

image/png

Pushing the limits of the Zeta-2-Dataset. New: Zeta 3

Zeta 3 is a new LLM that challenges itself to outperform Zeta-2, even though it uses the same Zeta-2-Dataset.

Ollama: DiamondGotCat/Zeta-3

Quantized Model (GGUF)

Prompt Template

{{ if .System }}{{ .System }}{{ end }}
{{ if .Prompt }}<USER>{{ .Prompt }}</USER>{{ end }}
<ASSISTANT>

Stop Token

</ASSISTANT>

Computer Spec

Machine: RunPod VM(GPU, NVIDIA A100 PCIe)

This time, I used the RunPod service to study more efficiently.

Thanks to RunPod, I was able to use CUDA and had the optimization options available.

Dataset

Details of the dataset used can be found here

Links

GitHub: Zeta


Zeta is just a small SLM. But don't forget that it has big dreams inside.