Text Generation
Transformers
Safetensors
llama
text-generation-inference
MultivexAI commited on
Commit
54f3a02
·
verified ·
1 Parent(s): f72a055

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -18,7 +18,7 @@ Plyx-15M is intended for quick testing, research into data efficiency, and speci
18
 
19
  Plyx-15M was trained exclusively on a carefully selected set of premium datasets, prioritizing accuracy and structure.
20
 
21
- 1. **`fineweb-pro`** This data is a highly refined subset of general internet content. It was aggressively filtered using advanced, automated tools to remove common errors and noise, giving the model a clean understanding of everyday language.
22
  2. **`fineweb-edu`** Content focused on education and instruction, providing the model with a solid base in clear, organized knowledge.
23
  3. **`finepdfs`** Specialized knowledge sourced from millions of professional reports and complex documents (PDFs). This ensures the model is exposed to formal, technical writing styles and organized information structures.
24
 
 
18
 
19
  Plyx-15M was trained exclusively on a carefully selected set of premium datasets, prioritizing accuracy and structure.
20
 
21
+ 1. **`fineweb-pro`** This data is a highly refined subset of FineWeb. It was aggressively filtered using advanced, automated tools to remove common errors and noise, giving the model a clean understanding of everyday language.
22
  2. **`fineweb-edu`** Content focused on education and instruction, providing the model with a solid base in clear, organized knowledge.
23
  3. **`finepdfs`** Specialized knowledge sourced from millions of professional reports and complex documents (PDFs). This ensures the model is exposed to formal, technical writing styles and organized information structures.
24