Update README.md
Browse files
README.md
CHANGED
|
@@ -4,4 +4,52 @@ datasets:
|
|
| 4 |
- BEE-spoke-data/fineweb-cryptid-5k
|
| 5 |
---
|
| 6 |
|
| 7 |
-
New model tuning stratagy. Adding text to make this long enough.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
- BEE-spoke-data/fineweb-cryptid-5k
|
| 5 |
---
|
| 6 |
|
| 7 |
+
New model tuning stratagy. Adding text to make this long enough.
|
| 8 |
+
|
| 9 |
+
Model Description
|
| 10 |
+
|
| 11 |
+
The Cryptid Detection Model is designed to generate and classify text related to cryptids, which are creatures from folklore and urban legends. The model is trained on a diverse dataset of cryptid-related content, including descriptions, stories, sightings, and various folklore sources.
|
| 12 |
+
Intended Use
|
| 13 |
+
|
| 14 |
+
Primary Use Case: Generating and classifying text about cryptids for entertainment, research, and educational purposes.
|
| 15 |
+
Secondary Use Cases: Assisting in the creation of cryptid-related content for books, articles, and media.
|
| 16 |
+
|
| 17 |
+
Input and Output
|
| 18 |
+
|
| 19 |
+
Input: Text prompts or descriptions.
|
| 20 |
+
Output: Generated text about cryptids or classifications of the input text as related to specific cryptids.
|
| 21 |
+
|
| 22 |
+
Training Data
|
| 23 |
+
|
| 24 |
+
The model was trained on a curated dataset of cryptid-related text, including but not limited to:
|
| 25 |
+
|
| 26 |
+
Books and articles about cryptids.
|
| 27 |
+
Online forums and discussion boards.
|
| 28 |
+
Folklore databases.
|
| 29 |
+
User-submitted stories and sightings.
|
| 30 |
+
|
| 31 |
+
Data Preprocessing
|
| 32 |
+
|
| 33 |
+
Text cleaning: Removal of special characters, HTML tags, and excessive whitespace.
|
| 34 |
+
Tokenization: Breaking down text into tokens for training.
|
| 35 |
+
|
| 36 |
+
Model Performance
|
| 37 |
+
|
| 38 |
+
Metrics: [Accuracy, F1 Score, Precision, Recall, etc.]
|
| 39 |
+
Evaluation: The model was evaluated on a validation set consisting of [describe the validation set].
|
| 40 |
+
|
| 41 |
+
Limitations and Biases
|
| 42 |
+
|
| 43 |
+
Biases: The model may reflect biases present in the training data, such as regional biases in folklore or common myths.
|
| 44 |
+
Limitations: The model may not accurately generate or classify less common or very specific cryptids.
|
| 45 |
+
|
| 46 |
+
Ethical Considerations
|
| 47 |
+
|
| 48 |
+
The model is intended for entertainment and educational purposes. It should not be used as a factual source for scientific research or investigation.
|
| 49 |
+
Users should be aware of the potential for generating content that might be misinterpreted as factual.
|
| 50 |
+
|
| 51 |
+
Future Work
|
| 52 |
+
|
| 53 |
+
Expanding the training dataset to include more diverse sources.
|
| 54 |
+
Improving classification accuracy for less common cryptids.
|
| 55 |
+
Adding functionality for multilingual support.
|