Update README.md
Browse files
README.md
CHANGED
|
@@ -12,12 +12,73 @@ datasets:
|
|
| 12 |
- Nitral-AI/Creative_Writing-ShareGPT
|
| 13 |
---
|
| 14 |
|
| 15 |
-
#
|
| 16 |
|
| 17 |
-
-
|
| 18 |
-
- **License:** apache-2.0
|
| 19 |
-
- **Finetuned from model :** unsloth/gemma-3-1b-it-unsloth-bnb-4bit
|
| 20 |
|
| 21 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
|
| 23 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 12 |
- Nitral-AI/Creative_Writing-ShareGPT
|
| 13 |
---
|
| 14 |
|
| 15 |
+
# Halcyon-1B
|
| 16 |
|
| 17 |
+
**Halcyon-1B** is a creatively fine-tuned variant of the **unsloth/gemma-3-1b-it-unsloth-bnb-4bit** model, specifically tailored for imaginative and expressive creative writing tasks. This model has been fine-tuned to excel in storytelling, literary exploration, and nuanced narrative construction.
|
|
|
|
|
|
|
| 18 |
|
| 19 |
+
---
|
| 20 |
+
|
| 21 |
+
## Model Details
|
| 22 |
+
|
| 23 |
+
- **Developed by:** [colesmcintosh](https://huggingface.co/colesmcintosh)
|
| 24 |
+
- **Base Model:** [unsloth/gemma-3-1b-it-unsloth-bnb-4bit](https://huggingface.co/unsloth/gemma-3-1b-it-unsloth-bnb-4bit)
|
| 25 |
+
- **Fine-tuning Methodology:** Trained 2x faster leveraging [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
|
| 26 |
+
|
| 27 |
+
---
|
| 28 |
+
|
| 29 |
+
## Dataset
|
| 30 |
+
|
| 31 |
+
This model was fine-tuned using the [(Nitral-AI) Creative Writing ShareGPT](https://huggingface.co/datasets/Nitral-AI/Creative_Writing-ShareGPT) dataset.
|
| 32 |
+
|
| 33 |
+
---
|
| 34 |
+
|
| 35 |
+
## Capabilities
|
| 36 |
+
|
| 37 |
+
- **Creative Writing:** Exceptional at generating narratives, stories, poetry, and prose.
|
| 38 |
+
- **Expressive Nuance:** Generates sophisticated, context-aware, and evocative literary outputs.
|
| 39 |
+
- **Versatility:** Suitable for writers, creators, educators, and storytellers looking to harness AI for enhanced creative exploration.
|
| 40 |
+
|
| 41 |
+
---
|
| 42 |
+
|
| 43 |
+
## Intended Use
|
| 44 |
+
|
| 45 |
+
- **Creative Inspiration:** Idea generation, overcoming writer’s block, and expanding narrative horizons.
|
| 46 |
+
- **Educational Tools:** Supporting literature courses, workshops, and creative writing sessions.
|
| 47 |
+
- **Interactive Storytelling:** Enabling interactive fiction, dynamic content creation, and innovative narrative formats.
|
| 48 |
+
|
| 49 |
+
---
|
| 50 |
+
|
| 51 |
+
## Usage
|
| 52 |
+
|
| 53 |
+
You can quickly test Halcyon-1B using Huggingface Transformers:
|
| 54 |
+
|
| 55 |
+
```python
|
| 56 |
+
from unsloth import FastModel
|
| 57 |
+
from transformers import TextStreamer
|
| 58 |
+
|
| 59 |
+
# Load model and tokenizer
|
| 60 |
+
model, tokenizer = FastModel.from_pretrained(
|
| 61 |
+
model_name = "colesmcintosh/Halcyon-1B",
|
| 62 |
+
max_seq_length = 2048,
|
| 63 |
+
load_in_4bit = True,
|
| 64 |
+
)
|
| 65 |
+
|
| 66 |
+
# Format prompt using Gemma-3 chat template
|
| 67 |
+
messages = [{
|
| 68 |
+
"role": "user",
|
| 69 |
+
"content": [{"type" : "text", "text" : "Write a mythological tale about how the oceans came to be."}]
|
| 70 |
+
}]
|
| 71 |
+
|
| 72 |
+
text_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True)
|
| 73 |
+
text_str = tokenizer.decode(text_ids)
|
| 74 |
|
| 75 |
+
# Generate response
|
| 76 |
+
outputs = model.generate(
|
| 77 |
+
**tokenizer([text_str], return_tensors="pt").to("cuda"),
|
| 78 |
+
max_new_tokens=64,
|
| 79 |
+
temperature=1.0,
|
| 80 |
+
top_p=0.95,
|
| 81 |
+
top_k=64,
|
| 82 |
+
streamer=TextStreamer(tokenizer, skip_prompt=True),
|
| 83 |
+
)
|
| 84 |
+
```
|