---
license: mit
datasets:
- roneneldan/TinyStories
language:
- en
pipeline_tag: text-generation
tags:
- LLM
- pytorch
- Story
- Text
---

# Story-LLM (TinyStories Transformer)

Story-LLM is a small GPT-style Transformer language model trained from scratch on the **TinyStories** dataset.

This repository contains the **pretrained weights and tokenizer**. The model uses a **custom PyTorch architecture** and a **ByteLevel BPE tokenizer**.

👉 **For full documentation, training code, and usage instructions, see the GitHub repository:**
**https://github.com/Daniel-Gia/Story-LLM**

---

### Notes

- ~29M parameters
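The ~29M figure is consistent with a standard back-of-envelope sizing formula for GPT-style decoder-only models. The sketch below illustrates that arithmetic; the vocabulary size, width, and depth it uses are hypothetical examples, not the actual Story-LLM hyperparameters (those live in the GitHub repository).

```python
# Back-of-envelope GPT parameter count. The configuration values used in the
# example are hypothetical illustrations, NOT the actual Story-LLM config
# (see the GitHub repo for the real hyperparameters).

def gpt_param_count(vocab_size: int, d_model: int, n_layers: int,
                    tied_embeddings: bool = True) -> int:
    """Approximate parameter count for a GPT-style decoder-only Transformer."""
    embed = vocab_size * d_model  # token embedding matrix
    # Per block: attention Q/K/V + output projections (4*d^2 + 4d biases),
    # MLP with 4x expansion (8*d^2 + 5d biases), two LayerNorms (4d).
    per_layer = 12 * d_model**2 + 13 * d_model
    total = embed + n_layers * per_layer + 2 * d_model  # + final LayerNorm
    if not tied_embeddings:
        total += vocab_size * d_model  # separate (untied) LM head
    return total

# Example: vocab 8192, d_model 512, 8 layers lands near 29M with tied weights.
print(gpt_param_count(8192, 512, 8))  # ≈ 29.4M
```

Exact totals depend on details such as positional embeddings, bias terms, and weight tying, so this is a rough estimate rather than the model's precise count.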