SpiceeChat
/

Base-mini

Text Generation

Model card Files Files and versions

Base-mini / README.md

QuantaSparkLabs's picture

QuantaSparkLabs

Update README.md

5870286 verified about 10 hours ago

|

History Blame Contribute Delete

1.16 kB

	---
	license: apache-2.0
	language:
	- en
	tags:
	- tiny-gpt
	- from-scratch
	- spiceechat
	pipeline_tag: text-generation
	---

	<p align="center">
	<img src="https://readme-typing-svg.demolab.com?font=Space+Grotesk&weight=600&size=22&pause=800&color=00E7FF&center=true&vCenter=true&width=600&lines=SpiceeChat%2FBase-mini;7+Million+Parameters;Trained+on+Vibes+and+ML+Jargon" />
	</p>

	Basemini is a tiny GPT-style transformer trained from scratch on synthetic ML sentences.

	For an Experiment

	Load it like any HF model
	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer
	model = AutoModelForCausalLM.from_pretrained("SpiceeChat/Basemini", trust_remote_code=True)
	tokenizer = AutoTokenizer.from_pretrained("SpiceeChat/Basemini", trust_remote_code=True)
	```

	It runs on CPU. It runs on T4. It runs on a potato.

	You ask it something. It gives you a sentence with the word "transformer" in it. That's the deal.

	Made with sweat, tears, and `torch_fallback=True`. Pull requests welcome. Donations not needed. Moral support accepted.

	<img src="https://capsule-render.vercel.app/api?type=waving&height=80&section=footer&color=0:00E7FF,100:8B5CF6"/>