HDTenEightyP
/

GPT-Usenet

Text Generation

text-generation-inference

Model card Files Files and versions

HDTenEightyP commited on Nov 21, 2025

Commit

c5e4dba

·

verified ·

1 Parent(s): 9666d33

Update README.md

Files changed (1) hide show

README.md +14 -3

README.md CHANGED Viewed

@@ -1,3 +1,14 @@
----
-license: mit
----

+---
+license: mit
+language:
+- en
+tags:
+- text-generation-inference
+pipeline_tag: text-generation
+---
+An 81-million parameter LLM using GPT-2 encodings.
+Trained using 10GB of USENET posts along with over 1 GB of miscellaneous BBS posts, digitized books, and text documents.
+This model has 10 layers, 10 heads and 640 embeddings, with a context window of 1024 tokens.
+It was able to achieve a training loss of 2.3256 and validation loss of 2.3651.
+Supervised fine-tuning should be performed before use.