thefinalboss
/

CogNet-40M

Text Generation

non-transformer

cognitive-routing

hierarchical-memory

character-level

custom-architecture

Model card Files Files and versions

CogNet-40M / README.md

thefinalboss's picture

Upload README.md with huggingface_hub

39a16d2 verified 22 days ago

|

History Blame Contribute Delete

1.92 kB

	---
	license: mit
	language:
	- en
	- fr
	- code
	tags:
	- non-transformer
	- cognitive-routing
	- hierarchical-memory
	- character-level
	- aicl
	- text-generation
	- custom-architecture
	pipeline_tag: text-generation
	library_name: pytorch
	---

	# CogNet-40M

	A 39.7M parameter non-transformer language model with O(n) cognitive routing and hierarchical memory.

	## Architecture

	\| Component \| Detail \|
	\|-----------\|--------\|
	\| Architecture \| Non-transformer (Cognitive Routing) \|
	\| Parameters \| 39,718,536 (~40M) \|
	\| Hidden Dim \| 512 \|
	\| Blocks \| 6 cognitive blocks \|
	\| Channels \| 6 routing channels x 128 dim \|
	\| FF Dim \| 1024 \|
	\| Max Seq Len \| 256 \|
	\| Tokenizer \| Character-level (136 vocab) \|

	## Hierarchical Memory

	- Working Memory (32 slots): Active processing
	- Episodic Memory (64 slots): Short-term recall
	- Semantic Memory (128 slots): Long-term knowledge

	## Training

	\| Metric \| Value \|
	\|--------\|-------\|
	\| Steps \| 50,000 \|
	\| Batch Size \| 64 \|
	\| LR \| 3e-4 (cosine) \|
	\| Precision \| FP16 AMP \|
	\| GPU \| RTX 5060 Ti 16GB \|
	\| Final Loss \| ~0.005 \|
	\| Final PPL \| ~1.01 \|

	## Quick Start

	```python
	from inference import CogNetInference
	ai = CogNetInference("cognet_best.pt", "tokenizer_v3.json")
	print(ai.generate("Once upon a time"))
	```

	## AICL Integration

	CogNet powers AICL (Architecture Compilation Language) as its native AI engine for code generation, diagnosis, and repair.

	## Files

	\| File \| Size \| Description \|
	\|------\|------\|-------------\|
	\| cognet_best.pt \| 152MB \| FP32 checkpoint \|
	\| cognet_fp16.pt \| 77MB \| FP16 checkpoint \|
	\| tokenizer_v3.json \| - \| Char tokenizer (136 vocab) \|
	\| config.json \| - \| Model config \|
	\| cognet_model.py \| - \| Architecture source \|
	\| inference.py \| - \| Inference script \|

	## Roadmap

	- [x] CogNet-40M (39.7M)
	- [x] HuggingFace integration
	- [x] AICL native engine
	- [ ] CogNet-1B (1B params)
	- [ ] ONNX export

	MIT License. Built with PyTorch on RTX 5060 Ti via QuickPod.