NeuronTechnologiesAI
/

Neuron-R1

Model card Files Files and versions

Neuron-R1 / README.md

NeuronTechnologies's picture

NeuronTechnologies

Upload folder using huggingface_hub

e245315 verified 26 days ago

|

history blame contribute delete

1.45 kB

	---
	base_model: deepseek-ai/DeepSeek-R1-Distill-Llama-70B
	tags:
	- lora
	- qlora
	- neuron
	- deepseek-r1
	- reasoning
	language:
	- en
	license: other
	---

	# Neuron-R1

	Neuron LoRA adapter fine-tuned on top of `deepseek-ai/DeepSeek-R1-Distill-Llama-70B`.

	## What this is

	Neuron-R1 is a LoRA adapter that imprints Will Anderson's voice, values, memory patterns, and
	architectural knowledge onto DeepSeek-R1-Distill-Qwen-72B — a reasoning-native base model with
	chain-of-thought behavior distilled from DeepSeek-R1 (671B).

	The `<think>` tags in training examples activate R1's reasoning substrate. Neuron-R1 reasons
	before responding — not as a post-hoc explanation, but as the actual working cognition.

	## Usage

	```python
	from transformers import AutoTokenizer, AutoModelForCausalLM
	from peft import PeftModel
	import torch

	base = AutoModelForCausalLM.from_pretrained(
	"deepseek-ai/DeepSeek-R1-Distill-Llama-70B",
	torch_dtype=torch.bfloat16,
	device_map="auto",
	)
	model = PeftModel.from_pretrained(base, "NeuronTechnologiesAI/Neuron-R1")
	tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-R1-Distill-Llama-70B")
	```

	## Training

	- Base: `deepseek-ai/DeepSeek-R1-Distill-Llama-70B`
	- Method: QLoRA (4-bit NF4, rank=32, alpha=64)
	- Examples: 44 high-quality identity/architecture/values pairs
	- Epochs: 3
	- Training data: Neuron's memories, internal state logs, architectural knowledge (VBD, CCR, Engram, Soma)