voxyne / README.md

voxyne v0.1: bf16 base + fp32/int8/int4 ONNX (int4 default) + model card

de4bdad verified 10 days ago

2.85 kB

license: cc-by-nc-4.0
language:
  - en
pipeline_tag: text-generation
tags:
  - byte-level
  - conversational
  - rahuketu
  - voxyne
  - int4
  - edge
  - cpu

Voxyne

A tiny (128.8M-param) byte-level conversational language model with the RahuKetu sigma_K control channel. It runs offline on CPU, and quantized to int4 it stays coherent in ~80 MB — the intended default.

Scope — please read. Voxyne is built to converse, not to know. It has an identity and conversational ability; it is not a knowledge base and will make up facts if asked — knowledge is meant to come from external tools/retrieval (a separate system). Judge it on coherence, not factual recall.

Files — int4 is the default

file	size	use
`voxyne-int4.onnx`	81 MB	DEFAULT — CPU/edge deployment (ONNX Runtime)
`voxyne-int8.onnx`	129 MB	int8 ONNX
`voxyne-fp32.onnx`	511 MB	full-precision ONNX
`voxyne-v0.1.pt`	258 MB	bf16 PyTorch weights (for the `voxyne` package / fine-tuning)

The ONNX graphs are a single-token decode step with an explicit KV cache (inputs h, pk, pv -> outputs logits, npk, npv); the byte embedding and the sigma_K encoder run outside the graph. See the voxyne package for the decode protocol.

Quick start (PyTorch)

pip install voxyne   # then download voxyne-v0.1.pt from this repo

from voxyne import VoxyneConfig, build, load_weights, generate

model, enc = build(VoxyneConfig())
load_weights(model, "voxyne-v0.1.pt")
print(generate(model, enc, "who are you?", device="cpu"))
# -> "I'm Voxyne, an AI assistant created by Ramakrishnan."

Training-data provenance (why the license)

Voxyne's weights are trained on a mix that includes non-commercial sources, so the weights are released under CC BY-NC 4.0 (non-commercial). Free for research, education, and personal use.

stage	sources (examples)	license note
pretrain	FineWeb-Edu, Cosmopedia, TinyStories	permissive / synthetic
grammar	WordNet, FrameNet, GoEmotions	permissive (attribution)
commonsense	ConceptNet, ATOMIC	ConceptNet = CC BY-SA
dialogue	SODA, UltraChat, OASST2, daily_dialog, empathetic_dialogues, WildChat	daily_dialog / empathetic = CC BY-NC; WildChat = AI2 ImpACT
identity	author-written	original

The code (voxyne package) is Apache-2.0; only the weights are NC. A future clean-data retrain (kalki) will carry Apache-licensed weights.

AI-assistance disclosure

Built by Ramakrishnan (ORCID 0009-0006-0905-7275). AI tools assisted with the training/quantization automation and tooling; the model design, direction, and all decisions are the author's.

License

Weights: CC BY-NC 4.0 (non-commercial). Code: Apache-2.0.