Created By Ayjays132 / Young Philly P. / Phillip A. Holland
A public custom-code multimodal model package built to run text, image, video, speech, audio, and routing from one loaded Transformers model object.
Ayjays132/phillnet-2 · AutoTokenizer · AutoModelForCausalLM · trust_remote_code=True
Phillnet-2 is an experimental AXIOM multimodal GPT-OSS runtime packaged as a Hugging Face transformers custom-code model. It is designed as a new-model competitor package: one public repository, one primary load path, and a single runtime object that exposes text generation, code guidance, image generation, short video generation, speech synthesis, audio listening, and route inspection.
The model card is written for builders who want to inspect and run the system locally. The examples below use the same public load path a Hugging Face user would use after upload:
repo_id = "Ayjays132/phillnet-2" tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True) model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
Phillnet-2 is not presented here as a closed leaderboard claim. It is presented as an integrated multimodal runtime with measured local validation, real generated image examples, bounded lm-eval slices, code smoke testing, public usage code, and a merged 1000-step text continued-training pass.
Ayjays132/phillnet-2
model.safetensors
Validated Loader
Loads through AutoTokenizer and AutoModelForCausalLM with trust_remote_code=True.
Multimodal Runtime
Text, code guidance, image, video, speech, audio listening, and route history are exposed from one model object.
Measured, Not Inflated
Benchmarks are reported as bounded local validation and are not presented as SOTA leaderboard claims.
This repository includes the local modality runtimes used by the model:
- 🖼️ ImageGen/: packaged image route with local UNet/VAE assets.
- 🎬 VideoGen/: fused/composer video route.
- 🔊 Audio/: speech, audio encoding, and Whisper listening route.
- 🧭 AgenticScaffold/, CodeGuidanceSystem/, Tools/: local planning, retrieval, SmolAgent, and guidance runtime pieces.
- 🧩 adapters/: training provenance for the merged 1000-step text LoRA adapter; normal loading already uses the merged root weights.
- 📂 examples/: public example outputs generated through the loaded Phillnet-2 model object.




