Shell-Code-Large is a large-scale corpus of Shell scripting source code comprising approximately 640,000 code samples stored in JSON Lines (.jsonl) format. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, DevOps automation, cloud infrastructure engineering, system administration, and software engineering automation.
By providing a high-volume, language-specific corpus focused exclusively on Shell scripting, Shell-Code-Large enables systematic experimentation in automation workflows, deployment pipelines, infrastructure management, and command-line tooling. These domains remain foundational to Linux systems, cloud-native platforms, CI/CD environments, and modern DevOps practices.
Shell-Code-Large addresses the need for a dedicated Shell-focused dataset at substantial scale, enabling targeted research into scripting patterns, command composition, workflow orchestration, infrastructure automation, and operational engineering practices
I just released Inflect-Nano-v1, an ultra-small 4.63 parameter text-to-speech model.
The main idea is simple: instead of only making the acoustic model tiny and relying on a larger external vocoder, Inflect-Nano-v1 keeps the complete text-to-waveform stack under 5M parameters.
Quick facts: - 4.63M total inference parameters - 3.46M acoustic model - 1.17M vocoder - 24 kHz audio - English-only - Single male voice - Runs locally with a simple PyTorch inference script
Why I made it: Most modern TTS models are much larger, and even many “small TTS” projects depend on a separate vocoder. I wanted to see how far a complete tiny TTS stack could be pushed while still producing usable speech.
It is not SOTA, and I am not trying to claim it competes with large TTS systems. The interesting part is the size-to-functionality ratio.
What works: It can generate arbitrary English speech locally, and the model is small enough to be interesting for:
- local voice assistants - embedded/edge experiments - browser or WASM-style TTS exploration - efficient inference research - tiny-model baselines
Limitations: The quality is still limited. It can sound robotic, stumble on difficult unseen text, and the vocoder is still a clear bottleneck. Long or unusual prompts are less reliable.
So I would frame this as a research/demo release, not a production TTS engine.
I’d love feedback from people interested in: - tiny speech models - vocoders - local TTS - efficient inference - embedded speech synthesis - improving small-model generalization
If people find it useful, I’m interested in putting more training budget into a stronger v2.
Wan2.2-I2V-Fast with highly upscaled sequential frame sampling is now available as a Spaces demo, built using Wan2.2-I2V and FLUX.2-Klein. Try the demo using the links below.👇
Drop a screenshot of ANY website or app, and THE INSPECTOR — a film-noir detective — works it as a crime scene: he circles each UX flaw on the real pixels, names the charge, and files a verdict with a letter grade. A UX audit that plays like a detective thriller.
But the verdict is just the opening statement. Now it goes further:
⚖️ THE TRIAL — put the interface on trial. The guilty UI elements take the stand and defend themselves while the Inspector rules from the evidence. 🖼️ THE RECONSTRUCTION — one click and FLUX.2 Klein rebuilds the worst element FIXED, live. Before/after, on the real pixels. 🔊 THE VOICE — hear the verdict read aloud (Kokoro, local, no keys). 🚨 MOST WANTED — a public rogues' gallery. Book your case onto a shared board where the city's worst interfaces are ranked by their crimes. Booked by the public.
Three small models, all on Modal (scale-to-zero), none over 32B: 👁️ Qwen2.5-VL-7B (vision agent) · 🖼️ FLUX.2 Klein (reconstruction) · 🔊 Kokoro-82M (voice)