vlsi-moe-yarn

A domain-specialized language model for VLSI and chip design reasoning. Built on a custom Qwen-based architecture where 10% of FFN layers are replaced with reasoning-optimized feed-forward blocks, extended to 262,144 tokens via YaRN, and trained through knowledge distillation on a VLSI-domain corpus. Served on AMD Instinct MI300X using ROCm and vLLM.

Model Details

Model Description

vlsi-moe-yarn is a chip-design AI assistant that can reason over RTL code, architecture documents, timing constraints, and full datasheets in a single 262K-token context window. It was created by replacing 10% of the standard Qwen FFN layers with reasoning-specialized feed-forward blocks, then knowledge-distilling from a larger teacher model on hardware design data.

  • Developed by: Vicky (Vickyrrrrrr)
  • Model type: Causal Language Model โ€” Qwen-based with reasoning FFN substitution + YaRN
  • Language(s): English (technical/domain: VLSI, EDA, RTL, chip architecture)
  • License: MIT
  • Finetuned from model: Qwen (base architecture)

Model Usage In :

Uses

Direct Use

Use this model directly as a chip design assistant via the vLLM OpenAI-compatible API. Ask it architecture questions, paste RTL for review, query SDC constraints, or load an entire datasheet into context and interrogate it โ€” no fine-tuning needed.

Downstream Use

  • Plug into agentic frameworks (LangChain, LlamaIndex, AutoGen , CrewAI) as the reasoning backbone
  • Embed in EDA tool pipelines for automated RTL review or constraint generation
  • Power internal VLSI copilot tools at chip design companies or research labs

Out-of-Scope Use

  • General-purpose conversation or creative writing
  • Medical, legal, or financial advice
  • Any domain outside hardware design, RTL, and chip architecture

Bias, Risks, and Limitations

  • The model's knowledge is bounded by its VLSI distillation corpus โ€” it may lack awareness of very recent process nodes, proprietary EDA tool specifics, or niche IP.
  • It can hallucinate plausible-sounding but incorrect timing numbers, constraints, or RTL constructs. Always verify outputs against your
Downloads last month
102
Safetensors
Model size
33B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Space using vxkyyy/vlsi-moe-yarn 1