VedaLM — 85.87M Parameter Instruction-Following LLM

Parameters: 85.87M
Layers: 14 transformer blocks
Attention: Grouped Query Attention (10Q / 5KV heads)
FFN: SwiGLU activation
Positional Encoding: RoPE (Rotary Position Embedding)
Normalization: RMSNorm
Context Length: 1024 tokens
Vocabulary: 32,000 BPE tokens
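
The figures above can be sanity-checked with a quick parameter count. The sketch below is illustrative rather than the model's actual configuration: the hidden size (640), the SwiGLU inner width (1792), tied input/output embeddings, and bias-free linear layers are all assumptions not stated in this card. They were picked because, together with the listed values, they happen to reproduce the stated total. The names `VedaLMConfig` and `count_parameters` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VedaLMConfig:
    # Values stated in the model card
    n_layers: int = 14
    n_q_heads: int = 10
    n_kv_heads: int = 5
    vocab_size: int = 32_000
    context_length: int = 1024
    # ASSUMPTIONS: not stated in the model card
    d_model: int = 640          # assumed hidden size (head_dim = 64)
    ffn_hidden: int = 1792      # assumed SwiGLU inner width
    tie_embeddings: bool = True  # assumed shared input/output embedding


def count_parameters(cfg: VedaLMConfig) -> int:
    """Count weights for a LLaMA-style decoder, assuming no bias terms."""
    head_dim = cfg.d_model // cfg.n_q_heads
    kv_dim = cfg.n_kv_heads * head_dim
    # Attention: W_q and W_o are d_model x d_model; W_k and W_v are d_model x kv_dim
    attn = 2 * cfg.d_model * cfg.d_model + 2 * cfg.d_model * kv_dim
    # SwiGLU uses three projections: gate, up, and down
    ffn = 3 * cfg.d_model * cfg.ffn_hidden
    # Two RMSNorm weight vectors per block (pre-attention and pre-FFN)
    norms = 2 * cfg.d_model
    per_layer = attn + ffn + norms

    embed = cfg.vocab_size * cfg.d_model
    lm_head = 0 if cfg.tie_embeddings else embed
    final_norm = cfg.d_model
    # RoPE is computed from positions and adds no learned parameters
    return cfg.n_layers * per_layer + embed + lm_head + final_norm


total = count_parameters(VedaLMConfig())
print(f"{total:,} parameters ({total / 1e6:.2f}M)")
# prints: 85,870,720 parameters (85.87M)
```

Under these assumptions the embedding table accounts for about 20.5M of the total, and the 14 blocks for the remaining 65.4M.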

A decoder-only transformer trained from scratch, inspired by the LLaMA-2 architecture.

Architecture
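
As an illustration of the 10Q / 5KV grouped query attention listed in the specification, here is a minimal NumPy sketch. It is not VedaLM's implementation: the function name `grouped_query_attention`, the head dimension of 64, and the random inputs are assumptions made for the example. It shows only the core idea, that each key/value head is shared by a group of two query heads, combined with a causal mask.

```python
import numpy as np

N_Q_HEADS, N_KV_HEADS = 10, 5   # from the model card
HEAD_DIM = 64                   # ASSUMPTION: not stated in the model card


def grouped_query_attention(q, k, v):
    """Causal attention where several query heads share one KV head.

    q: (n_q_heads, T, d); k, v: (n_kv_heads, T, d). Returns (n_q_heads, T, d).
    """
    n_q, seq_len, d = q.shape
    group = n_q // k.shape[0]  # query heads per KV head (2 for 10Q / 5KV)
    # Repeat each KV head so that query head h reads KV head h // group
    k_rep = np.repeat(k, group, axis=0)
    v_rep = np.repeat(v, group, axis=0)

    scores = q @ k_rep.transpose(0, 2, 1) / np.sqrt(d)
    # Causal mask: position i may not attend to positions j > i
    future = np.triu(np.ones((seq_len, seq_len), dtype=bool), k=1)
    scores = np.where(future, -np.inf, scores)

    # Numerically stable softmax over the key axis
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v_rep


rng = np.random.default_rng(0)
q = rng.standard_normal((N_Q_HEADS, 4, HEAD_DIM))
k = rng.standard_normal((N_KV_HEADS, 4, HEAD_DIM))
v = rng.standard_normal((N_KV_HEADS, 4, HEAD_DIM))
out = grouped_query_attention(q, k, v)
print(out.shape)  # (10, 4, 64)
```

Halving the number of KV heads halves the size of the key/value cache at inference time, which is the usual motivation for this layout.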

See the API Space for usage examples:
huggingface.co/spaces/aryan012234/vedalm-api