Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Vjeong
/
LLM-1B-Lab

Safetensors
English
llm-1b-lab
llama
decoder-only
educational
pretrained
Model card Files Files and versions
xet
Community
LLM-1B-Lab / llm_lab /model
28.9 kB
Ctrl+K
Ctrl+K
  • 2 contributors
History: 7 commits
Vjeong's picture
Vjeong
Fix dtype mismatch in RoPE cos/sin for mixed precision training
331cfcd 4 days ago
  • __init__.py
    573 Bytes
    docs: translate all Korean comments and docstrings to English about 1 month ago
  • attention.py
    7.75 kB
    Replace F.scaled_dot_product_attention with explicit implementation 5 days ago
  • feedforward.py
    1.95 kB
    Replace F.silu with explicit SiLU implementation in SwiGLUFeedForward 5 days ago
  • llm_model.py
    7.93 kB
    refactor(model): replace single-letter vars with descriptive names for readability about 1 month ago
  • norm.py
    1.42 kB
    docs: translate all Korean comments and docstrings to English about 1 month ago
  • rope.py
    4.28 kB
    Fix dtype mismatch in RoPE cos/sin for mixed precision training 4 days ago
  • transformer_block.py
    2.16 kB
    refactor(model): replace single-letter vars with descriptive names for readability about 1 month ago
  • utils.py
    2.82 kB
    docs: translate all Korean comments and docstrings to English about 1 month ago