Vjeong
/

LLM-1B-Lab

Model card Files Files and versions

LLM-1B-Lab / llm_lab /model

28.9 kB

Ctrl+K

Ctrl+K

2 contributors

History: 7 commits

Vjeong's picture

Fix dtype mismatch in RoPE cos/sin for mixed precision training

331cfcd 4 days ago

__init__.py

573 Bytes
docs: translate all Korean comments and docstrings to English about 1 month ago
attention.py

7.75 kB
Replace F.scaled_dot_product_attention with explicit implementation 5 days ago
feedforward.py

1.95 kB
Replace F.silu with explicit SiLU implementation in SwiGLUFeedForward 5 days ago
llm_model.py

7.93 kB
refactor(model): replace single-letter vars with descriptive names for readability about 1 month ago
norm.py

1.42 kB
docs: translate all Korean comments and docstrings to English about 1 month ago
rope.py

4.28 kB
Fix dtype mismatch in RoPE cos/sin for mixed precision training 4 days ago
transformer_block.py

2.16 kB
refactor(model): replace single-letter vars with descriptive names for readability about 1 month ago
utils.py

2.82 kB
docs: translate all Korean comments and docstrings to English about 1 month ago