Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Vjeong
/
LLM-1B-Lab
like
0
Safetensors
HuggingFaceFW/fineweb-edu
English
llm-1b-lab
llama
decoder-only
educational
pretrained
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
LLM-1B-Lab
/
llm_lab
/
model
28.9 kB
Ctrl+K
Ctrl+K
2 contributors
History:
7 commits
Vjeong
Fix dtype mismatch in RoPE cos/sin for mixed precision training
331cfcd
4 days ago
__init__.py
Safe
573 Bytes
docs: translate all Korean comments and docstrings to English
about 1 month ago
attention.py
Safe
7.75 kB
Replace F.scaled_dot_product_attention with explicit implementation
5 days ago
feedforward.py
Safe
1.95 kB
Replace F.silu with explicit SiLU implementation in SwiGLUFeedForward
5 days ago
llm_model.py
Safe
7.93 kB
refactor(model): replace single-letter vars with descriptive names for readability
about 1 month ago
norm.py
Safe
1.42 kB
docs: translate all Korean comments and docstrings to English
about 1 month ago
rope.py
Safe
4.28 kB
Fix dtype mismatch in RoPE cos/sin for mixed precision training
4 days ago
transformer_block.py
Safe
2.16 kB
refactor(model): replace single-letter vars with descriptive names for readability
about 1 month ago
utils.py
Safe
2.82 kB
docs: translate all Korean comments and docstrings to English
about 1 month ago