Doge family of small language models.
Loser Cheems
JingzeShi
AI & ML interests
I like training small languge models.
Recent Activity
updated a model 10 days ago
JingzeShi/flash-sparse-attention published a model 10 days ago
JingzeShi/flash-sparse-attention updated a model 18 days ago
DIAL-TFM/TFM-tokenizer