https://github.com/jzhang38/LongMamba
Zhang Peiyuan
PY007
AI & ML interests
None yet
Organizations
EasyContext
https://github.com/jzhang38/EasyContext
-
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_1M
Viewer • Updated • 5.04k • 98 • 2 -
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 3.94k • 68 • 1 -
PY007/EasyContext-1M-Llama-2-7B
Text Generation • 7B • Updated • 3 • 4 -
PY007/slimpajama_mistral_tokenized_upsample_4096_chunk_128K
Viewer • Updated • 37.9k • 328
LongMamba
https://github.com/jzhang38/LongMamba
EasyContext
https://github.com/jzhang38/EasyContext
-
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_1M
Viewer • Updated • 5.04k • 98 • 2 -
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 3.94k • 68 • 1 -
PY007/EasyContext-1M-Llama-2-7B
Text Generation • 7B • Updated • 3 • 4 -
PY007/slimpajama_mistral_tokenized_upsample_4096_chunk_128K
Viewer • Updated • 37.9k • 328
models 5
PY007/slimpajama_LLAMA3_tokenized_chunk_512K_debug
Updated
PY007/vicuna-7b-v1.5
Text Generation • 7B • Updated • 7
PY007/EasyContext-256K-danube2-1.8b
Text Generation • 2B • Updated • 12 • 5
PY007/EasyContext-1M-Llama-2-7B
Text Generation • 7B • Updated • 3 • 4
PY007/LongMamba_16384_bs128_step400
Updated • 28 • 5
datasets 27
PY007/Attn-QAT
Viewer • Updated • 3 • 94
PY007/bf16_videos
Viewer • Updated • 3 • 122
PY007/nvfp4_videos
Viewer • Updated • 3 • 54
PY007/sage3_videos
Viewer • Updated • 3 • 178
PY007/crush-smol
Viewer • Updated • 4 • 203
PY007/slimpajama_Qwen2_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 6.79k • 329
PY007/slimpajama_Yi1.5_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 7.48k • 55
PY007/slimpajama_llama2_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 7.79k • 104
PY007/slimpajama_LLAMA3_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 6.64k • 54
PY007/wild_chat_llama3_template_tokenized_merged_1M
Viewer • Updated • 1.27k • 101