Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
OpenTransformer
/
AGILLM-4
like
0
PyTorch
transformer
language-model
long-context
agillm
experimental
Model card
Files
Files and versions
xet
Community
main
AGILLM-4
208 kB
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
OpenTransformer
Harvest fused QKV projection from n1
18b3e9e
verified
1 day ago
.gitattributes
Safe
1.52 kB
initial commit
1 day ago
AGILLM-4.md
9.05 kB
Harvest fused QKV projection from n1
1 day ago
N1_HARVEST.md
3.16 kB
Harvest fused QKV projection from n1
1 day ago
README.md
1.16 kB
Harvest fused QKV projection from n1
1 day ago
agillm4_config.json
1.32 kB
Add AGILLM-4 training scaffold
1 day ago
anchor_memory.py
3.11 kB
Add AGILLM-4 training scaffold
1 day ago
block_sweep_agillm4.py
5.73 kB
Add AGILLM-4 training scaffold
1 day ago
estimate_agillm4_params.py
2.61 kB
Add AGILLM-4 training scaffold
1 day ago
local_1050ti_micro3x_sdpa_checkpoint_sweep.json
1.82 kB
Add AGILLM-4 training scaffold
1 day ago
local_memory_torture_smoke.jsonl
8.01 kB
Add AGILLM-4 training scaffold
1 day ago
local_profile_micro3x_256_sdpa_vs_sublinear.json
2.19 kB
Add AGILLM-4 training scaffold
1 day ago
local_profile_pico64.json
2.18 kB
Add AGILLM-4 training scaffold
1 day ago
local_smoke_text.jsonl
398 Bytes
Add AGILLM-4 training scaffold
1 day ago
local_sublinear_sweep_pico.json
975 Bytes
Add AGILLM-4 training scaffold
1 day ago
local_sweep_after_mfold_sdpa.json
488 Bytes
Harvest exact M-fold attention from n1
1 day ago
local_sweep_after_mfold_sublinear.json
489 Bytes
Harvest exact M-fold attention from n1
1 day ago
local_sweep_after_qkv_sdpa.json
488 Bytes
Harvest fused QKV projection from n1
1 day ago
local_sweep_after_qkv_sublinear.json
487 Bytes
Harvest fused QKV projection from n1
1 day ago
local_verify_m_fold_after_qkv_agillm4.json
3.55 kB
Harvest fused QKV projection from n1
1 day ago
local_verify_m_fold_after_qkv_fix_agillm4.json
3.55 kB
Harvest fused QKV projection from n1
1 day ago
local_verify_m_fold_agillm4.json
3.55 kB
Harvest exact M-fold attention from n1
1 day ago
local_verify_qkv_agillm4.json
3.28 kB
Harvest fused QKV projection from n1
1 day ago
local_verify_qkv_all_backends_agillm4.json
4.92 kB
Harvest fused QKV projection from n1
1 day ago
local_verify_qkv_sublinear_agillm4.json
819 Bytes
Harvest fused QKV projection from n1
1 day ago
long_context_curriculum.py
4.88 kB
Add AGILLM-4 training scaffold
1 day ago
nB300_agillm4.py
109 kB
Harvest fused QKV projection from n1
1 day ago
profile_agillm4.py
9.18 kB
Add AGILLM-4 training scaffold
1 day ago
run_agillm4_4090_longblock.sh
1.28 kB
Add AGILLM-4 training scaffold
1 day ago
run_agillm4_4090_sublinear_probe.sh
691 Bytes
Add AGILLM-4 training scaffold
1 day ago
run_agillm4_main_b200_b300.sh
1.18 kB
Add AGILLM-4 training scaffold
1 day ago
verify_m_fold_agillm4.py
6.31 kB
Harvest fused QKV projection from n1
1 day ago
verify_qkv_agillm4.py
10.3 kB
Harvest fused QKV projection from n1
1 day ago