1D-subchaneled DeepSeek checkpoints for usage on Google TPUs
Building on HF
Jacob Platin
jrplatin
AI & ML interests
None yet
Recent Activity
updated
a model 29 days ago
jrplatin/DeepSeek-R1-1D-Subchannel-256-Packed updated
a model 29 days ago
jrplatin/DeepSeek-R1-1D-Subchannel-512 updated
a model 29 days ago
jrplatin/DeepSeek-R1-1D-Subchannel-256 Organizations
None yet