Hugging Face
4
BarryAdams
ChessWarrior
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
liked
a model
26 days ago
prithivMLmods/Nanbeige4.1-3B-f32-GGUF
liked
a model
27 days ago
Nanbeige/Nanbeige4.1-3B
reacted to marksverdhei's post
about 1 month ago
Poll: Will 2026 be the year of subquadratic attention?

The transformer architecture is cursed by its computational complexity. It is why you run out of tokens and have to compact. But some would argue that this is a feature, not a bug, and that this is also why these models are so good. We've been doing a lot of research on trying to make equally good models that are computationally cheaper, but so far, none of the approaches have stood the test of time. Or so it seems.

Please vote, don't be shy. Remember that the Dunning-Kruger effect is very real, so the person who knows less about transformers than you is going to vote. We want everyone's opinion, no matter their confidence.

[emoji] if you think at least one frontier model* will have no O(n^2) attention by the end of 2026
🔥 if you disagree

* Frontier models - models that match / outperform the flagship claude, gemini or chatgpt at the time on multiple popular benchmarks
Organizations
None yet
ChessWarrior's activity
liked
a model
26 days ago
prithivMLmods/Nanbeige4.1-3B-f32-GGUF
Text Generation • 4B • Updated 28 days ago • 3.65k • 4
liked
a model
27 days ago
Nanbeige/Nanbeige4.1-3B
Text Generation • 4B • Updated 16 days ago • 642k • 1k
liked
a model
2 months ago
gudo7208/CAD-Coder
Text Generation • 8B • Updated Jan 9 • 617 • 1
liked
a model
3 months ago
may-ur08/solidworks-hackathon-model
Updated Dec 20, 2025 • 1