We need to talk about the 'magic' behind Claude’s CUDA kernels. Is it superior synthetic data, or did Anthropic find a better way to teach LLMs hardware-level logic? Open to all technical theories
Baleeshwar Palavadi
aim143
·
AI & ML interests
None yet
Recent Activity
commented on an article about 1 month ago
We Got Claude to Build CUDA Kernels and teach open models! updated
a dataset almost 2 years ago
aim143/guanaco-llama2-500 liked
a model almost 2 years ago
aim143/tinystarcoder-rlhf-model