AI & ML interests

A one-year-long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

christopher 
in bigscience/bloom 2 months ago

[SPAM] Deleted

3
#289 opened 2 months ago by
sarthak-saxena
stas 
posted an update 2 months ago
Good news! Ulysses Sequence Parallelism from the Snowflake AI Research and DeepSpeed teams has been integrated into
the Hugging Face Trainer, Accelerate, and TRL.

For extensive details please see this writeup:
https://huggingface.co/blog/ulysses-sp

Thanks a lot to Kashif Rasul for helping make it happen, and to the others on the HF team who helped with the integration.
christopher 
in bigscience/bloom 2 months ago

pretokenizer Regex issues?

8
#278 opened almost 2 years ago by
hpcpony
christopher 
in bigscience/bloom 3 months ago

Test PR

#286 opened 3 months ago by
FIRSTACCOUNT69

Test discussion

#287 opened 3 months ago by
FIRSTACCOUNT69

Test discussion

#288 opened 3 months ago by
FIRSTACCOUNT69