BigScience Workshop

non-profit

https://bigscience.huggingface.co

bigscience-workshop

AI & ML interests

A one-year long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

craffel authored a paper 20 days ago

Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models

christopher new activity about 1 month ago

bigscience/mt0-large:why mt0-large is 1.3B while mt5-large is 780M?

christopher new activity about 1 month ago

bigscience/bloom-560m:Geração de Texto

View all activity

authored 2 papers 23 days ago

Multilingual Refusal Alignment for Safer Large Language Models

Paper • 2606.07535 • Published Apr 24

Reassessing High-Performing LLMs on Polish Medical Exams: True Competence or Bias-Driven Performance?

Paper • 2606.12250 • Published 25 days ago

in bigscience/mt0-large about 1 month ago

why mt0-large is 1.3B while mt5-large is 780M?

#6 opened almost 2 years ago by

in bigscience/bloom-560m about 1 month ago

Geração de Texto

#63 opened 8 months ago by

alcidesmoreira1963

Adding Evaluation Results

#61 opened over 2 years ago by

leaderboard-pr-bot

in bigscience/T0 about 1 month ago

Hosted inference API: 500 Internal Server Error returned

#4 opened almost 4 years ago by

in bigscience/bloom-1b1 about 1 month ago

Adding Evaluation Results

#41 opened over 2 years ago by

leaderboard-pr-bot

Adding Evaluation Results

#42 opened about 2 years ago by

leaderboard-pr-bot

Add evaluation results on the mathemakitten--winobias_antistereotype_test config and test split of mathemakitten/winobias_antistereotype_test

#32 opened over 3 years ago by

System Requirements

#38 opened over 3 years ago by

Request: DOI

#43 opened over 1 year ago by

in bigscience/bloom 4 months ago

pretokenizer Regex issues?

#278 opened almost 2 years ago by

Test PR

#286 opened 4 months ago by

Test discussion

#287 opened 4 months ago by

Test discussion

#288 opened 4 months ago by

authored a paper 5 months ago

PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues

Paper • 2601.17277 • Published Jan 24 • 6

authored a paper 5 months ago

INTIMA: A Benchmark for Human-AI Companionship Behavior

Paper • 2508.09998 • Published Aug 4, 2025 • 11

in bigscience/bloomz-560m 7 months ago

Fails to load with transformers v4.57+

#14 opened 7 months ago by

in bigscience/petals-api 8 months ago

Bloom

#2 opened 8 months ago by

authored a paper 8 months ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 107