Niyati Bafna

niyatibafna

3 13

http://niyatibafna.github.io/

niyatibafna

AI & ML interests

Low resource NLP, interpretability, multilinguality

Recent Activity

updated a dataset 2 days ago

niyatibafna/rashid_icll_outputs

upvoted an article 26 days ago

Best Practices for Open Multilingual LLM Evaluation

upvoted an article 26 days ago

An Analysis of Multilingual Models on Hugging Face

View all activity

Organizations

updated a dataset 2 days ago

niyatibafna/rashid_icll_outputs

Viewer • Updated 2 days ago • 66.7k • 226

upvoted 2 articles 26 days ago

Article

Best Practices for Open Multilingual LLM Evaluation

catherinearnett

•

May 7, 2025

• 8

Article

An Analysis of Multilingual Models on Hugging Face

catherinearnett

•

Sep 18, 2025

• 6

upvoted 2 papers 3 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 49

BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation

Paper • 2402.03216 • Published Feb 5, 2024 • 10

published a dataset 3 months ago

niyatibafna/rashid_icll_outputs

Viewer • Updated 2 days ago • 66.7k • 226

authored a paper 3 months ago

Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published Mar 17 • 24

upvoted a paper 3 months ago

Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published Mar 17 • 24

upvoted a paper 4 months ago

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published Mar 10 • 29

authored a paper 7 months ago

ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models

Paper • 2510.16928 • Published Oct 19, 2025 • 4

upvoted a paper 7 months ago

ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models

Paper • 2510.16928 • Published Oct 19, 2025 • 4

upvoted a paper 8 months ago

SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains

Paper • 2507.07229 • Published Jul 9, 2025 • 11

upvoted an article 10 months ago

Article

mmBERT: ModernBERT goes Multilingual

mmarone, orionweller, will-fleshman, eugene-yang, dlawrie, vandurme

•

Sep 9, 2025

• 148

upvoted a paper 11 months ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9, 2025 • 78

upvoted 2 papers 12 months ago

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Paper • 2507.11412 • Published Jul 15, 2025 • 33

The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure

Paper • 2506.22724 • Published Jun 28, 2025 • 10

New activity in niyatibafna/imperfect_english_prompts 12 months ago

add paper abstract to readme

#2 opened 12 months ago by

zouharvi

add paper abstract to readme

#1 opened 12 months ago by

zouharvi

published a dataset 12 months ago

niyatibafna/imperfect_english_prompts

Viewer • Updated Jul 11, 2025 • 2.25M • 121

updated a dataset 12 months ago

niyatibafna/imperfect_english_prompts

Viewer • Updated Jul 11, 2025 • 2.25M • 121

Niyati Bafna

AI & ML interests

Recent Activity

Organizations

niyatibafna's activity

Best Practices for Open Multilingual LLM Evaluation

An Analysis of Multilingual Models on Hugging Face

mmBERT: ModernBERT goes Multilingual

add paper abstract to readme

add paper abstract to readme