Filipino NLP Resources and Models

community

https://filbench.github.io/

filbench

Activity Feed

AI & ML interests

Multilinguality, computational linguistics, Filipino NLP

Recent Activity

ljvmiranda921 authored a paper 6 days ago

Multilinguality at the Edge: Developing Language Models for the Global South

ljvmiranda921 authored a paper 26 days ago

Olmo 3

ljvmiranda921 authored a paper 26 days ago

Polyglot Teachers: Evaluating Language Models for Multilingual Synthetic Data Generation

View all activity

Papers

FilBench: Can LLMs Understand and Generate Filipino?

View all Papers

ljvmiranda921

authored a paper 6 days ago

Multilinguality at the Edge: Developing Language Models for the Global South

Paper • 2604.21637 • Published 17 days ago

ljvmiranda921

authored 2 papers 26 days ago

Olmo 3

Paper • 2512.13961 • Published Dec 15, 2025 • 32

Polyglot Teachers: Evaluating Language Models for Multilingual Synthetic Data Generation

Paper • 2604.11290 • Published 27 days ago • 2

ljvmiranda921

submitted a paper to Daily Papers 26 days ago

Polyglot Teachers: Evaluating Language Models for Multilingual Synthetic Data Generation

Paper • 2604.11290 • Published 27 days ago • 2

ljvmiranda921

updated a Space 27 days ago

FilBench Leaderboard

🥇

An Open LLM Leaderboard for Filipino

ljvmiranda921

updated a collection 6 months ago

FilBench Eval

Collection

FilBench-Eval is an Open LLM Evaluation Suite for Philippine Languages. The eval runner is integrated with HuggingFace's lighteval. • 5 items • Updated Jan 14 • 1

ljvmiranda921

authored a paper 6 months ago

FilBench: Can LLMs Understand and Generate Filipino?

Paper • 2508.03523 • Published Aug 5, 2025 • 1

ljvmiranda921

updated a collection 6 months ago

FilBench Eval

Collection

FilBench-Eval is an Open LLM Evaluation Suite for Philippine Languages. The eval runner is integrated with HuggingFace's lighteval. • 5 items • Updated Jan 14 • 1

ljvmiranda921

published a Space 9 months ago

FilBench Leaderboard

🥇

An Open LLM Leaderboard for Filipino

ljvmiranda921

updated a dataset 10 months ago

filbench/UD_Tagalog-NewsCrawl

Viewer • Updated Jul 23, 2025 • 15.6k • 53 • 1

jcblaise

authored 2 papers 11 months ago

Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability

Paper • 2506.01789 • Published Jun 2, 2025 • 15

CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation

Paper • 2505.24456 • Published May 30, 2025

ljvmiranda921

updated a collection 12 months ago

Universal Dependencies for Tagalog

Collection

Models and dependency parsers for Tagalog using the UD_NewsCrawl dataset • 8 items • Updated May 29, 2025

ljvmiranda921

authored a paper 12 months ago

R3: Robust Rubric-Agnostic Reward Models

Paper • 2505.13388 • Published May 19, 2025 • 11

ljvmiranda921

authored 2 papers about 1 year ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 48

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10, 2025 • 101

jcblaise

authored a paper about 1 year ago

Establishing Baselines for Text Classification in Low-Resource Languages

Paper • 2005.02068 • Published May 5, 2020

AI & ML interests

Recent Activity

Papers

Team members 8

filbench's activity

FilBench Leaderboard

FilBench Leaderboard