👋 Open to Work

6 47 105

Mahamadi NIKIEMA

madoss

https://mnikiema.github.io/

AI & ML interests

AI & ML Engineer focused on low-resource languages. Building multilingual NLP, ASR, and TTS systems for Mooré and French. Fine-tuning LLMs and speech models, curating parallel corpora, and shipping reproducible pipelines. Open to work.

Recent Activity

updated a model 5 days ago

madoss/wav2vec-finetuned

updated a dataset 6 days ago

madoss/faso-speech

published a dataset 11 days ago

burkimbia/moore-instruct-v1

View all activity

Organizations

upvoted an article 21 days ago

Article

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

nvidia

•

21 days ago

• 64

upvoted an article 24 days ago

Article

Introduction to Trimming ✂

lbourdois

•

28 days ago

• 40

upvoted a paper about 2 months ago

LFM2 Technical Report

Paper • 2511.23404 • Published Nov 28, 2025 • 61

upvoted a collection about 2 months ago

BidirLM

Collection

BidirLM is a family of 5 frontier bidirectional encoders, including an omnimodal variant at 2.5B. • 8 items • Updated Apr 15 • 1

upvoted 3 articles 2 months ago

Article

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

lightonai

•

Apr 21

• 42

Article

Building a Fast Multilingual OCR Model with Synthetic Data

nvidia

•

Apr 17

• 34

Article

Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers

ylacombe

•

Jan 19, 2024

• 48

upvoted 3 papers 2 months ago

upvoted an article 2 months ago

Article

Mastering Tensor Dimensions in Transformers

not-lain

•

Jan 12, 2025

• 185

upvoted an article 3 months ago

Article

How I contributed a new model to the Transformers library using Codex

nielsr

•

Mar 30

• 52

upvoted a collection 3 months ago

fiNERweb

Collection

A multilingual dataset for NER covering 91 langauges and 25 scripts • 3 items • Updated Dec 16, 2025 • 3

upvoted an article 3 months ago

Article

Introducing AI chunking to semchunk

isaacus

•

Mar 23

• 9

upvoted a paper 3 months ago

Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published Mar 17 • 23

upvoted a collection 4 months ago

Fine-tune ready versions of the LLMSQL benchmark

Collection

This collection contains the versions of the benchmark in fine-tune ready format • 2 items • Updated Mar 4 • 1

upvoted a paper 4 months ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 107

upvoted 2 articles 4 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

eggie5, martinigoyanes, frisokingma, andreumora, lvwerra, thomwolf, m-ric

•

Feb 4, 2025

• 131

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 169

upvoted a paper 4 months ago

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

Paper • 2602.18487 • Published Feb 11 • 6

Mahamadi NIKIEMA

AI & ML interests

Recent Activity

Organizations

madoss's activity

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Introduction to Trimming ✂

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

Building a Fast Multilingual OCR Model with Synthetic Data

Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers

Mastering Tensor Dimensions in Transformers

How I contributed a new model to the Transformers library using Codex

Introducing AI chunking to semchunk

DABStep: Data Agent Benchmark for Multi-step Reasoning

Mixture of Experts (MoEs) in Transformers