Kenza Benkirane

kenza-ily

·

https://kenza-ily.com

AI & ML interests

Healthcare NLP, XAI, multimodal approaches

Organizations

upvoted 2 articles 3 months ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 9

• 62

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

+5

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 910

upvoted 2 papers 3 months ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17, 2025 • 37

Alignment Makes Language Models Normative, Not Descriptive

Paper • 2603.17218 • Published Mar 17 • 46

upvoted a collection 4 months ago

Health AI Developer Foundations (HAI-DEF)

Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 22 items • Updated Mar 12 • 225

upvoted a paper 4 months ago

PubMedQA: A Dataset for Biomedical Research Question Answering

Paper • 1909.06146 • Published Sep 13, 2019 • 4

upvoted a collection 4 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.7k

upvoted a paper 8 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

upvoted an article 8 months ago

Article

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

imomayiz

•

Sep 16, 2025

• 19

upvoted 2 articles over 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889

Article

Topic 27: What are Chain-of-Agents and Chain-of-RAG?

Kseniase

•

Feb 13, 2025

• 18

upvoted a collection over 1 year ago

Biomedical NLP papers

Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) • 183 items • Updated Jan 24, 2025 • 43

upvoted a paper over 1 year ago

Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions

Paper • 2402.18060 • Published Feb 28, 2024 • 2

upvoted a collection almost 2 years ago

Embedding Model Datasets

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Dec 10, 2025 • 173

upvoted a paper almost 2 years ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 66

upvoted 2 articles almost 2 years ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

+1

loubnabnl, anton-l, davanstrien

•

Mar 20, 2024

• 114

Article

SmolLM - blazingly fast and remarkably powerful

+1

loubnabnl, anton-l, eliebak

•

Jul 16, 2024

• 460

upvoted a paper almost 2 years ago

BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains

Paper • 2402.10373 • Published Feb 15, 2024 • 11

upvoted 2 collections over 2 years ago

Educational Resources for Medical LLMs

Curated medical LLM datasets and models for use in curricular content, particularly for medical professionals (e.g. medical students). • 15 items • Updated Dec 1, 2023 • 6

Healthcare Bias Eval Datasets

Benchmarks and other datasets that can be used to evaluate bias in healthcare settings. • 5 items • Updated Dec 9, 2023 • 1