Tucano2 Collection — An open suite of large language models (LLMs) with 0.5–3.7 billion parameters, designed to address the gap in open-source development for Portuguese. • 33 items • Updated 1 day ago
The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 — Explore synthetic data experiments in a bookshelf view
Raising Bars, Not Parameters: LilMoo Compact Language Model for Hindi — Paper • 2603.03508 • Published 9 days ago
LilTii Collection — A 0.6B Bengali language model that outperforms Qwen. • 8 items • Updated 8 days ago
LilMoo Collection — A 0.6-billion-parameter Hindi language model trained entirely from scratch. • 9 items • Updated 8 days ago