Metadata Conditioned LLMs
Collection
Pretraining Data: English NOW corpus (english-corpora.org/now). Paper: arxiv.org/abs/2601.15236. Code: github.com/iamshnoo/metadata_localization • 91 items • Updated
This repo contains the merged chat model for the combined without metadata branch of the metadata localization project. It was produced by supervised fine-tuning on the project QA benchmark after project pretraining.
sft_chatchatwithout_metadatacombined_without_metadata_1bPEFT / LoRAadamw_bnb_8bitbf16=True, gradient_checkpointing=True, use_liger_kernel=Trueper_device_train_batch_size=2, gradient_accumulation_steps=8q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_projThis model is part of the metadata localization release. Related checkpoints and variants are grouped in the public Hugging Face collection Metadata Conditioned LLMs.
Last synced: 2026-04-02 14:48:17 UTC