This model is MuRIL fine-tuned on the Tamil APTFiNER dataset for Fine-grained Named Entity Recognition.

The MultiCoNER2 tagset is fine-grained. The fine-to-coarse mapping of the tags is as follows:

Location (LOC) : Facility, OtherLOC, HumanSettlement, Station
Creative Work (CW) : VisualWork, MusicalWork, WrittenWork, ArtWork, Software
Group (GRP) : MusicalGRP, PublicCORP, PrivateCORP, AerospaceManufacturer, SportsGRP, CarManufacturer, ORG
Person (PER) : Scientist, Artist, Athlete, Politician, Cleric, SportsManager, OtherPER
Product (PROD) : Clothing, Vehicle, Food, Drink, OtherPROD
Medical (MED) : Medication/Vaccine, MedicalProcedure, AnatomicalStructure, Symptom, Disease
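
The mapping above can be written as a lookup table. The sketch below is illustrative only: the names `COARSE_TO_FINE`, `FINE_TO_COARSE`, and `to_coarse` are hypothetical, and the handling of `B-`/`I-` prefixes assumes the model emits BIO-style tags, which this card does not state.

```python
# Coarse-to-fine tag mapping, copied from the list above.
COARSE_TO_FINE = {
    "LOC": ["Facility", "OtherLOC", "HumanSettlement", "Station"],
    "CW": ["VisualWork", "MusicalWork", "WrittenWork", "ArtWork", "Software"],
    "GRP": ["MusicalGRP", "PublicCORP", "PrivateCORP", "AerospaceManufacturer",
            "SportsGRP", "CarManufacturer", "ORG"],
    "PER": ["Scientist", "Artist", "Athlete", "Politician", "Cleric",
            "SportsManager", "OtherPER"],
    "PROD": ["Clothing", "Vehicle", "Food", "Drink", "OtherPROD"],
    "MED": ["Medication/Vaccine", "MedicalProcedure", "AnatomicalStructure",
            "Symptom", "Disease"],
}

# Inverted view: fine-grained tag -> coarse tag.
FINE_TO_COARSE = {
    fine: coarse
    for coarse, fines in COARSE_TO_FINE.items()
    for fine in fines
}


def to_coarse(tag: str) -> str:
    """Map a fine-grained tag to its coarse tag.

    Assumption: tags may carry a BIO prefix ("B-" or "I-") and "O" marks
    non-entity tokens. Unknown tags are returned unchanged.
    """
    prefix, sep, label = tag.partition("-")
    if sep and prefix in ("B", "I"):
        return f"{prefix}-{FINE_TO_COARSE.get(label, label)}"
    return FINE_TO_COARSE.get(tag, tag)


print(to_coarse("Scientist"))          # PER
print(to_coarse("B-HumanSettlement"))  # B-LOC
print(to_coarse("O"))                  # O
```

Collapsing predictions this way makes it possible to compare the model against systems that only report the six coarse classes.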

For details, please read the APTFiNER paper in the LREC 2026 proceedings.

Model performance:

Precision: 58.96
Recall: 65.32
F1: 61.98
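
As a quick consistency check, F1 is the harmonic mean of precision and recall, and the reported values agree with that relation. The snippet assumes the three scores are percentages computed over the same evaluation set. The card does not say which averaging scheme was used.

```python
# Reported scores, copied from the list above.
precision = 58.96
recall = 65.32

# F1 as the harmonic mean of precision and recall.
f1 = 2 * precision * recall / (precision + recall)

print(round(f1, 2))  # 61.98
```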

Training Parameters:

Epochs: 6
Optimizer: AdamW
Learning Rate: 5e-5
Weight Decay: 0.01
Batch Size: 64
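
The hyperparameters above can be collected into a configuration dictionary. This is a sketch, not the authors' training script. The key names follow the Hugging Face `TrainingArguments` parameters as an assumption about the tooling, `"adamw_torch"` is one possible way to select AdamW, and treating 64 as a per-device batch size is a guess, since the card does not say whether the value is per device or global.

```python
# Hyperparameters from the list above, keyed by (assumed) TrainingArguments names.
hparams = {
    "num_train_epochs": 6,
    "optim": "adamw_torch",             # AdamW; the exact implementation is an assumption
    "learning_rate": 5e-5,
    "weight_decay": 0.01,
    "per_device_train_batch_size": 64,  # assumption: 64 may instead be the global batch size
}

for name, value in hparams.items():
    print(f"{name} = {value}")
```

If the `transformers` library is used, a dictionary like this could be passed as `TrainingArguments(output_dir="out", **hparams)`, where `output_dir` is a placeholder path.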

Contributors

Prachuryya Kaushik
Adittya Gupta
Ajanta Maurya
Gautam Sharma
Prof. V Vijaya Saradhi
Prof. Ashish Anand

APTFiNER is a part of the AWED-FiNER collection. Please check: Paper | Agentic Tool | Interactive Demo

Sample Usage

The AWED-FiNER agentic tool can be used to interact with expert models trained using this framework. Below is an example:

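# Install the client-side dependencies.
pip install smolagents gradio_client

# The import expects a module named tool (tool.py) on the Python path.
from tool import AWEDFiNERTool

# Point the tool at the hosted AWED-FiNER space.
tool = AWEDFiNERTool(
    space_id="prachuryyaIITG/AWED-FiNER"
)

# Run fine-grained NER on a sentence in the given language.
result = tool.forward(
    text="Jude Bellingham joined Real Madrid in 2023.",
    language="English"
)

print(result)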

Citation

If you use this model, please cite the following papers:

@inproceedings{kaushik-etal-2026-aptfiner,
  title = {APTFiNER: Annotation Preserving Translation for Fine-grained Named Entity Recognition},
  author = {Kaushik, Prachuryya and Gupta, Adittya and Maurya, Ajanta and Sharma, Gautam and Saradhi, V. V. and Anand, Ashish},
  booktitle = {Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)},
  month = {May},
  year = {2026},
  pages = {7668--7680},
  address = {Palma, Mallorca, Spain},
  publisher = {European Language Resources Association (ELRA)},
  doi = {10.63317/3w7rv4rg7nty}
}

@misc{kaushik2026awedfineragentswebapplications,
      title={AWED-FiNER: Agents, Web applications, and Expert Detectors for Fine-grained Named Entity Recognition across 36 Languages for 6.6 Billion Speakers}, 
      author={Prachuryya Kaushik and Ashish Anand},
      year={2026},
      eprint={2601.10161},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2601.10161}, 
}

@inproceedings{kaushik2026sampurner,
      title={SampurNER: Fine-Grained Named Entity Recognition Dataset for 22 Indian Languages},
      volume={40},
      url={https://ojs.aaai.org/index.php/AAAI/article/view/40405},
      DOI={10.1609/aaai.v40i37.40405},
      number={37},
      journal={Proceedings of the AAAI Conference on Artificial Intelligence},
      author={Kaushik, Prachuryya and Anand, Ashish},
      year={2026},
      month={Mar.},
      pages={31410-31418}
}


@inproceedings{fetahu2023multiconer,
  title={MultiCoNER v2: a Large Multilingual dataset for Fine-grained and Noisy Named Entity Recognition},
  author={Fetahu, Besnik and Chen, Zhiyu and Kar, Sudipta and Rokhlenko, Oleg and Malmasi, Shervin},
  booktitle={Findings of the Association for Computational Linguistics: EMNLP 2023},
  pages={2027--2051},
  year={2023}
}