lbzg
/

TA-MoELoRA

Initial proposal of a Model card

by JavierSanzCruza - opened Nov 25, 2025

←

Files changed (1) hide show

README.md ADDED Viewed

+---
+language:
+- en
+base_model:
+- codellama/CodeLlama-7b-hf
+pipeline_tag: text-generation
+---
+# Task-Aware MoE LoRA for Universal information Extraction
+This is a novel Universal Information Extraction model. Based on the GoLLIE model (https://huggingface.co/HiTZ/GoLLIE-7B), this model substitutes the LoRA
+adapter by a Mixture of Expert models, with a task-aware router.
+### Model description
+- **Developed by:** Lubingzhi Guo
+- **Institution:** University of Glasgow.
+- **Model type:** Text generation.
+- **Languages:** English
+- **License:** LLaMA2 License for the base and merged model,
+- **Fine-tuned from model:** CODE-LLaMA2 7B (codellama/CodeLlama-7b-hf)
+### Citation
+If you use this model, please cite the following paper:
+> L. Guo, J. Sanz-Cruzado, R. McCreadie. Selecting the Right Experts: Generalizing Information Extraction for Unseen Scenarios via Task-Aware Expert Weighting. 28th European Conference on Artificial Intelligence (ECAI 2025), Bologna, Italy, October 2025, pp. 4161-4168. DOI: https://doi.org/10.3233/FAIA251308