๐ง One Model to Classify, Verify, and Guard โ Meet GLiClass-Instruct
When we first released GLiClass, it was a fast, zero-shot text classifier that could rival cross-encoders at a fraction of the cost. But classification alone wasn't enough. Our real ambition was a single, lightweight model that could handle the diverse range of text-understanding tasks via classification. We are excited to announce GLiClass-Instruct โ a leap forward that transforms GLiClass from a classifier into an instruction-following, multi-task engine. What's new: โช๏ธ Hierarchical labeling: organize labels into structured groups for complex taxonomies โช๏ธ In-context learning via examples: provide few-shot examples to adapt on the fly, no fine-tuning needed โช๏ธ Prompting support: guide classification behavior with natural-language task descriptions โช๏ธ EWC for preventing catastrophic forgetting: add new capabilities without losing old ones โช๏ธ 3x faster inference thanks to FlashDeBERTa New multi-task capabilities: Beyond topic classification and sentiment analysis, GLiClass now supports: โช๏ธ Hallucination detection: verify whether LLM outputs are grounded in context โช๏ธ Rule-following verification: check if content complies with custom guidelines โช๏ธ Safety classification: detect prompt injections, jailbreaks, and harmful requests These tasks are crucial for building reliable and efficient agentic systems, where every LLM output needs to be verified, every user input needs to be screened, and every response needs to follow the rules, all at minimal latency. We release 3 instruction-following models (edge, base, large), with the large model matching SoTA classification models while unlocking entirely new task categories. ๐ Explore more: GitHub repo: https://github.com/Knowledgator/GLiClass Models: https://huggingface.co/knowledgator/gliclass-multitask-large-v1.0 Our other solutions: https://www.knowledgator.com/
When we introduced the **GLiNER bi-encoder** in 2024, it enabled efficient zero-shot NER across hundreds of entity types. But that was just the beginning. Our bigger goal was always clear: **linking text to millions of entities dynamically, without retraining**.
In other words: **true entity linking at scale** โก
This unlocks powerful applications: โช๏ธ More precise search with real-world entity disambiguation โช๏ธ Knowledge graph construction across diverse document collections โช๏ธ Wikification โ turning raw text into richly linked, navigable knowledge
After nearly two years of research + engineering, this vision is now real.
Weโre excited to release **GLinker** โ a **production-ready**, zero-shot entity linking system powered by our novel **GLiNER bi-encoder**. It efficiently detects entity spans of any length and matches them directly to entity descriptions โ **no retraining required**.
Weโre Knowledgator, the team behind open-source NLP models like GLiNER, GLiClass, and many other used for zero-shot text classification and information extraction.
If youโve explored them on Hugging Face or used our frameworks from GitHub, weโd love your input: ๐งฉ Which of our models, like GLiNER or zero-shot classifiers, do you find helpful in your practical workflows? ๐งฉ Howโs the setup, performance, and accuracy been for you? ๐งฉ Anything confusing, buggy, or missing that would make your workflow smoother?
Your feedback helps us improve speed, clarity, and stability for everyone in the open-source community.
๐ Reproducing DeepSeek R1 for Text-to-Graph Extraction
Iโve been working on replicating DeepSeek R1, focusing on zero-shot text-to-graph extractionโa challenging task where LMs extract entities and relations from text based on predefined types.
๐ง Key Insight: Language models struggle when constrained by entity/relation types. Supervised training alone isnโt enough, but reinforcement learning (RL), specifically Guided Reward Policy Optimization (GRPO), shows promise.
๐ก Why GRPO? It trains the model to generate structured graphs, optimizing multiple reward functions (format, JSON validity, and extraction accuracy). It allows the model to learn from both positive and hard negative examples dynamically. RL can be fine-tuned to emphasize relation extraction improvements.
๐ Early Results: Even with limited training, F1 scores consistently improved, and we saw clear benefits from RL-based optimization. More training = better performance!
๐ฌ Next Steps: Weโre scaling up experiments with larger models and high-quality data. Stay tuned for updates! Meanwhile, check out one of our experimental models here: Ihor/Text2Graph-R1-Qwen2.5-0.5b
๐ Welcome the New and Improved GLiNER-Multitask! ๐
Since the release of our beta version, GLiNER-Multitask has received many positive responses. It's been embraced in many consulting, research, and production environments. Thank you everyone for your feedback, it helped us rethink the strengths and weaknesses of the first model and we are excited to present the next iteration of this multi-task information extraction model.
๐ก Whatโs New? Here are the key improvements in this latest version: ๐น Expanded Task Support: Now includes text classification and other new capabilities. ๐น Enhanced Relation Extraction: Significantly improved accuracy and robustness. ๐น Improved Prompt Understanding: Optimized for open-information extraction tasks. ๐น Better Named Entity Recognition (NER): More accurate and reliable results.
๐ง How We Made It Better: These advancements were made possible by: ๐น Leveraging a better and more diverse dataset. ๐น Using a larger backbone model for increased capacity. ๐น Implementing advanced model merging techniques. ๐น Employing self-learning strategies for continuous improvement. ๐น Better training strategies and hyperparameters tuning.
Auto-regressive LMs have ruled, but encoder-based architectures like GLiNER are proving to be just as powerful for information extraction while offering better efficiency and interpretability. ๐โจ
Past encoder backbones were limited by small pre-training datasets and old techniques, but with innovations like LLM2Vec, we've transformed decoders into high-performing encoders! ๐๐ก
Whatโs New? ๐นConverted Llama & Qwen decoders to advanced encoders ๐นImproved GLiNER architecture to be able to work with rotary positional encoding ๐นNew GLiNER (zero-shot NER) & GLiClass (zero-shot classification) models
๐ Meet the new GLiNER architecture ๐ GLiNER revolutionized zero-shot NER by demonstrating that lightweight encoders can achieve excellent results. We're excited to continue R&D with this spirit ๐ฅ. Our new bi-encoder and poly-encoder architectures were developed to address the main limitations of the original GLiNER architecture and bring the following new possibilities:
๐น An unlimited number of entities can be recognized at once. ๐นFaster inference when entity embeddings are preprocessed. ๐นBetter generalization to unseen entities.
While the bi-encoder architecture can lack inter-label understanding, we developed a poly-encoder architecture with post-fusion. It achieves the same or even better results on many benchmarking datasets compared to the original GLiNER, while still offering the listed advantages of bi-encoders. Now, itโs possible to run GLiNER with hundreds of entities much faster and more reliably.