Update README.md
Browse files
README.md
CHANGED
|
@@ -12,22 +12,29 @@ pinned: false
|
|
| 12 |
Welcome to **InfiX-ai**! We are a research-driven organization dedicated to building Collaborative Generative AI (Co-GenAI). Our current focus lies in advancing model collaboration, improving reasoning capabilities, and developing efficient training methods. Our long-term vision is to enable a decentralized future where everyone can access and collaborate with AI models equally to solve complex problems.
|
| 13 |
|
| 14 |
---
|
| 15 |
-
### Models
|
| 16 |
-
#### Reaoning-enhanced Continual Pretrain
|
| 17 |
|
| 18 |
- [InfiR](https://huggingface.co/papers/2502.11573): **InfiR** aims to advance AI systems by improving reasoning, reducing adoption barriers, and addressing privacy concerns through smaller model sizes.
|
| 19 |
|
| 20 |
-
#### Efficient Post-training for Reasoning
|
| 21 |
- [InfiAlign](http://huggingface.co/collections/InfiX-ai/infialign-688600ca363ab9ef0b78f7a6): **InfiAlign** is a scalable and data-efficient post-training framework that combines supervised fine-tuning (SFT) and reinforcement learning (RL) with a high-quality data selection pipeline to enhance reasoning in large language models.
|
| 22 |
|
| 23 |
-
#### Advanced
|
| 24 |
- [InfiGUI](https://huggingface.co/papers/2501.04575): **InfiGUIAgent** is a GUI agent powered by a multimodal large language model, designed to perform multi-step task automation through native reasoning. It is trained with a two-stage supervised pipeline that combines GUI understanding with hierarchical and expectation-reflection reasoning.
|
| 25 |
- [InfiGUI-R1-3B](https://huggingface.co/InfiX-ai/InfiGUI-R1-3B): **InfiGUI-R1** is an MLLM-based GUI agent developed through our Actor2Reasoner framework.
|
| 26 |
|
| 27 |
-
#### Model over Models
|
| 28 |
- [InfiFusion](https://huggingface.co/collections/InfiX-ai/infifusion-683c7d7f00c71614ba8ceb96): **InfiFusion** is a logit-level fusion pipeline based on Universal Logit Distillation, enhanced with Top-K filtering and logits standardization. It supports both pairwise and unified fusion strategies to balance performance and efficiency.
|
| 29 |
- [InfiGFusion](https://huggingface.co/InfiX-ai/InfiGFusion-14B): **InfiGFusion** is a structure-aware extension that builds co-activation graphs from logits and aligns them via an efficient Gromov-Wasserstein loss approximation, capturing cross-dimension semantic dependencies for stronger reasoning.
|
| 30 |
- [InfiFPO](https://huggingface.co/InfiX-ai/InfiFPO-14B): **InfiFPO** is a lightweight fusion method during the preference alignment phase that injects fused model behavior into preference learning, enabling richer signal during DPO-style fine-tuning.
|
| 31 |
|
| 32 |
---
|
| 33 |
### News📰
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 12 |
Welcome to **InfiX-ai**! We are a research-driven organization dedicated to building Collaborative Generative AI (Co-GenAI). Our current focus lies in advancing model collaboration, improving reasoning capabilities, and developing efficient training methods. Our long-term vision is to enable a decentralized future where everyone can access and collaborate with AI models equally to solve complex problems.
|
| 13 |
|
| 14 |
---
|
| 15 |
+
### 🤖 Models
|
| 16 |
+
#### 🧠 Reaoning-enhanced Continual Pretrain
|
| 17 |
|
| 18 |
- [InfiR](https://huggingface.co/papers/2502.11573): **InfiR** aims to advance AI systems by improving reasoning, reducing adoption barriers, and addressing privacy concerns through smaller model sizes.
|
| 19 |
|
| 20 |
+
#### ⚡ Efficient Post-training for Reasoning
|
| 21 |
- [InfiAlign](http://huggingface.co/collections/InfiX-ai/infialign-688600ca363ab9ef0b78f7a6): **InfiAlign** is a scalable and data-efficient post-training framework that combines supervised fine-tuning (SFT) and reinforcement learning (RL) with a high-quality data selection pipeline to enhance reasoning in large language models.
|
| 22 |
|
| 23 |
+
#### 🖥️ Advanced Vision-Native GUI Agent Model
|
| 24 |
- [InfiGUI](https://huggingface.co/papers/2501.04575): **InfiGUIAgent** is a GUI agent powered by a multimodal large language model, designed to perform multi-step task automation through native reasoning. It is trained with a two-stage supervised pipeline that combines GUI understanding with hierarchical and expectation-reflection reasoning.
|
| 25 |
- [InfiGUI-R1-3B](https://huggingface.co/InfiX-ai/InfiGUI-R1-3B): **InfiGUI-R1** is an MLLM-based GUI agent developed through our Actor2Reasoner framework.
|
| 26 |
|
| 27 |
+
#### 🔗 Model over Models
|
| 28 |
- [InfiFusion](https://huggingface.co/collections/InfiX-ai/infifusion-683c7d7f00c71614ba8ceb96): **InfiFusion** is a logit-level fusion pipeline based on Universal Logit Distillation, enhanced with Top-K filtering and logits standardization. It supports both pairwise and unified fusion strategies to balance performance and efficiency.
|
| 29 |
- [InfiGFusion](https://huggingface.co/InfiX-ai/InfiGFusion-14B): **InfiGFusion** is a structure-aware extension that builds co-activation graphs from logits and aligns them via an efficient Gromov-Wasserstein loss approximation, capturing cross-dimension semantic dependencies for stronger reasoning.
|
| 30 |
- [InfiFPO](https://huggingface.co/InfiX-ai/InfiFPO-14B): **InfiFPO** is a lightweight fusion method during the preference alignment phase that injects fused model behavior into preference learning, enabling richer signal during DPO-style fine-tuning.
|
| 31 |
|
| 32 |
---
|
| 33 |
### News📰
|
| 34 |
+
|
| 35 |
+
- 🔥[2025/5/20] Our paper "[InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion
|
| 36 |
+
](https://arxiv.org/abs/2505.13893) released. More information can be found in [the repository](https://github.com/InfiXAI/InfiGFusion). Model is available [here](https://huggingface.co/InfiX-ai/InfiGFusion-14B)
|
| 37 |
+
- 🔥[2025/4/19] Our paper "[InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners](https://arxiv.org/abs/2504.14239)" released. More information can be found in [the repository](https://github.com/Reallm-Labs/InfiGUI-R1).
|
| 38 |
+
- 🔥[2025/1/9] Our paper "[InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection](https://arxiv.org/abs/2501.04575)" released.
|
| 39 |
+
|
| 40 |
+
|