congkai commited on
Commit
8b125cb
·
verified ·
1 Parent(s): b310e5b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -3
README.md CHANGED
@@ -19,10 +19,11 @@ Welcome to **InfiX-ai**! We are a research-driven organization dedicated to buil
19
 
20
  #### ⚡ Efficient Post-training for Reasoning
21
  - [InfiAlign](http://huggingface.co/collections/InfiX-ai/infialign-688600ca363ab9ef0b78f7a6): **InfiAlign** is a scalable and data-efficient post-training framework that combines supervised fine-tuning (SFT) and reinforcement learning (RL) with a high-quality data selection pipeline to enhance reasoning in large language models.
 
22
 
23
  #### 🖥️ Advanced Vision-Native GUI Agent Model
24
- - [InfiGUI](https://huggingface.co/papers/2501.04575): **InfiGUIAgent** is a GUI agent powered by a multimodal large language model, designed to perform multi-step task automation through native reasoning. It is trained with a two-stage supervised pipeline that combines GUI understanding with hierarchical and expectation-reflection reasoning.
25
- - [InfiGUI-R1-3B](https://huggingface.co/InfiX-ai/InfiGUI-R1-3B): **InfiGUI-R1** is an MLLM-based GUI agent developed through our Actor2Reasoner framework.
26
 
27
  #### 🔗 Model over Models
28
  - [InfiFusion](https://huggingface.co/collections/InfiX-ai/infifusion-683c7d7f00c71614ba8ceb96): **InfiFusion** is a logit-level fusion pipeline based on Universal Logit Distillation, enhanced with Top-K filtering and logits standardization. It supports both pairwise and unified fusion strategies to balance performance and efficiency.
@@ -33,7 +34,9 @@ Welcome to **InfiX-ai**! We are a research-driven organization dedicated to buil
33
  ### News📰
34
 
35
  - 🔥[2025/5/20] Our paper "[InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion
36
- ](https://arxiv.org/abs/2505.13893) released. More information can be found in [the repository](https://github.com/InfiXAI/InfiGFusion). Model is available [here](https://huggingface.co/InfiX-ai/InfiGFusion-14B)
 
 
37
  - 🔥[2025/4/19] Our paper "[InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners](https://arxiv.org/abs/2504.14239)" released. More information can be found in [the repository](https://github.com/Reallm-Labs/InfiGUI-R1).
38
  - 🔥[2025/1/9] Our paper "[InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection](https://arxiv.org/abs/2501.04575)" released.
39
 
 
19
 
20
  #### ⚡ Efficient Post-training for Reasoning
21
  - [InfiAlign](http://huggingface.co/collections/InfiX-ai/infialign-688600ca363ab9ef0b78f7a6): **InfiAlign** is a scalable and data-efficient post-training framework that combines supervised fine-tuning (SFT) and reinforcement learning (RL) with a high-quality data selection pipeline to enhance reasoning in large language models.
22
+ - [InfiMMR](https://arxiv.org/abs/2505.23091): **InfiMMR** is a novel three-phase curriculum framework that systematically enhances multimodal reasoning capabilities in small language models through foundational reasoning activation, cross-modal adaptation, and multimodal reasoning enhancement.
23
 
24
  #### 🖥️ Advanced Vision-Native GUI Agent Model
25
+ - [InfiGUIAgent](https://huggingface.co/papers/2501.04575): **InfiGUIAgent** is a GUI agent that embeds native hierarchical and expectation-reflection reasoning through a unique two-stage supervised pipeline, enabling robust, multi-step GUI task automation.
26
+ - [InfiGUI-R1](https://huggingface.co/InfiX-ai/InfiGUI-R1-3B): **InfiGUI-R1** is a GUI agent developed via the Actor2Reasoner framework, which evolves a reactive model into a deliberative reasoner capable of sophisticated planning and error recovery through spatial reasoning distillation and reinforcement learning.
27
 
28
  #### 🔗 Model over Models
29
  - [InfiFusion](https://huggingface.co/collections/InfiX-ai/infifusion-683c7d7f00c71614ba8ceb96): **InfiFusion** is a logit-level fusion pipeline based on Universal Logit Distillation, enhanced with Top-K filtering and logits standardization. It supports both pairwise and unified fusion strategies to balance performance and efficiency.
 
34
  ### News📰
35
 
36
  - 🔥[2025/5/20] Our paper "[InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion
37
+ ](https://arxiv.org/abs/2505.13893)" released. More information can be found in [the repository](https://github.com/InfiXAI/InfiGFusion). Model is available [here](https://huggingface.co/InfiX-ai/InfiGFusion-14B)
38
+ - 🔥[2025/5/20] Our paper "[InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models
39
+ ](https://arxiv.org/abs/2505.13878)" released. More information can be found in [the repository](https://github.com/InfiXAI/InfiFPO). Model is available [here](https://huggingface.co/InfiX-ai/InfiFPO-14B)
40
  - 🔥[2025/4/19] Our paper "[InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners](https://arxiv.org/abs/2504.14239)" released. More information can be found in [the repository](https://github.com/Reallm-Labs/InfiGUI-R1).
41
  - 🔥[2025/1/9] Our paper "[InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection](https://arxiv.org/abs/2501.04575)" released.
42