SeqPE: Transformer with Sequential Position Encoding Paper β’ 2506.13277 β’ Published Jun 16, 2025 β’ 4
A Frustratingly Simple Decoding Method for Neural Text Generation Paper β’ 2305.12675 β’ Published May 22, 2023
GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation Paper β’ 2311.16511 β’ Published Nov 25, 2023 β’ 1
Data Augmentation for Text Generation Without Any Augmented Data Paper β’ 2105.13650 β’ Published May 28, 2021
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models Paper β’ 2401.08294 β’ Published Jan 16, 2024
Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation Paper β’ 2206.02369 β’ Published Jun 6, 2022
ALR$^2$: A Retrieve-then-Reason Framework for Long-context Question Answering Paper β’ 2410.03227 β’ Published Oct 4, 2024
RePo: Language Models with Context Re-Positioning Paper β’ 2512.14391 β’ Published 26 days ago β’ 8
RePo: Language Models with Context Re-Positioning Paper β’ 2512.14391 β’ Published 26 days ago β’ 8
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper β’ 2510.26697 β’ Published Oct 30, 2025 β’ 116