microsoft
/

xdoc-base-websrc

Model card Files Files and versions

xdoc-base-websrc / README.md

estyle's picture

Create README.md

f46dd0a over 3 years ago

|

history blame contribute delete

777 Bytes

	---
	license: mit
	---

	# XDoc
	## Introduction

	XDoc is a unified pre-trained model that deals with different document formats in a single model. With only 36.7% parameters, XDoc achieves comparable or better performance on downstream tasks, which is cost-effective for real-world deployment.

	[XDoc: Unified Pre-training for Cross-Format Document Understanding](https://arxiv.org/abs/2210.02849)
	Jingye Chen, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei, [EMNLP 2022](#)

	## Citation

	If you find XDoc helpful, please cite us:
	```
	@article{chen2022xdoc,
	title={XDoc: Unified Pre-training for Cross-Format Document Understanding},
	author={Chen, Jingye and Lv, Tengchao and Cui, Lei and Zhang, Cha and Wei, Furu},
	journal={arXiv preprint arXiv:2210.02849},
	year={2022}
	}
	```