<!--Copyright 2020 The HuggingFace Team. All rights reserved.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
specific language governing permissions and limitations under the License.
-->
# BARThez

## Overview
The BARThez model was proposed in [BARThez: a Skilled Pretrained French Sequence-to-Sequence Model](https://arxiv.org/abs/2010.12321) by Moussa Kamal Eddine, Antoine J.-P. Tixier, and Michalis Vazirgiannis on 23 Oct 2020.

The abstract of the paper:
*Inductive transfer learning, enabled by self-supervised learning, have taken the entire Natural Language Processing
(NLP) field by storm, with models such as BERT and BART setting new state of the art on countless natural language
understanding tasks. While there are some notable exceptions, most of the available models and research have been
conducted for the English language. In this work, we introduce BARThez, the first BART model for the French language
(to the best of our knowledge). BARThez was pretrained on a very large monolingual French corpus from past research
that we adapted to suit BART's perturbation schemes. Unlike already existing BERT-based French language models such as
CamemBERT and FlauBERT, BARThez is particularly well-suited for generative tasks, since not only its encoder but also
its decoder is pretrained. In addition to discriminative tasks from the FLUE benchmark, we evaluate BARThez on a novel
summarization dataset, OrangeSum, that we release with this paper. We also continue the pretraining of an already
pretrained multilingual BART on BARThez's corpus, and we show that the resulting model, which we call mBARTHez,
provides a significant boost over vanilla BARThez, and is on par with or outperforms CamemBERT and FlauBERT.*
This model was contributed by [moussakam](https://huggingface.co/moussakam). The authors' code can be found [here](https://github.com/moussaKam/BARThez).
### Examples

- BARThez can be fine-tuned on sequence-to-sequence tasks in the same way as BART; see
  [examples/pytorch/summarization/](https://github.com/huggingface/transformers/tree/main/examples/pytorch/summarization). A minimal inference sketch follows below.
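As a quick illustration, the snippet below runs abstractive summarization with BARThez. It is a minimal sketch, assuming the `moussakam/barthez-orangesum-abstract` checkpoint (a BARThez model fine-tuned on OrangeSum) is available on the Hub; the example article text is invented, and any other BARThez seq2seq checkpoint can be substituted.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Assumed checkpoint: a BARThez model fine-tuned for OrangeSum abstract generation.
checkpoint = "moussakam/barthez-orangesum-abstract"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

# Hypothetical French news snippet used only for illustration.
article = (
    "Citant ses clients, l'opérateur a annoncé jeudi le lancement d'une "
    "nouvelle offre de fibre optique dans plusieurs grandes villes françaises."
)

# Tokenize the article and generate a short abstractive summary with beam search.
inputs = tokenizer(article, return_tensors="pt", truncation=True, max_length=1024)
summary_ids = model.generate(**inputs, num_beams=4, max_length=64)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```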
## BarthezTokenizer

[[autodoc]] BarthezTokenizer

## BarthezTokenizerFast

[[autodoc]] BarthezTokenizerFast
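As a usage note for both tokenizer classes above, the snippet below loads the slow and fast variants from the same checkpoint. It is a minimal sketch, assuming the base `moussakam/barthez` checkpoint released by the authors; both classes wrap the same SentencePiece vocabulary.

```python
from transformers import BarthezTokenizer, BarthezTokenizerFast

# Assumed checkpoint: the base BARThez model released by the authors.
checkpoint = "moussakam/barthez"

# The slow (SentencePiece-based) and fast (tokenizers-backed) variants load
# from the same files and produce the same encoding.
tokenizer = BarthezTokenizer.from_pretrained(checkpoint)
fast_tokenizer = BarthezTokenizerFast.from_pretrained(checkpoint)

ids = tokenizer("Salut, comment ça va ?")["input_ids"]
assert ids == fast_tokenizer("Salut, comment ça va ?")["input_ids"]
print(tokenizer.convert_ids_to_tokens(ids))
```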