Pclanglais committed on
Commit 4d6dba6 · verified · 1 Parent(s): 480139d

Create README.md

Files changed (1)
  1. README.md +18 -0
README.md ADDED
@@ -0,0 +1,18 @@
+ ---
+ license: apache-2.0
+ language:
+ - en
+ - fr
+ - de
+ - es
+ pipeline_tag: token-classification
+ ---
+ **BibTexer** is a specialized language model trained by PleIAs for the structured extraction of bibliographies in BibTeX format.
+
+ BibTexer acts like a reversed Zotero: given an unstructured list of references, the model returns a series of BibTeX entries that can be loaded into any bibliographic database.
+
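As a rough illustration of what "loading" such entries could look like downstream, the sketch below parses flat BibTeX text into Python dictionaries using only the standard library. The sample entries are written by hand for this example and are not actual BibTexer output; the `parse_bibtex` helper is hypothetical and only handles simple brace-delimited fields without nested braces.

```python
import re

# Hand-written sample for illustration only; NOT actual BibTexer output.
SAMPLE = """
@book{knuth1984,
  author = {Donald E. Knuth},
  title = {The TeXbook},
  publisher = {Addison-Wesley},
  year = {1984}
}

@article{shannon1948,
  author = {Claude E. Shannon},
  title = {A Mathematical Theory of Communication},
  journal = {Bell System Technical Journal},
  year = {1948}
}
"""

# One entry: "@type{key," followed by a body that ends at a "}" on its own line.
ENTRY_RE = re.compile(r"@(\w+)\s*\{\s*([^,\s]+)\s*,(.*?)\n\}", re.DOTALL)
# One field: name = {value}, where the value contains no nested braces.
FIELD_RE = re.compile(r"(\w+)\s*=\s*\{([^{}]*)\}")


def parse_bibtex(text):
    """Parse simple BibTeX entries (hypothetical helper, flat fields only)."""
    entries = []
    for entry_type, key, body in ENTRY_RE.findall(text):
        fields = {
            name.lower(): value.strip()
            for name, value in FIELD_RE.findall(body)
        }
        entries.append({"type": entry_type.lower(), "key": key, "fields": fields})
    return entries


entries = parse_bibtex(SAMPLE)
for entry in entries:
    print(entry["type"], entry["key"], entry["fields"].get("year"))
```

For real bibliographies (nested braces, quoted values, `@string` macros), a dedicated BibTeX parser or a reference manager's import feature is likely a safer choice than this regex-based sketch.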
+ Like all models from the PleIAs Bad Data Toolbox, BibTexer has been deliberately trained on diverse and challenging data sources, covering nearly all the styles featured in Zotero, as well as examples of broken text sources (line breaks, digitization artifacts).
+
+ BibTexer has been trained on multilingual styles and formats and should work correctly on most European languages.
+
+ ## Example