Commit History

hardcoded some stuff don't mind me
156546b

mjuvilla commited on

wip
ee8848f

mjuvilla commited on

wip
4bd3d3d

mjuvilla commited on

wip
9a08d44

mjuvilla commited on

updated gradio app to support two different models of translator (with and without fastalign), refactored a bit of the code as well
d93743a

mjuvilla commited on

added more comments
abb96fe

mjuvilla commited on

re-organised the code a bit, tried to use batch processing, didn't work
0078c2b

mjuvilla commited on

Working version using the non-quantized version of the model. Splits paragraphs and sentences, translates them and then reconstructs the paragraph.
4ad157a

mjuvilla commited on

fixed error, added appropiate prompt
7e69fd3

mjuvilla commited on

wip
d8a47d0

mjuvilla commited on

wip
c4d3881

mjuvilla commited on

forgot to add this file in the previous commit
02b96b1

mjuvilla commited on

created classes for running the translation models either from a local model or a huggingface endpoint. for now main.py only supports local models
74b84b5

mjuvilla commited on

separated the translator that uses the HF endpoint
e3ab4a2

mjuvilla commited on

fixed a couple of bugs
1d63d00

mjuvilla commited on

wip
596034b

mjuvilla commited on

Improved feedback to the user
5ba0b27

mjuvilla commited on

Fixed some bugs, added dockerfile
687559d

mjuvilla commited on

Fixed a lot of error, now the script should crash much less often.
a961430

mjuvilla commited on

changed nltk tokenizer to multilingual tokenizers
8f73cdd

mjuvilla commited on

Fixed bug when processing docx files
975e338

mjuvilla commited on

added language dropdown menus to document translation tab
d4f1fb2

mjuvilla commited on

enforce the usage of the hf token
1e1b3f2

mjuvilla commited on

turns out odts work a bit differently and runs could have more than one tag id, which complicates things quite a lot
59de1ea

mjuvilla commited on

Updated readme and added salamandraTA7b translator class
dd9512b

mjuvilla commited on

integrated any-doc into the gradle app, separated the translation side to make it easier to implement other translation models
306a4fd

mjuvilla commited on

forgot to remove placeholder text
809a6a0

mjuvilla commited on

moved scripts to src folder, created new create that hopefully should be able to work with any type of document
ea5f41e

mjuvilla commited on