Ahmet Yildirim
committed on
Commit
·
2237cb4
1
Parent(s):
1b6c21d
- Update readme
README.md
CHANGED
|
@@ -73,7 +73,7 @@ These functions receive similar parameters.
|
|
| 73 |
| output\_directory | yes | yes | | None | to output recursively in output\_directory. The written files will have extension ".tagged" or ".lang" according to the function called. |
|
| 74 |
| one\_sentence\_per\_line | yes | yes | True / False | False | not to apply sentence boundary detection and consider each line as a sentence in the input or the input file(s). |
|
| 75 |
| lang\_per\_sentence| yes | no | True / False | False | identify the language per sentence and output the tags according to the language identified for that sentence. If this is not set, and lang is "au" then the whole input (or a file if input\_directory is used) is used to identify the language. |
|
| 76 |
-
| write\_output\_to | yes | yes | a file path, a file handle, or "list" | sys.stdout | to specify where to write the output. If a file path is provided, the output will be written to that file. The file is overwritten. If a file handle is provided, then the output is written
|
| 77 |
| output\_tsv | yes | yes | True/False | False | to specify the output format. The default is the json format. If multiple sentences exist, each line is a single valid json but not the whole output. This option cannot be used along with write\_output\_to="list" |
|
| 78 |
| lang\_per\_item | no | yes | True/False | False | consider each item in the list given as separate input for language identification. |
|
| 79 |
| fast\_mode | no | yes | True/False | False | identify languages of the files in the input directory in fast mode. This mode uses only the beginning of each file for identification. It is much faster for many files but less accurate than when this parameter is set to False. |
|
|
|
|
| 73 |
| output\_directory | yes | yes | | None | to output recursively in output\_directory. The written files will have extension ".tagged" or ".lang" according to the function called. |
|
| 74 |
| one\_sentence\_per\_line | yes | yes | True / False | False | not to apply sentence boundary detection and consider each line as a sentence in the input or the input file(s). |
|
| 75 |
| lang\_per\_sentence| yes | no | True / False | False | identify the language per sentence and output the tags according to the language identified for that sentence. If this is not set, and lang is "au" then the whole input (or a file if input\_directory is used) is used to identify the language. |
|
| 76 |
+
| write\_output\_to | yes | yes | a file path, a file handle, or "list" | sys.stdout | to specify where to write the output. If a file path is provided, the output is written to that file, and the file is overwritten. If a file handle is provided, the output is written there. If "list" is given, the function returns a Python list. |
|
| 77 |
| output\_tsv | yes | yes | True/False | False | to specify the output format. The default is the json format. If multiple sentences exist, each line is a single valid json but not the whole output. This option cannot be used along with write\_output\_to="list" |
|
| 78 |
| lang\_per\_item | no | yes | True/False | False | consider each item in the list given as separate input for language identification. |
|
| 79 |
| fast\_mode | no | yes | True/False | False | identify languages of the files in the input directory in fast mode. This mode uses only the beginning of each file for identification. It is much faster for many files but less accurate than when this parameter is set to False. |
|
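
The three accepted forms of write\_output\_to (a file path, a file handle, or "list") can be illustrated with a minimal sketch. The `emit` function below is a hypothetical stand-in written for this illustration and is not part of the library. It only assumes the behaviour described in the table: a path is overwritten, a handle is written to, "list" returns a Python list, and the default JSON output is one JSON document per line.

```python
import io
import json
import sys


def emit(records, write_output_to=sys.stdout):
    """Hypothetical helper mimicking the documented write_output_to options.

    records: an iterable of JSON-serialisable items (for example, one per sentence).
    """
    if write_output_to == "list":
        # "list": return the results instead of writing them anywhere.
        return list(records)

    # One JSON document per line, so each line is valid JSON on its own.
    lines = "".join(json.dumps(record) + "\n" for record in records)

    if isinstance(write_output_to, str):
        # A file path: opened in "w" mode, so an existing file is overwritten.
        with open(write_output_to, "w", encoding="utf-8") as handle:
            handle.write(lines)
    else:
        # A file handle (sys.stdout by default): write to it directly.
        write_output_to.write(lines)
    return None


# Usage: capture line-delimited JSON in memory and parse it line by line.
buffer = io.StringIO()
emit([{"sentence": 1}, {"sentence": 2}], write_output_to=buffer)
parsed = [json.loads(line) for line in buffer.getvalue().splitlines()]
```

Because each line is a separate JSON document, a consumer should parse the output line by line, as above, rather than passing the whole output to `json.loads` at once.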