BEREL-seg: TBD
State-of-the-art language model for Rabbinic Hebrew, released [here] - add link.
This model is fine-tuned from BEREL_3.0 for the prefix segmentation task.
Sample usage:
from transformers import AutoModel, AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained('dicta-il/BEREL-seg')
model = AutoModel.from_pretrained('dicta-il/BEREL-seg', trust_remote_code=True)
model.eval()
sentence = 'ืืื ืืฉืื ืืจืืืดื ืืคืืจืืฉื ืขื ืืชืืจื, ืฉืืืืจ ืืืืข ืืืคืืจืกื ืืื ืืขืื ืืขืืื ืฉืืื ืืืงืจื ืืืฆื ืืืื ืคืฉืืื ืืฃ ืขื ืคื ืฉืืืจืฉ ืืืช.'
print(model.predict([sentence], tokenizer))
Output:
[
[
[ "[CLS]" ],
[ "ื", "ืื" ],
[ "ืืฉืื" ],
[ "ื", "ืจืื\"ื" ],
[ "ื", "ืคืืจืืฉื" ],
[ "ืขื" ],
[ "ื", "ืชืืจื" ],
[ ", " ],
[ "ืฉืื", "ืืจ" ],
[ "ืืืืข" ],
[ "ื", "ืืคืืจืกื" ],
[ "ื", "ืื" ],
[ "ืืขืื" ],
[ "ื", "ืขืืื" ],
[ "ืฉ", "ืืื" ],
[ "ื", "ืืงืจื" ],
[ "ืืืฆื" ],
[ "ืคืฉืืื" ],
[ "ืืฃ" ],
[ "ืขื" ],
[ "ืคื" ],
[ "ืฉืื", "ืจืฉ" ],
[ "ืืืช" ],
[ "." ],
[ "[SEP]" ]
]
]
Citation
If you use BEREL-seg in your research, please cite tbd
BibTeX:
tbd
License
This work is licensed under a Creative Commons Attribution 4.0 International License.
- Downloads last month
- -
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support
Model tree for dicta-il/BEREL-seg
Base model
dicta-il/BEREL_3.0