just record your voice and send to the model
Mohamed Rashad PRO
AI & ML interests
Computer Vision, Robotics, Natural Language Processing
Recent Activity
new activity
6 days ago
MohamedRashad/PersonaPlex:add audio file upload instead of forcing users to use their mic
updated
a Space
6 days ago
MohamedRashad/PersonaPlex
Organizations
replied to
their
post
2 days ago
Post
689
I made a demo for the latest PersonaPlex model from nvidia, Try it out here:
MohamedRashad/PersonaPlex
MohamedRashad/PersonaPlex
posted
an
update
8 days ago
Post
689
I made a demo for the latest PersonaPlex model from nvidia, Try it out here:
MohamedRashad/PersonaPlex
MohamedRashad/PersonaPlex
Post
3408
I have update my https://huggingface.co/collections/MohamedRashad/arabic-speech-datasets
with new datasets, making the full audio data more than 3000 hours of good arabic speech.
Feel Free to use it in your new innovations, And happy new year!
with new datasets, making the full audio data more than 3000 hours of good arabic speech.
Feel Free to use it in your new innovations, And happy new year!
posted
an
update
28 days ago
Post
3408
I have update my https://huggingface.co/collections/MohamedRashad/arabic-speech-datasets
with new datasets, making the full audio data more than 3000 hours of good arabic speech.
Feel Free to use it in your new innovations, And happy new year!
with new datasets, making the full audio data more than 3000 hours of good arabic speech.
Feel Free to use it in your new innovations, And happy new year!
replied to
their
post
5 months ago
the output of the model is json. that's what is crazy about it in my opinion
Post
3279
If someone is interested in trying the new
rednote-hilab/dots.ocr model. I made this space for you:
MohamedRashad/Dots-OCR
MohamedRashad/Dots-OCR
posted
an
update
6 months ago
Post
3279
If someone is interested in trying the new
rednote-hilab/dots.ocr model. I made this space for you:
MohamedRashad/Dots-OCR
MohamedRashad/Dots-OCR
Post
1929
For anyone who wants to try the new Voxtral models, you can do this from here:
MohamedRashad/Voxtral
Also you can find the transformers version of them here:
MohamedRashad/Voxtral-Mini-3B-2507-transformers
MohamedRashad/Voxtral-Small-24B-2507-transformers
MohamedRashad/Voxtral
Also you can find the transformers version of them here:
MohamedRashad/Voxtral-Mini-3B-2507-transformers
MohamedRashad/Voxtral-Small-24B-2507-transformers
posted
an
update
6 months ago
Post
1929
For anyone who wants to try the new Voxtral models, you can do this from here:
MohamedRashad/Voxtral
Also you can find the transformers version of them here:
MohamedRashad/Voxtral-Mini-3B-2507-transformers
MohamedRashad/Voxtral-Small-24B-2507-transformers
MohamedRashad/Voxtral
Also you can find the transformers version of them here:
MohamedRashad/Voxtral-Mini-3B-2507-transformers
MohamedRashad/Voxtral-Small-24B-2507-transformers
posted
an
update
8 months ago
Post
1898
I think we just got the best Image to Markdown VLM out there and it's hosted here:
MohamedRashad/Nanonets-OCR
MohamedRashad/Nanonets-OCR
Post
392
I just updated an old (non working) space i had with the implementation of a cool research paper named UniRig
The idea is that you upload any 3d model and it rigs it for you with correct armature and the skinning process to give you the final model fully rigged and ready to be used.
Check it out here:
MohamedRashad/UniRig
The idea is that you upload any 3d model and it rigs it for you with correct armature and the skinning process to give you the final model fully rigged and ready to be used.
Check it out here:
MohamedRashad/UniRig
posted
an
update
8 months ago
Post
392
I just updated an old (non working) space i had with the implementation of a cool research paper named UniRig
The idea is that you upload any 3d model and it rigs it for you with correct armature and the skinning process to give you the final model fully rigged and ready to be used.
Check it out here:
MohamedRashad/UniRig
The idea is that you upload any 3d model and it rigs it for you with correct armature and the skinning process to give you the final model fully rigged and ready to be used.
Check it out here:
MohamedRashad/UniRig
posted
an
update
9 months ago
Post
1095
I have processed and cleaned the famous SADA2022 dataset from SADIA for Arabic ASR and other related tasks and uploaded it here:
MohamedRashad/SADA22
Edit:
I also added another dataset from SADIA named SCC22
MohamedRashad/SCC22
MohamedRashad/SADA22
Edit:
I also added another dataset from SADIA named SCC22
MohamedRashad/SCC22
replied to
their
post
10 months ago
Speech data in audio and text format
replied to
their
post
10 months ago
Start with gathering high quality data first. This is by far the biggest hurdle against TTS systems out there.
posted
an
update
10 months ago
Post
2726
I collected the recitations of the holy quran from 20 different reciters and uploaded the full dataset here:
MohamedRashad/Quran-Recitations
Check it out ๐ฅท
MohamedRashad/Quran-Recitations
Check it out ๐ฅท
Post
2173
For those interested in trying the new
canopylabs/orpheus-3b-0.1-ft model i made a space for you:
MohamedRashad/Orpheus-TTS
MohamedRashad/Orpheus-TTS
posted
an
update
10 months ago
Post
2173
For those interested in trying the new
canopylabs/orpheus-3b-0.1-ft model i made a space for you:
MohamedRashad/Orpheus-TTS
MohamedRashad/Orpheus-TTS
Post
3540
I think we have released the best Arabic model under 25B at least based on https://huggingface.co/spaces/inceptionai/AraGen-Leaderboard
Yehia = https://huggingface.co/ALLaM-AI/ALLaM-7B-Instruct-preview + GRPO
and its ranked number one model under the 25B parameter size mark.
Now, i said "i think" not "i am sure" because this model used the same metric of evaluation the AraGen developers use (the 3C3H) as a reward model to improve its responses and this sparks the question. Is this something good for users or is it another type of overfitting that we don't want ?
I don't know if this is a good thing or a bad thing but what i know is that you can try it from here:
Navid-AI/Yehia-7B-preview
or Download it for your personal experiments from here:
Navid-AI/Yehia-7B-preview
Ramadan Kareem ๐
Yehia = https://huggingface.co/ALLaM-AI/ALLaM-7B-Instruct-preview + GRPO
and its ranked number one model under the 25B parameter size mark.
Now, i said "i think" not "i am sure" because this model used the same metric of evaluation the AraGen developers use (the 3C3H) as a reward model to improve its responses and this sparks the question. Is this something good for users or is it another type of overfitting that we don't want ?
I don't know if this is a good thing or a bad thing but what i know is that you can try it from here:
Navid-AI/Yehia-7B-preview
or Download it for your personal experiments from here:
Navid-AI/Yehia-7B-preview
Ramadan Kareem ๐