Major Marc
ohhimarc
AI & ML interests
None yet
Recent Activity
updated a collection 25 days ago: translation
updated a collection 2 months ago: embeddings
updated a collection 3 months ago: embeddings
Organizations
None yet
Inference APIs
translation
LLMs
DiscoResearch/DiscoLM_German_7b_v1
Text Generation • 7B • 365 • 67
DiscoResearch/Llama3-DiscoLeo-Instruct-8B-v0.1-4bit-awq
Text Generation • 8B • 7
google/gemma-2b-it
Text Generation • 3B • 80.8k • 876
google/gemma-2-9b-it
Text Generation • 9B • 509k • 795
OCR
openbmb/MiniCPM-o-2_6
Any-to-Any • 9B • 240k • 1.29k
microsoft/trocr-large-printed
Image-to-Text • 0.6B • 134k • 179
NAMAA-Space/Qari-OCR-0.2.2.1-VL-2B-Instruct
Image-Text-to-Text • 9.19k • 23
oddadmix/Qari-OCR-0.2.2.1-VL-2B-Instruct-merged
Image-Text-to-Text • 2B • 145 • 1
embeddings
sentence-transformers/paraphrase-multilingual-mpnet-base-v2
Sentence Similarity • 0.3B • 5.23M • 459
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
Sentence Similarity • 0.1B • 44.9M • 1.22k
jinaai/jina-embeddings-v4
Visual Document Retrieval • 4B • 467k • 510
BAAI/bge-m3
Sentence Similarity • 21M • 2.98k
cv
sentiment_analysis
citizenlab/distilbert-base-multilingual-cased-toxicity
Text Classification • 21.6k • 22
oliverguhr/german-sentiment-bert
Text Classification • 0.1B • 184k • 69
textdetox/xlmr-large-toxicity-classifier
Text Classification • 0.3B • 34.5k • 17
gokceuludogan/convbert-base-turkish-mc4-toxicity-uncased
Text Classification • 79 • 3
forecasting
transcription
summarization
coding
ASR
openai/whisper-large-v3
Automatic Speech Recognition • 2B • 4.98M • 5.66k
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • 7.65M • 2.99k
nvidia/canary-1b
Automatic Speech Recognition • 2.53k • 457
badrex/mms-300m-arabic-dialect-identifier
Audio Classification • 0.3B • 3.47k • 8
NER
mdarhri00/named-entity-recognition
Token Classification • 35 • 50
eventdata-utd/conflibert-named-entity-recognition
Token Classification • 0.1B • 53 • 10
tomaarsen/span-marker-xlm-roberta-base-multinerd
Token Classification • 0.3B • 71 • 36
NAMAA-Space/gliner_arabic-v2.1
Token Classification • 198 • 16
text correction