zeroMN/SHMT

zeroMN committed on Jan 6, 2025

Commit 3408d82 · verified · 1 Parent(s): 3019492

Update README.md

Files changed (1)

README.md +1 -2

README.md CHANGED

@@ -76,8 +76,7 @@ The model can be fine-tuned for specific tasks such as visual question answering
 ### Out-of-Scope Use
-The `Evolutionary Multi-Modal Model` model is not designed for tasks that require highly specialized knowledge or domain-specific expertise beyond its current capabilities. It may not perform well on tasks that require fine-grained recognition or highly specialized audio processing.
 ## Bias, Risks, and Limitations
 ### Recommendations

 ### Out-of-Scope Use
+The Evolved Multimodal Model is not suitable for tasks that require highly specialized or domain-specific expertise beyond its current capabilities. Users still need to tune the number of speech frames themselves.
 ## Bias, Risks, and Limitations
 ### Recommendations