| --- |
| license: apache-2.0 |
| --- |
| |
| To reverse engineer the model, you need to download it to: runtime/models/MiniCPM-V-4_5 |
| https://modelscope.ai/models/OpenBMB/MiniCPM-V-4_5/files |
|
|
| You need to extract caption_python.7z to the runtime directory. This is the Python environment. Due to the large number of subfiles, it can only be uploaded as a compressed package. If you don't want to download it or feel it's risky, you can download Codex and have it download a new environment for you. |
| |
| --- |
| Detailed tutorial: https://youtu.be/h27Sedb_v08 |
|
|
| Features: |
|
|
| 1. Edits videos to the frame rate/resolution needed for training. |
|
|
| 2. Includes cropping functionality to remove unwanted subtitles/black borders. |
|
|
| 3. Offers faster frame range selection. |
|
|
| 4. Records timeline, cropping box, and cue words for each data point, allowing for easy secondary editing without the need for manual adjustments and proofreading required by traditional editing tools. |
|
|
| 5. Includes a cue word derivation function. Requires 16GB of VRAM for local operation. Low VRAM mode can be enabled in the settings if VRAM is insufficient. |
|
|
| 6. English language can be enabled in the settings. |
|
|
| 7. Supports batch conversion of frame rate/resolution for existing datasets. |
|
|
|
|
|
|
|  |
|
|