Update README.md
README.md
CHANGED
|
@@ -117,11 +117,11 @@ Benchmarked on Xiaomi 17 Pro Max.
|
|
| 117 |
<td><p style="text-align: left">GPU</p></td>
|
| 118 |
<td><p style="text-align: left">dynamic_int8</p></td>
|
| 119 |
<td><p style="text-align: right">1280</p></td>
|
| 120 |
-
<td><p style="text-align: right">
|
| 121 |
-
<td><p style="text-align: right">
|
| 122 |
-
<td><p style="text-align: right">
|
| 123 |
-
<td><p style="text-align: right">
|
| 124 |
-
<td><p style="text-align: right">
|
| 125 |
<td><p style="text-align: left"><a style="text-decoration: none" href="https://huggingface.co/litert-community/FastVLM-0.5B/resolve/main/FastVLM-0.5B.litertlm">🔗</a></p></td>
|
| 126 |
</tr>
|
| 127 |
|
|
@@ -129,11 +129,11 @@ Benchmarked on Xiaomi 17 Pro Max.
|
|
| 129 |
<td><p style="text-align: left">NPU</p></td>
|
| 130 |
<td><p style="text-align: left">dynamic_int8</p></td>
|
| 131 |
<td><p style="text-align: right">1280</p></td>
|
| 132 |
-
<td><p style="text-align: right">
|
| 133 |
-
<td><p style="text-align: right">
|
| 134 |
-
<td><p style="text-align: right">
|
| 135 |
-
<td><p style="text-align: right">
|
| 136 |
-
<td><p style="text-align: right">
|
| 137 |
<td><p style="text-align: left"><a style="text-decoration: none" href="https://huggingface.co/litert-community/FastVLM-0.5B/resolve/main/FastVLM-0.5B.sm8850.litertlm">🔗</a></p></td>
|
| 138 |
</tr>
|
| 139 |
|
|
@@ -143,6 +143,7 @@ Benchmarked on Xiaomi 17 Pro Max.
|
|
| 143 |
|
| 144 |
Notes:
|
| 145 |
* Model Size: measured by the size of the file on disk.
|
|
|
|
| 146 |
* Benchmark is run with cache enabled and initialized. During the first run, the latency and memory usage may differ.
|
| 147 |
|
| 148 |
|
|
|
|
| 117 |
<td><p style="text-align: left">GPU</p></td>
|
| 118 |
<td><p style="text-align: left">dynamic_int8</p></td>
|
| 119 |
<td><p style="text-align: right">1280</p></td>
|
| 120 |
+
<td><p style="text-align: right">2,220 tk/s</p></td>
|
| 121 |
+
<td><p style="text-align: right">64 tk/s</p></td>
|
| 122 |
+
<td><p style="text-align: right">0.55 s</p></td>
|
| 123 |
+
<td><p style="text-align: right">813 MB</p></td>
|
| 124 |
+
<td><p style="text-align: right">1103 MB</p></td>
|
| 125 |
<td><p style="text-align: left"><a style="text-decoration: none" href="https://huggingface.co/litert-community/FastVLM-0.5B/resolve/main/FastVLM-0.5B.litertlm">🔗</a></p></td>
|
| 126 |
</tr>
|
| 127 |
|
|
|
|
| 129 |
<td><p style="text-align: left">NPU</p></td>
|
| 130 |
<td><p style="text-align: left">dynamic_int8</p></td>
|
| 131 |
<td><p style="text-align: right">1280</p></td>
|
| 132 |
+
<td><p style="text-align: right">11,272 tk/s</p></td>
|
| 133 |
+
<td><p style="text-align: right">106 tk/s</p></td>
|
| 134 |
+
<td><p style="text-align: right">0.12 s</p></td>
|
| 135 |
+
<td><p style="text-align: right">616 MB</p></td>
|
| 136 |
+
<td><p style="text-align: right">899 MB</p></td>
|
| 137 |
<td><p style="text-align: left"><a style="text-decoration: none" href="https://huggingface.co/litert-community/FastVLM-0.5B/resolve/main/FastVLM-0.5B.sm8850.litertlm">🔗</a></p></td>
|
| 138 |
</tr>
|
| 139 |
|
|
|
|
| 143 |
|
| 144 |
Notes:
|
| 145 |
* Model Size: measured by the size of the file on disk.
|
| 146 |
+
* TTFT includes encoding time for 1 image and corresponding text prompt.
|
| 147 |
* Benchmark is run with cache enabled and initialized. During the first run, the latency and memory usage may differ.
|
| 148 |
|
| 149 |
|
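
To put the added figures in perspective, the following is a minimal sketch that computes the NPU-to-GPU ratios from the values in the `+` rows of this change. The column meanings are an assumption, since the table header is not part of this diff: the two `tk/s` columns are taken to be prefill and decode throughput, and the seconds column is taken to be TTFT. The dictionary keys and the `speedup` helper are illustrative names, not part of the README.

```python
# Values copied from the "+" rows of this diff (benchmarked on Xiaomi 17 Pro Max).
# ASSUMPTION: the table header is not shown in the diff, so the metric names
# below (prefill/decode throughput, TTFT) are inferred from the units.
rows = {
    "GPU": {"prefill_tps": 2220, "decode_tps": 64, "ttft_s": 0.55},
    "NPU": {"prefill_tps": 11272, "decode_tps": 106, "ttft_s": 0.12},
}


def speedup(metric, higher_is_better=True):
    """Return how many times better the NPU row is than the GPU row."""
    gpu, npu = rows["GPU"][metric], rows["NPU"][metric]
    # For throughput, larger is better; for latency, smaller is better.
    return npu / gpu if higher_is_better else gpu / npu


ratios = {
    "prefill_tps": speedup("prefill_tps"),
    "decode_tps": speedup("decode_tps"),
    "ttft_s": speedup("ttft_s", higher_is_better=False),
}

for name, ratio in ratios.items():
    print(f"{name}: NPU advantage {ratio:.2f}x")
```

Under these assumptions the gap is largest in the first `tk/s` column and in TTFT, and much smaller in the second `tk/s` column. These ratios apply only to the warmed-cache runs described in the notes.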