shirakiin committed on
Commit 5a8654b · verified
1 Parent(s): 8c9d7e7

Update README.md

Files changed (1)
  1. README.md +11 -10
README.md CHANGED
@@ -117,11 +117,11 @@ Benchmarked on Xiaomi 17 Pro Max.
  <td><p style="text-align: left">GPU</p></td>
  <td><p style="text-align: left">dynamic_int8</p></td>
  <td><p style="text-align: right">1280</p></td>
- <td><p style="text-align: right">- tk/s</p></td>
- <td><p style="text-align: right">- tk/s</p></td>
- <td><p style="text-align: right">- s</p></td>
- <td><p style="text-align: right">- MB</p></td>
- <td><p style="text-align: right">- MB</p></td>
+ <td><p style="text-align: right">2,220 tk/s</p></td>
+ <td><p style="text-align: right">64 tk/s</p></td>
+ <td><p style="text-align: right">0.55 s</p></td>
+ <td><p style="text-align: right">813 MB</p></td>
+ <td><p style="text-align: right">1103 MB</p></td>
  <td><p style="text-align: left"><a style="text-decoration: none" href="https://huggingface.co/litert-community/FastVLM-0.5B/resolve/main/FastVLM-0.5B.litertlm">&#128279;</a></p></td>
  </tr>
 
@@ -129,11 +129,11 @@ Benchmarked on Xiaomi 17 Pro Max.
  <td><p style="text-align: left">NPU</p></td>
  <td><p style="text-align: left">dynamic_int8</p></td>
  <td><p style="text-align: right">1280</p></td>
- <td><p style="text-align: right">- tk/s</p></td>
- <td><p style="text-align: right">- tk/s</p></td>
- <td><p style="text-align: right">- s</p></td>
- <td><p style="text-align: right">- MB</p></td>
- <td><p style="text-align: right">- MB</p></td>
+ <td><p style="text-align: right">11,272 tk/s</p></td>
+ <td><p style="text-align: right">106 tk/s</p></td>
+ <td><p style="text-align: right">0.12 s</p></td>
+ <td><p style="text-align: right">616 MB</p></td>
+ <td><p style="text-align: right">899 MB</p></td>
  <td><p style="text-align: left"><a style="text-decoration: none" href="https://huggingface.co/litert-community/FastVLM-0.5B/resolve/main/FastVLM-0.5B.sm8850.litertlm">&#128279;</a></p></td>
  </tr>
 
@@ -143,6 +143,7 @@ Benchmarked on Xiaomi 17 Pro Max.
 
  Notes:
  * Model Size: measured by the size of the file on disk.
+ * TTFT includes encoding time for 1 image and corresponding text prompt.
  * Benchmark is run with cache enabled and initialized. During the first run, the latency and memory usage may differ.
 
 
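To put the GPU and NPU rows added in this commit side by side, the ratios can be computed with a short Python sketch. The table header lies outside the diff hunks, so the column meanings here are assumptions: the two `tk/s` columns are taken to be prefill and decode throughput, the `s` column to be TTFT (as the Notes suggest), and the two `MB` columns to be model size and peak memory. The names `BenchmarkRow` and `compare` are hypothetical helpers, not part of the README.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BenchmarkRow:
    backend: str
    prefill_tps: float  # first "tk/s" column (assumed to be prefill)
    decode_tps: float   # second "tk/s" column (assumed to be decode)
    ttft_s: float       # the "s" column (TTFT, per the Notes)
    size_mb: float      # first "MB" column (assumed to be model size)
    memory_mb: float    # second "MB" column (assumed to be peak memory)


# Values copied from the "+" rows of this commit.
GPU = BenchmarkRow("GPU", 2220, 64, 0.55, 813, 1103)
NPU = BenchmarkRow("NPU", 11272, 106, 0.12, 616, 899)


def compare(a: BenchmarkRow, b: BenchmarkRow) -> dict:
    """Return ratios describing row `a` relative to row `b`."""
    return {
        "prefill_speedup": a.prefill_tps / b.prefill_tps,
        "decode_speedup": a.decode_tps / b.decode_tps,
        "ttft_reduction": b.ttft_s / a.ttft_s,  # >1 means `a` responds sooner
        "size_ratio": a.size_mb / b.size_mb,
        "memory_ratio": a.memory_mb / b.memory_mb,
    }


if __name__ == "__main__":
    for name, value in compare(NPU, GPU).items():
        print(f"{name}: {value:.2f}")
```

Under these assumptions, the NPU row comes out at roughly 5x the first throughput figure, about 1.7x the second, and about 4.6x lower TTFT than the GPU row, with both `MB` figures smaller.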