Commit 76c8c6f (verified) by qaihm-bot · 1 parent: d130e22

See https://github.com/quic/ai-hub-models/releases/v0.40.0 for changelog.

Files changed (1)
  1. README.md +1 -0
README.md CHANGED
@@ -48,6 +48,7 @@ More details on model performance across various devices, can be found
 
 | Model | Precision | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds)
 |---|---|---|---|---|---|
+| Falcon3-7B-Instruct | w4a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | GENIE | 15.8303 | 0.10903 - 3.488966 | -- | Use Export Script |
 | Falcon3-7B-Instruct | w4a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | GENIE | 14.02985 | 0.1265205 - 4.048656 | -- | Use Export Script |
 | Falcon3-7B-Instruct | w4a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | GENIE | 9.96829 | 0.1973798 - 6.3161536 | -- | Use Export Script |
 