Upload 大模型.txt
#2
by
ZetianUser
- opened
大模型.txt
ADDED
|
@@ -0,0 +1,30 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
|
| 2 |
+
prompt+Image index + question
|
| 3 |
+
|
| 4 |
+
|
| 5 |
+
tensor([[ 1, 319, 13563, 1546, 263, 12758, 5199, 322, 385, 23116,
|
| 6 |
+
21082, 20255, 29889, 450, 20255, 4076, 8444, 29892, 13173, 29892,
|
| 7 |
+
322, 1248, 568, 6089, 304, 278, 5199, 29915, 29879, 5155,
|
| 8 |
+
29889, 3148, 1001, 29901, 29871, -200, 29871, 13, 5618, 1203,
|
| 9 |
+
4318, 2444, 1556, 22910, 29973, 319, 29901, 9795, 342, 374,
|
| 10 |
+
550, 350, 29901, 824, 342, 432, 2873, 315, 29901, 3018,
|
| 11 |
+
2416, 3578, 360, 29901, 29352, 29889, 319, 1799, 9047, 13566,
|
| 12 |
+
29901]], device='cuda:0')
|
| 13 |
+
|
| 14 |
+
<s> A chat between a curious human and an artificial intelligence assistant
|
| 15 |
+
646
|
| 16 |
+
|
| 17 |
+
576
|
| 18 |
+
|
| 19 |
+
input 71 prompt+Image index token(1) + question
|
| 20 |
+
|
| 21 |
+
input_embedding (1,646,4096)
|
| 22 |
+
num image token 646 - (71 - 1) = 576
|
| 23 |
+
|
| 24 |
+
|
| 25 |
+
squence 176
|
| 26 |
+
score 175 shape (1, 32000)
|
| 27 |
+
attentions 175 32 shape (1,32,1,646+num_generate_token)
|
| 28 |
+
past kv 32 (1,32(head),820(squence length),128)
|
| 29 |
+
|
| 30 |
+
生成时output.sequences比hidden_state长度多1,因为最后会拼接一个<\s>即end_of_sequence的特殊token以表示序列结束。且模型输入后会先生成一个<s>即start_of_sequence,
|