Serbian
dict2vec
Commit 2792d46 by rankas · verified · 1 Parent(s): f90a0d8

Update README.md

Files changed (1)
  1. README.md +171 -1
README.md CHANGED
@@ -8,4 +8,174 @@ base_model:
8
  - te-sla/Word2VecSr
9
  tags:
10
  - dict2vec
11
- ---
8
  - te-sla/Word2VecSr
9
  tags:
10
  - dict2vec
11
+ ---
12
+
13
+
14
+ <table style="width:100%;height:100%">
15
+ <tr>
16
+ <td colspan=2>
17
+ <h4><i class="highlight-container"><b class="highlight">Word2Vec Sr</b></i></h4>
18
+ </td>
19
+ </tr>
20
+ <tr style="width:100%;height:100%">
21
+ <td width=50%>
22
+ <p>Обучаван над корпусом српског језика - 9.5 милијарди речи</p>
23
+ <p>Међу датотекама се налазе два модела (CBOW и SkipGram варијанте)</p>
24
+ </td>
25
+ <td>
26
+ <p>Trained on a Serbian-language corpus of 9.5 billion words</p>
27
+ <p>The files include two models: the CBOW and SkipGram variants</p>
28
+ </td>
29
+ </tr>
30
+ </table>
31
+
32
+
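As background on the two variants: CBOW predicts a word from its surrounding context, while skip-gram predicts each context word from the center word. The sketch below only illustrates how the training examples differ. `training_pairs`, the `window` parameter and the toy token list are hypothetical and are not taken from the actual training setup of these models.

```python
def training_pairs(tokens, window=2):
    """Build illustrative training examples for both Word2Vec variants.

    Returns a tuple (cbow, skipgram) where
      cbow     holds (context_words, center_word) examples and
      skipgram holds (center_word, context_word) examples.
    """
    cbow, skipgram = [], []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        # All neighbours inside the window, excluding the center word itself
        context = [tokens[j] for j in range(lo, hi) if j != i]
        if not context:
            continue
        cbow.append((tuple(context), center))           # many words -> one word
        skipgram.extend((center, c) for c in context)   # one word -> one word
    return cbow, skipgram


if __name__ == "__main__":
    # Toy token list (not a real sentence), reusing words from the README example
    tokens = ["dim", "staklo", "zavesa", "prozor"]
    cbow, skipgram = training_pairs(tokens, window=1)
    print(cbow)
    print(skipgram)
```

With a window of 1, each CBOW example groups the neighbours of a word into a single input, whereas skip-gram emits one pair per neighbour.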
+ ```python
+ from gensim.models import Word2Vec
+
+ # "TeslaSG" is the model file name used in this README
+ model = Word2Vec.load("TeslaSG")
+
+ # Word pairs to compare; each word is paired with "zavesa" (curtain)
+ examples = [
+     ("dim", "zavesa"),
+     ("staklo", "zavesa"),
+     ("ormar", "zavesa"),
+     ("prozor", "zavesa"),
+     ("draperija", "zavesa"),
+ ]
+ for e in examples:
+     # Print the similarity score for each pair
+     print(model.wv.similarity(e[0], e[1]))
+ ```
+ ```
+ 0.5193785
+ 0.5763144
+ 0.59982747
+ 0.6022524
+ 0.7117646
+ ```
53
+
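The scores above are cosine similarities, which is what gensim's `similarity` method computes for a pair of word vectors. A minimal sketch of that calculation, assuming made-up 3-dimensional vectors (the helper `cosine_similarity` and the `toy_vectors` values are illustrative and not taken from the model):

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Hypothetical toy vectors, NOT real embeddings from the model
toy_vectors = {
    "zavesa": [1.0, 2.0, 0.0],
    "draperija": [2.0, 4.0, 0.0],  # same direction as "zavesa"
    "dim": [0.0, 0.0, 3.0],        # orthogonal to "zavesa"
}

if __name__ == "__main__":
    print(cosine_similarity(toy_vectors["zavesa"], toy_vectors["draperija"]))
    print(cosine_similarity(toy_vectors["zavesa"], toy_vectors["dim"]))
```

Vectors pointing in the same direction score close to 1 and orthogonal vectors score 0, which is why the closest pair in the README output, "draperija" and "zavesa", has the highest value.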
54
+ <div class="inline-flex flex-col" style="line-height: 1.5;padding-right:50px">
55
+ <div style="text-align: center; margin-top: 3px; font-size: 16px; font-weight: 800">Author</div>
56
+ <a href="https://huggingface.co/procesaur">
57
+ <div class="flex">
58
+ <div
59
+ style="display:DISPLAY_1; margin-left: auto; margin-right: auto; width: 92px; height:92px; border-radius: 50%;
60
+ background-size: cover; background-image: url(&#39;https://cdn-uploads.huggingface.co/production/uploads/1673534533167-63bc254fb8c61b8aa496a39b.jpeg?w=200&h=200&f=face&#39;)">
61
+ </div>
62
+ </div>
63
+ </a>
64
+ <div style="text-align: center; font-size: 16px; font-weight: 800">Mihailo Škorić</div>
65
+ <div>
66
+ <a href="https://huggingface.co/procesaur">
67
+ <div style="text-align: center; font-size: 14px;">@procesaur</div>
68
+ </a>
69
+ </div>
70
+ </div>
72
+
73
+ <div class="inline-flex flex-col" style="line-height: 1.5;">
74
+ <div style="text-align: center; margin-top: 3px; font-size: 16px; font-weight: 800">Computation</div>
75
+ <a href="https://tesla.rgf.bg.ac.rs">
76
+ <div class="flex">
77
+ <div
78
+ style="display:DISPLAY_1; margin-left: auto; margin-right: auto; width: 92px; height:92px; border-radius: 50%;
79
+ background-size: cover; background-image: url(https://cdn-avatars.huggingface.co/v1/production/uploads/63bc254fb8c61b8aa496a39b/TfM_-sc8-b34ddfhHBGTA.png?w=200&h=200&f=face)">
80
+ </div>
81
+ </div>
82
+ </a>
83
+ <div style="text-align: center; font-size: 16px; font-weight: 800">TESLA project</div>
84
+ <div>
85
+ <a href="https://huggingface.co/te-sla">
86
+ <div style="text-align: center; font-size: 14px;">@te-sla</div>
87
+ </a>
88
+ </div>
89
+ </div>
91
+ <br/><br/>
92
+ <div id="zastava">
93
+ <div class="grb">
94
+ <img src="https://www.ai.gov.rs/img/logo_60x120-2.png" style="position:relative; left:30px; z-index:10; height:85px">
95
+ </div>
96
+ <table width=100% style="border:0px">
97
+ <tr style="background-color:#C6363C;width:100%;border:0px;height:30px"><td style="width:100vw"></td></tr>
98
+ <tr style="background-color:#0C4076;width:100%;border:0px;height:30px"><td></td></tr>
99
+ <tr style="background-color:#ffffff;width:100%;border:0px;height:30px"><td></td></tr>
100
+ </table>
101
+ </div>
102
+
103
+ <table style="width:100%;height:100%">
104
+ <tr style="width:100%;height:100%">
105
+ <td width=50%>
106
+ <p>Истраживање је спроведено уз подршку Фонда за науку Републике Србије, #7276, Text Embeddings – Serbian Language Applications – TESLA</p>
107
+ </td>
108
+ <td>
109
+ <p>This research was supported by the Science Fund of the Republic of Serbia, #7276, Text Embeddings - Serbian Language Applications - TESLA</p>
110
+ </td>
111
+ </tr>
112
+ </table>
113
+
114
+
115
+
116
+ <style>
117
+ .ffeat {
118
+ color:red
119
+ }
120
+
121
+ .cover {
122
+ width: 100%;
123
+ margin-bottom: 5pt
124
+ }
125
+
126
+ .highlight-container, .highlight {
127
+ position: relative;
128
+ text-decoration:none
129
+ }
130
+
131
+ .highlight-container {
132
+ display: inline-block;
133
+
134
+ }
135
+
136
+ .highlight{
137
+ color:white;
138
+ text-transform:uppercase;
139
+ font-size: 16pt;
140
+ }
141
+
142
+ .highlight-container{
143
+ padding:5px 10px
144
+ }
145
+
146
+ .highlight-container:before {
147
+ content: " ";
148
+ display: block;
149
+ height: 100%;
150
+ width: 100%;
151
+ margin-left: 0px;
152
+ margin-right: 0px;
153
+ position: absolute;
154
+ background: #e80909;
155
+ transform: rotate(2deg);
156
+ top: -1px;
157
+ left: -1px;
158
+ border-radius: 20% 25% 20% 24%;
159
+ padding: 10px 18px 18px 10px;
160
+ }
161
+
162
+ div.grb, #zastava>table {
163
+ position:absolute;
164
+ top:0px;
165
+ left: 0px;
166
+ margin:0px
167
+ }
168
+
169
+ div.grb>img, #zastava>table{
170
+ margin:0px
171
+ }
172
+
173
+ #zastava {
174
+ position: relative;
175
+ margin-bottom:120px
176
+ }
177
+
178
+ p {
179
+ font-size:14pt
180
+ }
181
+ </style>