Peraboom committed on
Commit
cf7f01b
·
1 Parent(s): e0bf7fc
Files changed (1)
  1. README.md +1 -0
README.md CHANGED
@@ -1,3 +1,4 @@
  ---
  license: other
  ---
+ This is a distilled model from BERT base uncased. It has 6 layers, 6 attention heads, and a hidden size of 384, for 29.8M parameters. Performance-wise, it can potentially reach 87% of the performance of BERT base, which has 12 layers and 12 heads with 110M parameters.
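
The parameter figures in the added line can be sanity-checked with a rough count. The following is a minimal sketch, not the author's method: it assumes the standard BERT layout (a 30522-token vocabulary, 512 positions, 2 token types, a pooler layer) and an intermediate feed-forward size of 3072, none of which is stated in the README. `bert_param_count` is a hypothetical helper written for this illustration. The number of attention heads does not appear because it does not change the parameter count.

```python
def bert_param_count(hidden, layers, intermediate,
                     vocab=30522, max_pos=512, type_vocab=2):
    """Approximate parameter count of a BERT-style encoder with a pooler.

    The defaults for vocab, max_pos and type_vocab are assumptions taken
    from the usual BERT base uncased setup, not from the README.
    """
    layer_norm = 2 * hidden  # scale and bias
    # Word, position and token-type embeddings, plus one LayerNorm.
    embeddings = (vocab + max_pos + type_vocab) * hidden + layer_norm
    # Query, key, value and output projections (weights + biases), plus LayerNorm.
    attention = 4 * (hidden * hidden + hidden) + layer_norm
    # Two feed-forward projections (weights + biases), plus LayerNorm.
    feed_forward = (hidden * intermediate + intermediate
                    + intermediate * hidden + hidden
                    + layer_norm)
    # Dense pooler applied to the first token.
    pooler = hidden * hidden + hidden
    return embeddings + layers * (attention + feed_forward) + pooler


# Assumed intermediate size of 3072 for both models.
small = bert_param_count(hidden=384, layers=6, intermediate=3072)
base = bert_param_count(hidden=768, layers=12, intermediate=3072)

print(f"{small:,}")  # 29,800,320
print(f"{base:,}")   # 109,482,240
```

Under these assumptions the count lands on roughly 29.8M for the distilled model and roughly 110M for BERT base, which is consistent with the numbers quoted above; a different intermediate size or vocabulary would give a different total.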