Oleg Dmitriev
qilowoq
AI & ML interests
NLP (mainly in Russian)
Organizations
Job opportunity
1
#5 opened 10 months ago
by
1000scores
Sliding window vs. Global Attention
6
#41 opened over 1 year ago
by
tanliboy
Adding `safetensors` variant of this model
#4 opened about 1 year ago
by
SFconvertbot
Adding `safetensors` variant of this model
#1 opened about 1 year ago
by
SFconvertbot
How can we access the logits from this model output?
5
#3 opened about 2 years ago
by
vishwasprabhub
Methodology questions
2
#2 opened over 2 years ago
by
justinbarton
Different size between tokenizer vocab and embedding
2
#1 opened over 2 years ago
by
demharters
Different size between tokenizer vocab and embedding
2
#1 opened over 2 years ago
by
demharters