Is this the working implementation of "HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization"?
#2 opened over 2 years ago by Pranab11
Adding `safetensors` variant of this model
#1 opened almost 3 years ago by SFconvertbot