LexiMind / scripts /build_discovery_dataset.py

Commit History

Full training results & evaluation with BERTScore
1e95f87

OliverPerrin committed on

Medium training run
b93250a

OliverPerrin committed on

Fix literary summaries: exclude plays/epics, use human summaries from BookSum
f1cb860

OliverPerrin committed on

Fix literary summaries: use only BookSum data (not Gutenberg excerpts)
d57b866

OliverPerrin committed on

Fix regression: restore balanced training settings, add technical manual filter
9710200

OliverPerrin committed on

Major improvements: real titles, anti-overfitting, better demo UX
0fe274c

OliverPerrin committed on

Add English language filter to all data downloads
8573220

OliverPerrin committed on

Updated training run, fixed dataset language issue
e3422d2

OliverPerrin committed on

Improve discovery dataset quality
5c41b92

OliverPerrin committed on

Use HF Datasets for discovery demo
218e2b1

OliverPerrin committed on