gpt2_base_prefix_682k / scripts /preprocess_data.py
augustocsc's picture
GPT-2 Base trained on prefix dataset (682K)
5faf2eb verified
# Script para pré-processar dados (raw -> processed)