Surromind
/

RetrievalLLM-preview

Text Generation

Model card Files Files and versions

daebum commited on Mar 21, 2025

Commit

771b5ea

·

verified ·

1 Parent(s): ffca536

Update README.md

Files changed (1) hide show

README.md +11 -11

README.md CHANGED Viewed

@@ -41,17 +41,17 @@ Command r plus 모델을 이용하여 자체 구축한 RAG 특화 데이터셋,
 ```
 ## 학습 환경 및 파라미터
-튜닝 환경 : H100(80GB) * 8
--tokenizer_model_mex_length 4500
--use_flash_attn True
--num_train_epochs 3.0
--weight_decay 0.001
--lr_scheduler_type "linear"
--per_device_train_batch_size 1
--gradient_accumulation_steps 64
--learning_rate 5e-06
--bf16 True
--deepspeed ds_stage2.json
 ## 사용 데이터셋
 - AIhub 16 행정 문서 대상 기계독해 데이터

 ```
 ## 학습 환경 및 파라미터
+- 튜닝 환경 : H100(80GB) * 8
+- tokenizer_model_mex_length 4500
+- use_flash_attn True
+- num_train_epochs 3.0
+- weight_decay 0.001
+- lr_scheduler_type "linear"
+- per_device_train_batch_size 1
+- gradient_accumulation_steps 64
+- learning_rate 5e-06
+- bf16 True
+- deepspeed ds_stage2.json
 ## 사용 데이터셋
 - AIhub 16 행정 문서 대상 기계독해 데이터