Configuration Parsing Warning:In adapter_config.json: "peft.task_type" must be a string

paddleocr-nepali-stage1-secondrun

This model is a fine-tuned version of strangervisionhf/PaddleOCR-VL-1.5-hf-transformers-v5.2.0.dev0 on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

learning_rate: 0.0002
train_batch_size: 4
eval_batch_size: 4
seed: 42
gradient_accumulation_steps: 8
total_train_batch_size: 32
optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 1125
training_steps: 11256

Training Loss	Epoch	Step	Validation Loss
0.6124	0.8889	1000	0.2542
0.3151	1.7778	2000	0.1858
0.4257	2.6667	3000	0.1601
0.2122	3.5556	4000	0.1526
0.1546	4.4444	5000	0.1481
0.0744	5.0027	5628	0.1476
0.2220	5.3333	6000	0.1535
0.2524	6.2222	7000	0.1541
0.1360	7.1111	8000	0.1520
0.1078	8.0	9000	0.1484
0.2665	8.8889	10000	0.1480

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Finetuned

Finetuned

Adapter

(4)

this model