Updated Readme
README.md
CHANGED
@@ -45,6 +45,17 @@ MedOCR-Vision is a vision-language model fine-tuned specifically for optical cha
 - **Target Modules**: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj, out_proj, fc1, fc2, linear_1, linear_2
 - **Precision**: Mixed (BF16/FP16)
 
+## Performance Highlights
+
+### Model Improvements Over Base Model
+
+Our fine-tuned model demonstrates significant improvements across multiple metrics:
+
+- ✅ **Enhanced Information Extraction**: Captures more complete medical information including headers, test values, and reference ranges
+- ✅ **Better Document Understanding**: Improved coverage of document structure and context
+- ✅ **Medical Domain Specialization**: Superior performance on medical terminology and clinical data
+- ✅ **Comprehensive Coverage**: Extracts significantly more relevant content from medical documents
+
 ## Intended Uses
 
 ### Primary Use Cases
@@ -282,16 +293,12 @@ _ = model.generate(
 - Implement appropriate error handling and validation
 - Consider the implications of automated medical document processing
 
-## Model Card Authors
-
-- **Naazim** (naazimsnh02)
-
 ## Citation
 
 ```bibtex
 @misc{medocr-vision-2025,
 title={MedOCR-Vision: Medical Document OCR with PaddleOCR-VL},
-author={Naazim},
+author={Syed Naazim Hussain},
 year={2025},
 publisher={HuggingFace},
 howpublished={\url{https://huggingface.co/naazimsnh02/medocr-vision}}
@@ -300,7 +307,7 @@ _ = model.generate(
 
 ## Additional Resources
 
-- **Code Repository**: https://github.com/
+- **Code Repository**: https://github.com/naazimsnh02/medocr-vision
 - **Training Dataset**: https://huggingface.co/datasets/naazimsnh02/medocr-vision-dataset
 - **Training Notebook**: Available in the repository
 - **ERNIE Challenge**: Submitted for ERNIE AI Developer Challenge
@@ -317,10 +324,6 @@ This model is released under the MIT License. Please refer to individual dataset
 - **LLM Providers**: Nebius and Novita for data processing
 - **PaddleOCR Team**: For the excellent OCR framework
 
-## Model Card Contact
-
-For questions or issues, please open an issue in the [GitHub repository](https://github.com/Naazimsnh02/medocr-vision).
-
 ---
 
 **Model Version**: 1.0
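
The context lines of the first hunk list the adapter target modules. As a rough illustration only, and assuming these names are used the way LoRA-style adapter setups commonly use them (a layer is picked when the last component of its dotted name is one of the targets), the selection could be sketched as below. The `select_targets` helper and the example layer names are hypothetical; they are not taken from the repository or from any library's API.

```python
# Target module names copied from the README context lines in the diff.
TARGET_MODULES = [
    "q_proj", "k_proj", "v_proj", "o_proj",
    "gate_proj", "up_proj", "down_proj", "out_proj",
    "fc1", "fc2", "linear_1", "linear_2",
]


def select_targets(layer_names, targets=TARGET_MODULES):
    """Return the layer names whose last dotted component is a target.

    Hypothetical helper: it imitates suffix matching on module names and
    is not the selection code of any particular library.
    """
    wanted = set(targets)
    return [name for name in layer_names if name.rsplit(".", 1)[-1] in wanted]


# Made-up layer names, for illustration only.
example_layers = [
    "language_model.layers.0.self_attn.q_proj",
    "language_model.layers.0.mlp.gate_proj",
    "language_model.layers.0.input_layernorm",
    "visual.blocks.0.mlp.fc1",
    "projector.linear_1",
    "lm_head",
]

selected = select_targets(example_layers)
print(selected)
```

Under this assumption, normalization layers and the output head are left untouched, while attention, MLP, and projector layers with matching names would receive adapters.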