docs: fix YAML, broken image links, add library_name for inference
Browse files
README.md
CHANGED
|
@@ -85,7 +85,6 @@ The SOCAR Historical Documents AI System is a sophisticated document intelligenc
|
|
| 85 |
- **Ulvi Bashirov** - [GitHub](https://github.com/ulvibashir)
|
| 86 |
- **Samir Mehdiyev** - [GitHub](https://github.com/samirmehd1yev)
|
| 87 |
|
| 88 |
-

|
| 89 |
|
| 90 |
| Rank | Team Name | Total Score out of 300 | Total Score out of 500 | Total Presentation Score | Total Score |
|
| 91 |
|------|-----------|------------------------|------------------------|--------------------------|-------------|
|
|
@@ -215,7 +214,6 @@ We built **3 specialized Jupyter notebooks** to test every component independent
|
|
| 215 |
- `slide_4_summary_table.png` - Complete results table
|
| 216 |
- `slide_5_success_rates.png` - CSR vs WSR side-by-side
|
| 217 |
|
| 218 |
-

|
| 219 |
|
| 220 |
**Decision**: **Llama-4-Maverick-17B** selected for OCR endpoint.
|
| 221 |
|
|
@@ -477,7 +475,6 @@ We conducted comprehensive benchmarks to select the optimal language model for o
|
|
| 477 |
|
| 478 |
### Quality Score Comparison
|
| 479 |
|
| 480 |
-

|
| 481 |
|
| 482 |
**Key Findings**:
|
| 483 |
- **GPT-4.1** and **Llama-4-Maverick** tied at **52.0** quality score
|
|
@@ -495,7 +492,6 @@ We conducted comprehensive benchmarks to select the optimal language model for o
|
|
| 495 |
|
| 496 |
### Comprehensive Metrics Breakdown
|
| 497 |
|
| 498 |
-

|
| 499 |
|
| 500 |
**Breakdown by Category**:
|
| 501 |
|
|
@@ -519,7 +515,6 @@ We conducted comprehensive benchmarks to select the optimal language model for o
|
|
| 519 |
|
| 520 |
### Multi-Dimensional Performance Profile
|
| 521 |
|
| 522 |
-

|
| 523 |
|
| 524 |
**Radar Chart Dimensions**:
|
| 525 |
|
|
@@ -549,7 +544,6 @@ We conducted comprehensive benchmarks to select the optimal language model for o
|
|
| 549 |
|
| 550 |
### Response Time Analysis
|
| 551 |
|
| 552 |
-

|
| 553 |
|
| 554 |
**Latency Comparison** (Lower is Better):
|
| 555 |
|
|
@@ -578,7 +572,6 @@ We conducted comprehensive benchmarks to select the optimal language model for o
|
|
| 578 |
|
| 579 |
### Complete Overview Dashboard
|
| 580 |
|
| 581 |
-

|
| 582 |
|
| 583 |
**Four-Panel Analysis**:
|
| 584 |
|
|
@@ -609,49 +602,6 @@ We conducted comprehensive benchmarks to select the optimal language model for o
|
|
| 609 |
|
| 610 |
---
|
| 611 |
|
| 612 |
-
## Live Demo Screenshots
|
| 613 |
-
|
| 614 |
-
### Web Interface
|
| 615 |
-
|
| 616 |
-
#### Landing Page
|
| 617 |
-

|
| 618 |
-
|
| 619 |
-
*Main interface with OCR and LLM tabs for document processing and Q&A*
|
| 620 |
-
|
| 621 |
-
#### LLM Question Answering
|
| 622 |
-
|
| 623 |
-

|
| 624 |
-
|
| 625 |
-
*Interactive Q&A interface for querying SOCAR historical documents in Azerbaijani*
|
| 626 |
-
|
| 627 |
-

|
| 628 |
-
|
| 629 |
-
*Complete answer with source citations and document references*
|
| 630 |
-
|
| 631 |
-
#### OCR Processing
|
| 632 |
-
|
| 633 |
-

|
| 634 |
-
|
| 635 |
-
*PDF upload and OCR processing initiation*
|
| 636 |
-
|
| 637 |
-

|
| 638 |
-
|
| 639 |
-
*Real-time OCR processing with page-by-page progress*
|
| 640 |
-
|
| 641 |
-

|
| 642 |
-
|
| 643 |
-
*Completed OCR extraction with formatted markdown output*
|
| 644 |
-
|
| 645 |
-
#### Additional Views
|
| 646 |
-
|
| 647 |
-

|
| 648 |
-
|
| 649 |
-
*Document selection and upload interface*
|
| 650 |
-
|
| 651 |
-

|
| 652 |
-
|
| 653 |
-
*Advanced features and settings panel*
|
| 654 |
-
|
| 655 |
---
|
| 656 |
|
| 657 |
## Key Features
|
|
|
|
| 85 |
- **Ulvi Bashirov** - [GitHub](https://github.com/ulvibashir)
|
| 86 |
- **Samir Mehdiyev** - [GitHub](https://github.com/samirmehd1yev)
|
| 87 |
|
|
|
|
| 88 |
|
| 89 |
| Rank | Team Name | Total Score out of 300 | Total Score out of 500 | Total Presentation Score | Total Score |
|
| 90 |
|------|-----------|------------------------|------------------------|--------------------------|-------------|
|
|
|
|
| 214 |
- `slide_4_summary_table.png` - Complete results table
|
| 215 |
- `slide_5_success_rates.png` - CSR vs WSR side-by-side
|
| 216 |
|
|
|
|
| 217 |
|
| 218 |
**Decision**: **Llama-4-Maverick-17B** selected for OCR endpoint.
|
| 219 |
|
|
|
|
| 475 |
|
| 476 |
### Quality Score Comparison
|
| 477 |
|
|
|
|
| 478 |
|
| 479 |
**Key Findings**:
|
| 480 |
- **GPT-4.1** and **Llama-4-Maverick** tied at **52.0** quality score
|
|
|
|
| 492 |
|
| 493 |
### Comprehensive Metrics Breakdown
|
| 494 |
|
|
|
|
| 495 |
|
| 496 |
**Breakdown by Category**:
|
| 497 |
|
|
|
|
| 515 |
|
| 516 |
### Multi-Dimensional Performance Profile
|
| 517 |
|
|
|
|
| 518 |
|
| 519 |
**Radar Chart Dimensions**:
|
| 520 |
|
|
|
|
| 544 |
|
| 545 |
### Response Time Analysis
|
| 546 |
|
|
|
|
| 547 |
|
| 548 |
**Latency Comparison** (Lower is Better):
|
| 549 |
|
|
|
|
| 572 |
|
| 573 |
### Complete Overview Dashboard
|
| 574 |
|
|
|
|
| 575 |
|
| 576 |
**Four-Panel Analysis**:
|
| 577 |
|
|
|
|
| 602 |
|
| 603 |
---
|
| 604 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 605 |
---
|
| 606 |
|
| 607 |
## Key Features
|