Update README.md
Browse files
README.md
CHANGED
|
@@ -9,7 +9,8 @@ tags:
|
|
| 9 |
- presentation-templates
|
| 10 |
- information-retrieval
|
| 11 |
- gemma
|
| 12 |
-
base_model:
|
|
|
|
| 13 |
datasets:
|
| 14 |
- cyberagent/crello
|
| 15 |
language:
|
|
@@ -21,7 +22,7 @@ language:
|
|
| 21 |
## Model Details
|
| 22 |
|
| 23 |
### Model Description
|
| 24 |
-
A fine-tuned text generation model for query generation from presentation template metadata. This model uses LoRA adapters to efficiently fine-tune Google Gemma-3-4B
|
| 25 |
|
| 26 |
**Developed by:** Mudasir Syed (mudasir13cs)
|
| 27 |
|
|
@@ -31,14 +32,14 @@ A fine-tuned text generation model for query generation from presentation templa
|
|
| 31 |
|
| 32 |
**License:** Apache 2.0
|
| 33 |
|
| 34 |
-
**Finetuned from model:** unsloth/gemma-3-4b-it
|
| 35 |
|
| 36 |
**Paper:** [Field-Adaptive Dense Retrieval of Structured Documents](https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE12352544)
|
| 37 |
|
| 38 |
### Model Sources
|
| 39 |
- **Repository:** https://github.com/mudasir13cs/hybrid-search
|
| 40 |
- **Paper:** https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE12352544
|
| 41 |
-
- **Base Model:** https://huggingface.co/unsloth/gemma-3-4b-it
|
| 42 |
|
| 43 |
## Uses
|
| 44 |
|
|
@@ -93,8 +94,8 @@ print(generated_text)
|
|
| 93 |
- **Domain:** Presentation templates with field-adaptive metadata
|
| 94 |
|
| 95 |
### Training Procedure
|
| 96 |
-
- **Architecture:** Google Gemma-3-4B
|
| 97 |
-
- **Base Model:** unsloth/gemma-3-4b-it
|
| 98 |
- **Loss Function:** Cross-entropy loss
|
| 99 |
- **Optimizer:** AdamW
|
| 100 |
- **Learning Rate:** 2e-4
|
|
@@ -137,7 +138,7 @@ print(generated_text)
|
|
| 137 |
## Technical Specifications
|
| 138 |
|
| 139 |
### Model Architecture and Objective
|
| 140 |
-
- **Base Architecture:** Google Gemma-3-4B
|
| 141 |
- **Adaptation:** LoRA adapters for parameter-efficient fine-tuning
|
| 142 |
- **Objective:** Generate relevant search queries from template metadata for field-adaptive dense retrieval
|
| 143 |
- **Input:** Template metadata (title, description, industries, categories, tags)
|
|
|
|
| 9 |
- presentation-templates
|
| 10 |
- information-retrieval
|
| 11 |
- gemma
|
| 12 |
+
base_model:
|
| 13 |
+
- unsloth/gemma-3-4b-it-unsloth-bnb-4bit
|
| 14 |
datasets:
|
| 15 |
- cyberagent/crello
|
| 16 |
language:
|
|
|
|
| 22 |
## Model Details
|
| 23 |
|
| 24 |
### Model Description
|
| 25 |
+
A fine-tuned text generation model for query generation from presentation template metadata. This model uses LoRA adapters to efficiently fine-tune Google Gemma-3-4B for generating diverse and relevant search queries as part of the Field-Adaptive Dense Retrieval framework.
|
| 26 |
|
| 27 |
**Developed by:** Mudasir Syed (mudasir13cs)
|
| 28 |
|
|
|
|
| 32 |
|
| 33 |
**License:** Apache 2.0
|
| 34 |
|
| 35 |
+
**Finetuned from model:** unsloth/gemma-3-4b-it-unsloth-bnb-4bit
|
| 36 |
|
| 37 |
**Paper:** [Field-Adaptive Dense Retrieval of Structured Documents](https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE12352544)
|
| 38 |
|
| 39 |
### Model Sources
|
| 40 |
- **Repository:** https://github.com/mudasir13cs/hybrid-search
|
| 41 |
- **Paper:** https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE12352544
|
| 42 |
+
- **Base Model:** https://huggingface.co/unsloth/gemma-3-4b-it-unsloth-bnb-4bit
|
| 43 |
|
| 44 |
## Uses
|
| 45 |
|
|
|
|
| 94 |
- **Domain:** Presentation templates with field-adaptive metadata
|
| 95 |
|
| 96 |
### Training Procedure
|
| 97 |
+
- **Architecture:** Google Gemma-3-4B with LoRA adapters
|
| 98 |
+
- **Base Model:** unsloth/gemma-3-4b-it-unsloth-bnb-4bit
|
| 99 |
- **Loss Function:** Cross-entropy loss
|
| 100 |
- **Optimizer:** AdamW
|
| 101 |
- **Learning Rate:** 2e-4
|
|
|
|
| 138 |
## Technical Specifications
|
| 139 |
|
| 140 |
### Model Architecture and Objective
|
| 141 |
+
- **Base Architecture:** Google Gemma-3-4B transformer decoder
|
| 142 |
- **Adaptation:** LoRA adapters for parameter-efficient fine-tuning
|
| 143 |
- **Objective:** Generate relevant search queries from template metadata for field-adaptive dense retrieval
|
| 144 |
- **Input:** Template metadata (title, description, industries, categories, tags)
|