mudasir13cs commited on
Commit
367df65
·
verified ·
1 Parent(s): 31b6e68

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -7
README.md CHANGED
@@ -9,7 +9,8 @@ tags:
9
  - presentation-templates
10
  - information-retrieval
11
  - gemma
12
- base_model: unsloth/gemma-3-4b-it
 
13
  datasets:
14
  - cyberagent/crello
15
  language:
@@ -21,7 +22,7 @@ language:
21
  ## Model Details
22
 
23
  ### Model Description
24
- A fine-tuned text generation model for query generation from presentation template metadata. This model uses LoRA adapters to efficiently fine-tune Google Gemma-3-4B-IT for generating diverse and relevant search queries as part of the Field-Adaptive Dense Retrieval framework.
25
 
26
  **Developed by:** Mudasir Syed (mudasir13cs)
27
 
@@ -31,14 +32,14 @@ A fine-tuned text generation model for query generation from presentation templa
31
 
32
  **License:** Apache 2.0
33
 
34
- **Finetuned from model:** unsloth/gemma-3-4b-it
35
 
36
  **Paper:** [Field-Adaptive Dense Retrieval of Structured Documents](https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE12352544)
37
 
38
  ### Model Sources
39
  - **Repository:** https://github.com/mudasir13cs/hybrid-search
40
  - **Paper:** https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE12352544
41
- - **Base Model:** https://huggingface.co/unsloth/gemma-3-4b-it
42
 
43
  ## Uses
44
 
@@ -93,8 +94,8 @@ print(generated_text)
93
  - **Domain:** Presentation templates with field-adaptive metadata
94
 
95
  ### Training Procedure
96
- - **Architecture:** Google Gemma-3-4B-IT with LoRA adapters
97
- - **Base Model:** unsloth/gemma-3-4b-it
98
  - **Loss Function:** Cross-entropy loss
99
  - **Optimizer:** AdamW
100
  - **Learning Rate:** 2e-4
@@ -137,7 +138,7 @@ print(generated_text)
137
  ## Technical Specifications
138
 
139
  ### Model Architecture and Objective
140
- - **Base Architecture:** Google Gemma-3-4B-IT transformer decoder
141
  - **Adaptation:** LoRA adapters for parameter-efficient fine-tuning
142
  - **Objective:** Generate relevant search queries from template metadata for field-adaptive dense retrieval
143
  - **Input:** Template metadata (title, description, industries, categories, tags)
 
9
  - presentation-templates
10
  - information-retrieval
11
  - gemma
12
+ base_model:
13
+ - unsloth/gemma-3-4b-it-unsloth-bnb-4bit
14
  datasets:
15
  - cyberagent/crello
16
  language:
 
22
  ## Model Details
23
 
24
  ### Model Description
25
+ A fine-tuned text generation model for query generation from presentation template metadata. This model uses LoRA adapters to efficiently fine-tune Google Gemma-3-4B for generating diverse and relevant search queries as part of the Field-Adaptive Dense Retrieval framework.
26
 
27
  **Developed by:** Mudasir Syed (mudasir13cs)
28
 
 
32
 
33
  **License:** Apache 2.0
34
 
35
+ **Finetuned from model:** unsloth/gemma-3-4b-it-unsloth-bnb-4bit
36
 
37
  **Paper:** [Field-Adaptive Dense Retrieval of Structured Documents](https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE12352544)
38
 
39
  ### Model Sources
40
  - **Repository:** https://github.com/mudasir13cs/hybrid-search
41
  - **Paper:** https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE12352544
42
+ - **Base Model:** https://huggingface.co/unsloth/gemma-3-4b-it-unsloth-bnb-4bit
43
 
44
  ## Uses
45
 
 
94
  - **Domain:** Presentation templates with field-adaptive metadata
95
 
96
  ### Training Procedure
97
+ - **Architecture:** Google Gemma-3-4B with LoRA adapters
98
+ - **Base Model:** unsloth/gemma-3-4b-it-unsloth-bnb-4bit
99
  - **Loss Function:** Cross-entropy loss
100
  - **Optimizer:** AdamW
101
  - **Learning Rate:** 2e-4
 
138
  ## Technical Specifications
139
 
140
  ### Model Architecture and Objective
141
+ - **Base Architecture:** Google Gemma-3-4B transformer decoder
142
  - **Adaptation:** LoRA adapters for parameter-efficient fine-tuning
143
  - **Objective:** Generate relevant search queries from template metadata for field-adaptive dense retrieval
144
  - **Input:** Template metadata (title, description, industries, categories, tags)