Files changed (1) hide show
  1. README.md +45 -13
README.md CHANGED
@@ -10,14 +10,31 @@ tags:
10
  ---
11
 
12
 
13
- ### Model Description
14
 
15
- This model is a finetuned version of Llama3.2-3B-Instruct specifically designed for generating multilingual outputs across multiple Indic languages. The model has been trained on a diverse and curated dataset comprising Hindi, Punjabi, Marathi, Malayalam, Oriya, Kannada, Gujarati, Bengali, Urdu, Tamil, and Telugu. It is optimized to handle natural language tasks such as translation, summarization, and conversational generation across these languages effectively.
16
 
17
- - **Developed by:** [More Information Needed]
 
 
 
 
18
  - **Model type:** Finetuned LLaMA (Language Model for Multilingual Text Generation)
19
- - **Language(s) (NLP):** Hindi, Punjabi, Marathi, Malayalam, Oriya, Kannada, Gujarati, Bengali, Urdu, Tamil, Telugu
20
- - **Finetuned from model:** Llama3.2-3B-Instruct
 
 
 
 
 
 
 
 
 
 
 
 
 
21
 
22
  ## How to Get Started with the Model
23
 
@@ -52,9 +69,8 @@ print(outputs[0]["generated_text"][-1])
52
 
53
  ### Training Data
54
 
55
- The training dataset included a diverse collection of text sources in:
56
-
57
- - Hindi, Punjabi, Marathi, Malayalam, Oriya, Kannada, Gujarati, Bengali, Urdu, Tamil, and Telugu.
58
 
59
  ### Training Parameters
60
 
@@ -64,11 +80,27 @@ The training dataset included a diverse collection of text sources in:
64
  - **Learning Rate**: 5e-05
65
 
66
 
67
- ## Environmental Impact
68
 
69
- - **Hardware Type:** T4
70
- - **Hours used:** 29 hours
71
  - **Cloud Provider:** Google Cloud Platform
72
- - **Compute Region:** asia-southeast1
73
- - **Carbon Emitted:** Total emissions are estimated to be 0.85 kgCO$_2$eq of which 100 percents were directly offset by the cloud provider.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
74
 
 
10
  ---
11
 
12
 
13
+ # Model Information
14
 
15
+ BGPT is a finetuned version of Llama3.2-3B-Instruct, specifically optimized for generating high-quality multilingual outputs across 11 Indic languages. The model demonstrates strong capabilities in translation, summarization, and conversational tasks while maintaining the base model's performance characteristics.
16
 
17
+ ## Model Developer
18
+ Harsh Bande
19
+
20
+ ## Model Architecture
21
+ - **Base Model:** Llama3.2-3B-Instruct
22
  - **Model type:** Finetuned LLaMA (Language Model for Multilingual Text Generation)
23
+ - **Architecture Type:** Auto-regressive language model with optimized transformer architecture
24
+ - **Adaptation Method:** LoRA (Low-Rank Adaptation)
25
+ - **Model Type:** Instruction-tuned multilingual text generation model
26
+
27
+ ## Supported Languages
28
+ Hindi, Punjabi, Marathi, Malayalam, Oriya, Kannada, Gujarati, Bengali, Urdu, Tamil, and Telugu
29
+
30
+ # Intended Use
31
+
32
+ ## Primary Use Cases
33
+ - Multilingual text generation
34
+ - Cross-lingual translation
35
+ - Text summarization
36
+ - Conversational AI in Indic languages
37
+ - Language understanding and generation tasks
38
 
39
  ## How to Get Started with the Model
40
 
 
69
 
70
  ### Training Data
71
 
72
+ - **Dataset Composition:** Curated collection of text from 11 Indic languages
73
+ - **Languages Covered:** Hindi, Punjabi, Marathi, Malayalam, Oriya, Kannada, Gujarati, Bengali, Urdu, Tamil, and Telugu
 
74
 
75
  ### Training Parameters
76
 
 
80
  - **Learning Rate**: 5e-05
81
 
82
 
83
+ ## Hardware and Environmental Impact
84
 
85
+ ### Training Infrastructure
86
+ - **Hardware:** T4 GPU
87
  - **Cloud Provider:** Google Cloud Platform
88
+ - **Region:** asia-southeast1
89
+ - **Training Duration:** 29 hours
90
+
91
+ ### Environmental Impact Assessment
92
+ - **Carbon Emissions:** 0.85 kgCO₂eq
93
+ - **Carbon Offset:** 100% offset by the cloud provider
94
+ - **Location:** asia-southeast1 region
95
+
96
+ ## Limitations and Biases
97
+ - The model's performance may vary across different Indic languages
98
+ - The model inherits both capabilities and limitations of the base Llama3.2-3B-Instruct model
99
+ - Users should conduct appropriate testing for their specific use cases
100
+
101
+ ## License
102
+ [More Information Needed]
103
+
104
+ ## Citation and References
105
+ [More Information Needed]
106