sgattup commited on
Commit
7bd602e
ยท
verified ยท
1 Parent(s): a76f169

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +115 -14
README.md CHANGED
@@ -1,22 +1,123 @@
1
  ---
2
- base_model: unsloth/gemma-4-e4b-it-unsloth-bnb-4bit
3
- tags:
4
- - text-generation-inference
5
- - transformers
6
- - unsloth
7
- - gemma4
8
- - trl
9
- license: apache-2.0
10
  language:
11
  - en
 
 
 
 
 
 
 
 
 
 
 
 
12
  ---
13
 
14
- # Uploaded model
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
15
 
16
- - **Developed by:** sgattup
17
- - **License:** apache-2.0
18
- - **Finetuned from model :** unsloth/gemma-4-e4b-it-unsloth-bnb-4bit
19
 
20
- This gemma4 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth)
21
 
22
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
 
 
 
 
 
 
 
 
1
  ---
 
 
 
 
 
 
 
 
2
  language:
3
  - en
4
+ license: apache-2.0
5
+ tags:
6
+ - indian-culture
7
+ - culture
8
+ - history
9
+ - philosophy
10
+ - fine-tuned
11
+ - unsloth
12
+ - llama
13
+ - lora
14
+ base_model: unsloth/llama-3.2-3B-bnb-4bit
15
+ pipeline_tag: text-generation
16
  ---
17
 
18
+ # ๐Ÿ‡ฎ๐Ÿ‡ณ Indian Culture LLM
19
+
20
+ A fine-tuned language model focused on **Indian culture, history, philosophy, arts, and traditions** โ€” built to answer questions about one of the world's oldest and richest civilizations.
21
+
22
+ ## What This Model Knows
23
+
24
+ This model has been trained on high-quality instruction pairs covering:
25
+
26
+ - **Hindu Mythology** โ€” Mahabharata, Ramayana, Puranas, key deities and their stories
27
+ - **Indian Philosophy** โ€” Vedanta, Advaita, Yoga (Patanjali's 8 limbs), Bhakti movement, Jainism, Buddhism, Sikhism
28
+ - **Classical Arts** โ€” Bharatanatyam, Kathak, Odissi, Kuchipudi, Manipuri, Mohiniyattam, Kathakali, Carnatic and Hindustani music, Raga system, Gharanas
29
+ - **Festivals** โ€” Diwali, Holi, Navratri, Durga Puja, Onam, Pongal, Kumbh Mela, Rath Yatra, Thrissur Pooram, Garba/Dandiya
30
+ - **Indian History** โ€” Indus Valley Civilization, Vedic period, Maurya and Gupta Empires, Mughal era, Bhakti movement, Independence movement, Partition
31
+ - **Key Figures** โ€” Gandhi, Ambedkar, Ashoka, Chandragupta, Akbar, Shivaji, Tagore, Vivekananda, Ramanujan, Aryabhata, Tansen, MS Subbulakshmi, Ravi Shankar, Lata Mangeshkar, Mirabai, Kabir Das, Adi Shankaracharya
32
+ - **Ayurveda & Wellness** โ€” Doshas, Panchakarma, Ayurvedic cooking, medicinal plants (Tulsi, Neem, Peepal)
33
+ - **Regional Cuisine** โ€” North vs South Indian food, Mughal culinary contributions, Ayurvedic cooking, Indian spices
34
+ - **Traditional Art Forms** โ€” Madhubani painting, Warli art, Pattachitra, Kolam/Rangoli, Tanjore painting
35
+ - **Sacred Texts** โ€” Bhagavad Gita, Upanishads, Arthashastra, Thirukkural, Panchatantra, Natya Shastra
36
+ - **Architecture & Heritage** โ€” Taj Mahal, Hampi, Nalanda University, Chola temples, Vastu Shastra
37
+
38
+ ## How to Use
39
+
40
+ ```python
41
+ from transformers import AutoModelForCausalLM, AutoTokenizer
42
+ import torch
43
+
44
+ model_id = "sgattup/IndianCultureLLM"
45
+ tokenizer = AutoTokenizer.from_pretrained(model_id)
46
+ model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16, device_map="auto")
47
+
48
+ ALPACA_PROMPT = """Below is a question or instruction about Indian culture.
49
+ Write a thoughtful, accurate, and detailed response.
50
+
51
+ ### Instruction:
52
+ {}
53
+
54
+ ### Response:
55
+ """
56
+
57
+ def ask(question):
58
+ prompt = ALPACA_PROMPT.format(question)
59
+ inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
60
+ outputs = model.generate(**inputs, max_new_tokens=512, temperature=0.7, do_sample=True)
61
+ response = tokenizer.decode(outputs[0], skip_special_tokens=True)
62
+ return response.split("### Response:")[-1].strip()
63
+
64
+ print(ask("What is the significance of Diwali in Indian culture?"))
65
+ print(ask("Explain the eight limbs of Yoga according to Patanjali."))
66
+ print(ask("Who was Adi Shankaracharya and what did he teach?"))
67
+ ```
68
+
69
+ ## Example Outputs
70
+
71
+ **Q: What is a Raga in Indian classical music?**
72
+
73
+ > A Raga is the fundamental melodic framework of Indian classical music โ€” not merely a scale but a complete musical personality with rules about which notes to use, how to ascend and descend, what mood it evokes, and what time of day it should be performed. Different ragas are associated with times of day: Bhairav at dawn, Yaman at dusk...
74
+
75
+ **Q: Who was Dr. B.R. Ambedkar?**
76
+
77
+ > Dr. Bhimrao Ramji Ambedkar (1891โ€“1956) was one of the most transformative figures in modern Indian history. Born into a Dalit family, he overcame severe discrimination to earn degrees from Columbia University and the London School of Economics. He was the principal architect of India's Constitution, which abolished untouchability and guaranteed fundamental rights to all citizens regardless of caste...
78
+
79
+ ## Model Details
80
+
81
+ | Property | Value |
82
+ |---|---|
83
+ | Base Model | LLaMA 3.2 3B (unsloth/llama-3.2-3B-bnb-4bit) |
84
+ | Fine-tuning Method | QLoRA via Unsloth |
85
+ | LoRA Rank | 16 |
86
+ | Training Epochs | 3 |
87
+ | Dataset Size | 75+ high-quality instruction pairs |
88
+ | Language | English |
89
+ | License | Apache 2.0 |
90
+
91
+ ## Limitations
92
+
93
+ - This is an early version trained on a relatively small dataset โ€” responses are informative but may lack depth on niche topics
94
+ - Primarily covers mainstream Hindu/pan-Indian cultural topics; regional and tribal cultures are underrepresented
95
+ - Not a substitute for academic sources on complex historical or religious topics
96
+ - May reflect certain perspectives more than others given the training data
97
+
98
+ ## Future Plans
99
+
100
+ - Expand dataset to 2,000+ examples
101
+ - Add regional language support (Tamil, Telugu, Hindi, Bengali)
102
+ - Cover tribal and indigenous Indian cultures more deeply
103
+ - Add a HuggingFace Space for interactive demos
104
+ - Train a larger 7B version for improved depth
105
+
106
+ ## Training Code
107
+
108
+ Training code and dataset are available at:
109
+ [github.com/sai-educ/indian-culture-llm](https://github.com/sai-educ/indian-culture-llm)
110
 
111
+ ## Citation
 
 
112
 
113
+ If you use this model in your work, please cite:
114
 
115
+ ```
116
+ @misc{IndianCultureLLM2026,
117
+ author = {sgattup},
118
+ title = {Indian Culture LLM},
119
+ year = {2026},
120
+ publisher = {HuggingFace},
121
+ url = {https://huggingface.co/sgattup/IndianCultureLLM}
122
+ }
123
+ ```