Update README.md
README.md CHANGED
@@ -14,17 +14,17 @@ model-index:
 
 # train_2025-05-05-15-36-22
 
-This model is a fine-tuned version of [../pretrained/Qwen3-4B](https://huggingface.co/../pretrained/Qwen3-
+This model is a fine-tuned version of [../pretrained/Qwen3-4B](https://huggingface.co/../pretrained/Qwen3-8B) on the wikipedia_zh, petro_books, datasets001, datasets002, datasets003, datasets004 and datasets006 datasets.
 
 ## Model description
 
 Gaia-Petro-LLM is a large language model specialized in the oil and gas industry, fine-tuned from Qwen/Qwen3-4B. It was further pre-trained on a curated 20GB corpus of petroleum engineering texts, including technical documents, academic papers, and domain literature. The model is designed to support domain experts, researchers, and engineers in petroleum-related tasks, providing high-quality, domain-specific language understanding and generation.
 ## Model Details
-Base Model: Qwen/Qwen3-
+Base Model: Qwen/Qwen3-8B
 Domain: Oil & Gas / Petroleum Engineering
 Corpus Size: ~20GB (petroleum engineering)
 Languages: Primarily Chinese; domain-specific English supported
-Repository: my2000cup/Gaia-LLM-
+Repository: my2000cup/Gaia-LLM-8B
 ## Intended uses & limitations
 
 Technical Q&A in petroleum engineering
@@ -49,7 +49,7 @@ Technical standards and manuals
 from transformers import AutoModelForCausalLM, AutoTokenizer
 
 # Replace with your model repository
-model_name = "my2000cup/Gaia-LLM-
+model_name = "my2000cup/Gaia-LLM-8B"
 
 # Load tokenizer and model
 tokenizer = AutoTokenizer.from_pretrained(model_name)
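The usage snippet in the diff stops after loading the tokenizer. A fuller sketch of how inference might look is below; it assumes the repository id `my2000cup/Gaia-LLM-8B` shown in this diff is publicly available and that the checkpoint ships a standard Qwen3-style chat template (neither is confirmed by the diff itself). The `build_messages` helper is an illustrative name, not part of the original README.

```python
def build_messages(question: str) -> list[dict]:
    """Wrap a single user question in the chat format expected by apply_chat_template."""
    return [{"role": "user", "content": question}]


def main() -> None:
    # transformers is imported here so the helper above works without it installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumed repository id from the diff; replace with your own if it differs.
    model_name = "my2000cup/Gaia-LLM-8B"

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(
        model_name, torch_dtype="auto", device_map="auto"
    )

    messages = build_messages("What is horizontal drilling?")
    input_ids = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)

    output_ids = model.generate(input_ids, max_new_tokens=256)
    # Decode only the newly generated tokens, not the prompt.
    print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))


if __name__ == "__main__":
    main()
```

Downloading an 8B checkpoint needs substantial disk and (ideally) GPU memory; `device_map="auto"` lets transformers place the weights across available devices.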