Kylan12 commited on
Commit
64bc028
·
verified ·
1 Parent(s): b424845

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +82 -0
README.md ADDED
@@ -0,0 +1,82 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: apache-2.0
5
+ tags:
6
+ - qwen2.5
7
+ - fine-tuned
8
+ - lora
9
+ - chemistry
10
+ base_model: Qwen/Qwen2.5-14B-Instruct
11
+ ---
12
+
13
+ # qwen-quantum
14
+
15
+ This model is a fine-tuned version of [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
16
+ using LoRA (Low-Rank Adaptation) on a chemistry dataset.
17
+
18
+ ## Model Description
19
+
20
+ Fine-tuned Qwen2.5-14B model for chemistry domain tasks.
21
+
22
+ ## Available Formats
23
+
24
+ - **GGUF**: `qwen_quantum_merged-q4_k_m.gguf` - Quantized for efficient inference with llama.cpp
25
+
26
+ ## Usage
27
+
28
+ ### Using GGUF (with llama.cpp, Ollama, LM Studio, etc.)
29
+
30
+ ```bash
31
+ # Download the GGUF file
32
+ huggingface-cli download Kylan12/qwen-quantum qwen_quantum_merged-q4_k_m.gguf
33
+
34
+ # Use with llama.cpp
35
+ ./llama.cpp/build/bin/llama-cli -m qwen_quantum_merged-q4_k_m.gguf -p "Your prompt here"
36
+ ```
37
+
38
+ ### Using HuggingFace Transformers
39
+
40
+ ```python
41
+ from transformers import AutoModelForCausalLM, AutoTokenizer
42
+
43
+ model = AutoModelForCausalLM.from_pretrained("Kylan12/qwen-quantum")
44
+ tokenizer = AutoTokenizer.from_pretrained("Kylan12/qwen-quantum")
45
+
46
+ prompt = "What is the IUPAC name for..."
47
+ inputs = tokenizer(prompt, return_tensors="pt")
48
+ outputs = model.generate(**inputs, max_length=200)
49
+ print(tokenizer.decode(outputs[0]))
50
+ ```
51
+
52
+ ## Training Details
53
+
54
+ - **Base Model**: Qwen/Qwen2.5-14B-Instruct
55
+ - **Training Method**: LoRA (Low-Rank Adaptation)
56
+ - **Dataset**: camel-ai/chemistry
57
+ - **LoRA Rank**: 16
58
+ - **LoRA Alpha**: 16
59
+ - **Target Modules**: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
60
+
61
+ ## Limitations
62
+
63
+ This model inherits the limitations of the base Qwen2.5-14B-Instruct model and may have
64
+ additional domain-specific limitations due to the fine-tuning dataset.
65
+
66
+ ## Citation
67
+
68
+ If you use this model, please cite:
69
+
70
+ ```bibtex
71
+ @misc{qwen_quantum,
72
+ author = {Your Name},
73
+ title = {qwen-quantum},
74
+ year = {2025},
75
+ publisher = {HuggingFace},
76
+ url = {https://huggingface.co/Kylan12/qwen-quantum}
77
+ }
78
+ ```
79
+
80
+ ## License
81
+
82
+ This model is released under the Apache 2.0 license, consistent with the base Qwen model.