---
license: apache-2.0
base_model: Qwen/Qwen3-8B
datasets:
- batuhanozkose/Rehber-CoT-Science
tags:
- fine-tuned
- science
- turkish
- qwen3
- academic
- chain-of-thought
- reasoning
- stem
language:
- tr
- en
pipeline_tag: text-generation
model-index:
- name: Rehber-Science
  results: []
---

# 🎓 Rehber-Science

<div align="center">

**A Qwen3-8B model fine-tuned for Turkish scientific question answering and Chain-of-Thought reasoning**

*Step-by-step thinking, mathematical problem solving, and Python verification code generation*

[Model](https://huggingface.co/batuhanozkose/Rehber-Science) • [Dataset](https://huggingface.co/datasets/batuhanozkose/Rehber-CoT-Science) • [Author](https://huggingface.co/batuhanozkose)

</div>

---

## 📌 Changelog

| Version | Date | Changes |
|---------|------|---------|
| **v2.0** | 24.12.2025 | ✨ Retrained on a new dataset (712 samples); explained_answer support; deeper CoT |
| **v1.0** | 15.12.2025 | 🚀 Initial release: full fine-tuning on 500+ samples |

---

## 🎯 Overview

**Rehber-Science** is a Turkish scientific reasoning model based on Qwen3-8B, fine-tuned on the [Rehber-CoT-Science](https://huggingface.co/datasets/batuhanozkose/Rehber-CoT-Science) dataset. The model excels at:

- 🧠 **Chain-of-Thought Reasoning**: Step-by-step problem solving
- 🔬 **Scientific Domains**: Physics, Chemistry, Biology, Mathematics, Statistics, Engineering
- 🐍 **Code Generation**: Executable Python verification code
- 📝 **Detailed Explanations**: Intuitive, real-world explanations
- 🇹🇷 **Turkish Language**: Academic Turkish with proper scientific terminology

---

## 📊 Model Details

| Property | Value |
|----------|-------|
| **Base Model** | [Qwen/Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) |
| **Parameters** | 8B |
| **Training Type** | Full Fine-Tuning |
| **Dataset** | [batuhanozkose/Rehber-CoT-Science](https://huggingface.co/datasets/batuhanozkose/Rehber-CoT-Science) |
| **Dataset Size** | 712 samples |
| **Language** | Turkish 🇹🇷 |
| **Context Length** | 32K tokens |
| **Precision** | bfloat16 |

### Training Infrastructure

| Component | Details |
|-----------|---------|
| **GPU** | NVIDIA H100 |
| **Framework** | Transformers + DeepSpeed |
| **Epochs** | 3 |
| **Learning Rate** | 2e-5 |

📊 **Training Report:** [WandB Dashboard](https://api.wandb.ai/links/batuhanozkose-org/pwcmt0or)

---

## 🧪 Training Dataset

The model was trained on [Rehber-CoT-Science](https://huggingface.co/datasets/batuhanozkose/Rehber-CoT-Science):

| Feature | Value |
|---------|-------|
| **Samples** | 712 verified QA pairs |
| **Domains** | Physics, Chemistry, Biology, Math, Statistics, Engineering, Science |
| **Difficulty** | 75% PhD, 25% Undergraduate |
| **Format** | Chain-of-Thought + Python Code |
| **Verification** | 100% (3-stage hybrid verification) |

---

## 💻 Usage

### Quick Start

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "batuhanozkose/Rehber-Science"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)

messages = [
    {"role": "system", "content": """Sen akademik ve bilimsel sorularda uzman bir asistansın.
Her soruyu çözerken şu adımları takip et:
<think>
Düşünme süreci:
1. Soruyu analiz et - ne soruluyor?
2. İlgili kavramları ve formülleri belirle
3. Çözüm stratejisini planla
4. Olası hataları ve tuzakları düşün
5. Alternatif yaklaşımları değerlendir
</think>
Sonra adım adım çözümü yaz:
- Her adımı "Adım N:" formatında numaralandır
- Her adımda mantığını açıkla
- Formül kullanıyorsan göster
Son olarak:
- **Sonuç:** kısmında final cevabı ver
- **Doğrulama Kodu (Python):** kısmında kodu yaz
Türkçe cevap ver. Detaylı ve öğretici ol."""},
    {"role": "user", "content": "Suyun elektrolizinde hangi gazlar açığa çıkar ve mol oranları nedir?"}
]

text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=1024, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
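
Since the system prompt above asks the model to wrap its reasoning in `<think>` tags, the reasoning trace can be separated from the final answer in post-processing. The helper below is an illustrative sketch (the `split_reasoning` name and regex are our own, not part of the model or any library API):

```python
import re

def split_reasoning(generated: str) -> tuple[str, str]:
    """Split a generation into (reasoning, answer) using the <think> tags
    requested by the system prompt. If no tags are found, the whole text
    is treated as the answer."""
    match = re.search(r"<think>(.*?)</think>", generated, flags=re.DOTALL)
    if not match:
        return "", generated.strip()
    reasoning = match.group(1).strip()
    answer = generated[match.end():].strip()
    return reasoning, answer

sample = "<think>1. Analyze the question</think>\n**Adım 1:** ...\n**Sonuç:** 2:1"
reasoning, answer = split_reasoning(sample)
print(reasoning)  # 1. Analyze the question
```

This lets an application show only the final answer while logging the chain-of-thought separately.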

### Loading a Specific Version

```python
from transformers import AutoModelForCausalLM

# Latest version (v2)
model = AutoModelForCausalLM.from_pretrained("batuhanozkose/Rehber-Science")

# Previous version (v1)
model_v1 = AutoModelForCausalLM.from_pretrained("batuhanozkose/Rehber-Science", revision="v1")
```

### With vLLM (Recommended for Production)

```python
from vllm import LLM, SamplingParams

llm = LLM(model="batuhanozkose/Rehber-Science")
sampling_params = SamplingParams(temperature=0.7, max_tokens=1024)

prompts = ["Bir cismin serbest düşüş hızı nasıl hesaplanır?"]
outputs = llm.generate(prompts, sampling_params)
print(outputs[0].outputs[0].text)
```

---

## 📝 Example Output

**Input:**
> Bir protonun 3.0 T manyetik alandaki Larmor frekansını hesaplayınız.
> *(Calculate the Larmor frequency of a proton in a 3.0 T magnetic field.)*

**Output:**

````
<think>
Düşünme süreci:
1. Larmor frekansı formülü: ω = γB
2. Protonun jiromanyetik oranı: γ = 2.675×10⁸ rad/s/T
3. Manyetik alan: B = 3.0 T
</think>

**Adım 1:** Larmor frekansı formülünü yazalım:
ω = γ × B

**Adım 2:** Değerleri yerleştirelim:
ω = 2.675×10⁸ rad/s/T × 3.0 T

**Adım 3:** Hesaplayalım:
ω = 8.025×10⁸ rad/s

**Sonuç:** 8.03×10⁸ rad/s

**Doğrulama Kodu (Python):**
```python
gamma = 2.675e8  # rad/s/T
B = 3.0  # Tesla
omega = gamma * B
print(f"{omega:.2e} rad/s")
```
````
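
The model's verification code runs standalone. As a quick sanity check, the same arithmetic can be reproduced here, with the conversion to ordinary frequency f = ω/2π added for illustration (it is not part of the model's output above):

```python
import math

gamma = 2.675e8  # proton gyromagnetic ratio, rad/s/T
B = 3.0          # magnetic field, T

omega = gamma * B            # angular Larmor frequency: 8.025e8 rad/s
f = omega / (2 * math.pi)    # ordinary frequency in Hz

print(f"omega = {omega:.3e} rad/s")  # omega = 8.025e+08 rad/s
print(f"f     = {f:.3e} Hz")         # f     = 1.277e+08 Hz
```

For context, ~128 MHz at 3 T is the value used in proton NMR/MRI.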

---

## 🔬 Intended Use

### Primary Use Cases

- 📚 **Educational**: Helping students understand scientific concepts
- 🔬 **Research Assistance**: Quick calculations and formula verification
- 💻 **Code Generation**: Python code for scientific computations
- 📖 **Turkish STEM Content**: Academic Turkish scientific writing

### Limitations

- Not intended for critical scientific research without human verification
- May occasionally generate incorrect calculations
- Performs best on topics covered in the training data
- Optimized for Turkish; English support is secondary

---

## 📈 Evaluation

| Metric | Score |
|--------|-------|
| CoT Quality | 9.3/10 |
| Code Accuracy | 92% |
| Turkish Fluency | 9.0/10 |
| Scientific Depth | 9.5/10 |

---

## 📄 License

This model is released under the **Apache 2.0** license.

### Citation

```bibtex
@misc{rehber_science_2025,
  title={Rehber-Science: Turkish Scientific Reasoning Model},
  author={Batuhan Ozkose},
  year={2025},
  publisher={Hugging Face},
  url={https://huggingface.co/batuhanozkose/Rehber-Science},
  note={Fine-tuned from Qwen/Qwen3-8B for Chain-of-Thought scientific reasoning}
}
```

---

## 🙏 Acknowledgments

- [Qwen Team](https://github.com/QwenLM) for the excellent base model
- [Nebius AI](https://nebius.ai/) for training infrastructure
- [Hugging Face](https://huggingface.co/) for model hosting
- The Turkish NLP community for feedback and support

---

<div align="center">

**Created by [Batuhan Ozkose](https://huggingface.co/batuhanozkose)**

Made with ❤️ for the Turkish NLP community

🎓 Scientific Reasoning • Chain-of-Thought • Verified Code 🎓

</div>