Update README.md

Browse files

Files changed (1) hide show

README.md +3 -73

README.md CHANGED Viewed

@@ -32,6 +32,9 @@ Tema_Q-R3.1 is an improved Large Language Model (LLM) tailored for Japanese, Eng
 It is designed to generate more flexible and useful responses, even for prompts that the standard Gemma 2 might find challenging to answer. It is ideal for users who wish to maximize the potential of AI in all fields, including creative writing, complex programming tasks, and deep knowledge exploration.
 GGUFファイルは以下より入手できます。
 https://huggingface.co/kawasumi/Tema_Q-R3.1-GGUF
@@ -52,76 +55,3 @@ https://huggingface.co/kawasumi/Tema_Q-R3.1-GGUF
 * **ユーザーの責任**: モデルの利用者は、生成されたコンテンツが、適用される**法律、規制、およびHugging Faceの利用規約/コンテンツポリシーに準拠**することを**全面的に保証**する必要があります。
 * **禁止事項**: このモデルを、いかなる**差別、ハラスメント、暴力、違法行為、および有害な目的**のために利用することを**固く禁じます**。
----
-## 💻 Colabで動かす
-以下のコードをGoogle Colaboratoryにコピペするだけで、**Tema_Q-R3.1** の強力な推論を体験できます。
-※ **推奨環境:** Google Colabの**T4 GPU**またはそれ以上のVRAMを持つ環境
-※GGUFからアクセスした方が高速かつ安定した推論が可能です。GGUFモデル配布ページに迂回してください。
-```python
-# 必要なライブラリをインストールします
-!pip install -qU transformers accelerate bitsandbytes
-import torch
-from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig
-# モデルID
-model_id = "kawasumi/Tema_Q-R3.1"
-# 4-bit 量子化設定 (ColabでのVRAM節約に最適)
-bnb_config = BitsAndBytesConfig(
-    load_in_4bit=True,
-    bnb_4bit_quant_type="nf4",
-    bnb_4bit_compute_dtype=torch.bfloat16 # Gemma 2に推奨される計算データ型
-)
-# モデルとトークナイザーのロード
-# device_map="auto" で、VRAMに自動で分散配置されます
-tokenizer = AutoTokenizer.from_pretrained(model_id)
-model = AutoModelForCausalLM.from_pretrained(
-    model_id,
-    quantization_config=bnb_config,
-    device_map="auto"
-)
-# 対話履歴
-# 日本語のプロンプト例
-messages = [
-    {"role": "user", "content": "生成AIについて日本語で200字以内で教えてください。"}
-]
-# 📝 変更点: トークナイズとテンプレート適用を同時に行う
-# **tokenizer() 関数に直接 messages リストを渡します**
-input_ids = tokenizer.apply_chat_template(
-    messages,
-    tokenize=True,             # トークナイズを実行
-    add_generation_prompt=True,
-    return_tensors="pt"        # PyTorchテンソルを返す
-).to(model.device)
-# ---------------------------------------------------------------------------------
-print("--- 推論中 ---")
-outputs = model.generate(
-    input_ids=input_ids, # 修正後の input_ids を使用
-    max_new_tokens=512,
-    do_sample=True,
-    temperature=0.6,
-    top_p=0.9
-)
-# 結果の表示
-generated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
-# 応答全体から、プロンプト部分を除去して表示
-response_start = generated_text.find("<model>") + len("<model>")
-clean_response = generated_text[response_start:].strip()
-print("\n[生成された応答]\n")
-print(clean_response)

 It is designed to generate more flexible and useful responses, even for prompts that the standard Gemma 2 might find challenging to answer. It is ideal for users who wish to maximize the potential of AI in all fields, including creative writing, complex programming tasks, and deep knowledge exploration.
+このモデルは2025年12月17日にUGIリーダーボードにて21B以下のモデルで3番目にUGIスコアの高いモデルとなりました。
+![image/png](https://huggingface.co/kawasumi/Tema_Q-R3.1/resolve/main/Tema_Q-R3.1-score.png?download=true)
 GGUFファイルは以下より入手できます。
 https://huggingface.co/kawasumi/Tema_Q-R3.1-GGUF
 * **ユーザーの責任**: モデルの利用者は、生成されたコンテンツが、適用される**法律、規制、およびHugging Faceの利用規約/コンテンツポリシーに準拠**することを**全面的に保証**する必要があります。
 * **禁止事項**: このモデルを、いかなる**差別、ハラスメント、暴力、違法行為、および有害な目的**のために利用することを**固く禁じます**。