Update README.md
README.md
@@ -31,7 +31,7 @@ Although llama.cpp can be used to reduce the size of the file with various quant
 - Top P (--top_p): Setting this value even lower narrows the range of words the model considers, so it generates more consistent text.
 - Number of words to generate (-n): Reducing this value shortens the text the model generates and prevents unnecessary additional text. -1 = infinite, -2 = until the context is filled.

-The following are the parameters recommended by the author of llama.cpp (ggerganov)
+The following are the [recommended parameters](https://huggingface.co/google/gemma-7b-it/discussions/38#65d7b14adb51f7c160769fa1) by the author of llama.cpp (ggerganov)
 - -e (escape newlines (\n))
 - --temp 0 (pick only the most probable token)
 - --repeat-penalty 1.0 (disable the repetition penalty; it is never a good idea to use one with instruction-tuned models)

@@ -42,7 +42,7 @@ Adjust the following parameters as needed
 - Top P (--top_p): Setting this value even lower will narrow the range of words considered by the model and produce more consistent text.
 - Number of words to generate (-n): Reducing this value will shorten the length of text generated by the model and prevent the generation of unnecessary additional text. -1 = infinity (default), -2 = until the context is filled.

-The following are the recommended parameters by the author of llama.cpp (ggerganov)
+The following are the [recommended parameters](https://huggingface.co/google/gemma-7b-it/discussions/38#65d7b14adb51f7c160769fa1) by the author of llama.cpp (ggerganov)
 - -e (escape newlines (\n))
 - --temp 0 (pick the most probable tokens)
 - --repeat-penalty 1.0 (disable the repetition penalty; it is never a good idea to use this with instruction-tuned models)
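The effect of lowering --top_p can be sketched as nucleus filtering: the sampler keeps only the smallest set of highest-probability tokens whose probabilities sum to at least top_p, so a lower value means a smaller candidate pool. The sketch below is illustrative only, with made-up token probabilities; it is not llama.cpp's actual sampler code.

```python
def top_p_candidates(probs, top_p):
    """Return the smallest set of tokens (most probable first) whose
    cumulative probability reaches top_p -- the pool the sampler draws from."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    kept, total = [], 0.0
    for token, p in ranked:
        kept.append(token)
        total += p
        if total >= top_p:
            break
    return kept

# A toy next-token distribution (hypothetical values):
probs = {"the": 0.5, "a": 0.3, "banana": 0.15, "quartz": 0.05}
print(top_p_candidates(probs, 0.9))  # ['the', 'a', 'banana']
print(top_p_candidates(probs, 0.5))  # ['the'] -- lower top_p keeps fewer words
```

With top_p = 0.5 only the single most probable word survives, which is why lower values yield more predictable, consistent text.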
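The two recommended settings can likewise be illustrated: --temp 0 degenerates into greedy (arg-max) decoding, and a repetition penalty of 1.0 leaves the logits untouched, i.e. the penalty is disabled. The CTRL-style divide/multiply rule and the toy logits below are assumptions for illustration, not llama.cpp's exact implementation.

```python
def apply_repeat_penalty(logits, recent_tokens, penalty):
    """CTRL-style repetition penalty: divide positive logits of recently
    seen tokens by `penalty`, multiply negative ones by it.
    With penalty == 1.0 the logits are unchanged (penalty disabled)."""
    out = dict(logits)
    for tok in recent_tokens:
        if tok in out:
            out[tok] = out[tok] / penalty if out[tok] > 0 else out[tok] * penalty
    return out

def greedy_pick(logits):
    """Temperature 0 degenerates to greedy decoding: always take the arg-max."""
    return max(logits, key=logits.get)

# Toy logits (hypothetical values):
logits = {"cat": 2.0, "dog": 1.5, "fish": -0.5}
print(apply_repeat_penalty(logits, ["cat"], 1.0) == logits)  # True: 1.0 = off
print(greedy_pick(logits))                                   # cat
print(greedy_pick(apply_repeat_penalty(logits, ["cat"], 1.5)))  # dog
```

This shows why combining the two is sensible for instruction-tuned models: greedy decoding already picks the model's intended token, and a penalty above 1.0 would push it away from legitimately repeated words.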