megagonlabs
/

prompt-based-parsing-gemma-2-9b-lora-v1

@@ -29,7 +29,6 @@ This model is trained using the Universal Dependencies datasets over 7 languages
 ## Terms of Use
 This LoRA adapter package is released under the CC BY-SA 4.0.
 However, please note the following important conditions regarding its usage:
 - This package **does not contain any part of the original Gemma 2 model**.
 - In order to use this package, you must obtain and use the base model distributed from Google:
@@ -38,10 +37,8 @@ However, please note the following important conditions regarding its usage:
 利用規約 (Japanese version of the Terms of Use)
-このLoRAアダプタパッケージは、CC BY-SA 4.0に基づいてリリースされています。
-ただし、使用に関しては以下の重要な利用条件に注意してください。
 - このパッケージには**オリジナルのGemma 2モデルは含まれていません**
 - このパッケージを使用するには、Googleが配布するGemmaモデルを入手して使用する必要があります:
   [Gemma 2 9B base on Hugging Face](https://huggingface.co/google/gemma-2-9b)
@@ -55,8 +52,7 @@ pip install -U vllm==0.7.2 sudachipy sudachidict-core
 ```
 In this first release, we only provide code example using the [sudachipy](https://github.com/WorksApplications/SudachiPy) tokenizer, which matches the token boundaries of UD Japanese datasets.
-Code examples for other languages will be provided in upcoming releases.
 本リリースでは、UD Japanese データセットのトークン境界との親和性の高い[sudachipy](https://github.com/WorksApplications/SudachiPy)をトークナイザーに使用したサンプルコードのみを提供します。
 他の言語向けのサンプルコードは、今後のリリースで提供予定です。
@@ -268,8 +264,7 @@ for sentence, result in zip(input_sentences, results):
 ### Training Data and Hyper-parameters
-We used the train-sets of the UD datasets below for LoRA SFT.
 本モデルのLoRA SFTには次のUDデータセットのtrainセットを使用しました。
 - [UD_English-EWT](https://github.com/UniversalDependencies/UD_English-EWT) r2.15
 - [UD_Japanese-GSD](https://github.com/UniversalDependencies/UD_Japanese-GSD) r2.15
@@ -279,9 +274,8 @@ We used the train-sets of the UD datasets below for LoRA SFT.
 - [UD_German-GSD](https://github.com/UniversalDependencies/UD_German-GSD) r2.15
 - [UD_Slovenian-SSJ](https://github.com/UniversalDependencies/UD_Slovenian-SSJ) r2.15
-We also used the training hyper-parameters below:
-また訓練時には次のパイパーパラメータを使用しました。
 - lr: 5e-5
 - num_train_epochs: 2
 - lora_target_modules: "all-linear"
@@ -289,14 +283,12 @@ We also used the training hyper-parameters below:
 - lora_alpha: 8
 - lora_dropout: 0.05
-The details of the experimental conditions will be released later.
 実験条件の詳細については後日公開予定です。
 ### Evaluation Results
-The accuracies in the table below are based on the simple recovery process applied to the TSV output in Step 3.
 次の表に記載した精度は、Step 3のTSV出力に簡易なリカバリ処理を適用した上で評価を行っています。
 | dataset | UPOS | UAS | LAS |
 | ---- | ---- | ---- | ---- |
@@ -311,6 +303,7 @@ The accuracies in the table below are based on the simple recovery process appli
 ### Framework versions
 - TRL v0.15.2 (for training)
 - vLLM 0.7.2 (for inference)
 ## Citation

 ## Terms of Use
 This LoRA adapter package is released under the CC BY-SA 4.0.
 However, please note the following important conditions regarding its usage:
 - This package **does not contain any part of the original Gemma 2 model**.
 - In order to use this package, you must obtain and use the base model distributed from Google:
 利用規約 (Japanese version of the Terms of Use)
+このLoRAアダプタパッケージは、CC BY-SA 4.0に基づいてリリースされています。
+ただし、使用に関しては以下の重要な利用条件に注意してください。
 - このパッケージには**オリジナルのGemma 2モデルは含まれていません**
 - このパッケージを使用するには、Googleが配布するGemmaモデルを入手して使用する必要があります:
   [Gemma 2 9B base on Hugging Face](https://huggingface.co/google/gemma-2-9b)
 ```
 In this first release, we only provide code example using the [sudachipy](https://github.com/WorksApplications/SudachiPy) tokenizer, which matches the token boundaries of UD Japanese datasets.
+Code examples for other languages will be provided in upcoming releases.
 本リリースでは、UD Japanese データセットのトークン境界との親和性の高い[sudachipy](https://github.com/WorksApplications/SudachiPy)をトークナイザーに使用したサンプルコードのみを提供します。
 他の言語向けのサンプルコードは、今後のリリースで提供予定です。
 ### Training Data and Hyper-parameters
+We used the train-sets of the UD datasets below for LoRA SFT.
 本モデルのLoRA SFTには次のUDデータセットのtrainセットを使用しました。
 - [UD_English-EWT](https://github.com/UniversalDependencies/UD_English-EWT) r2.15
 - [UD_Japanese-GSD](https://github.com/UniversalDependencies/UD_Japanese-GSD) r2.15
 - [UD_German-GSD](https://github.com/UniversalDependencies/UD_German-GSD) r2.15
 - [UD_Slovenian-SSJ](https://github.com/UniversalDependencies/UD_Slovenian-SSJ) r2.15
+We also used the training hyper-parameters below:
+また訓練時には次のパイパーパラメータを使用しています。
 - lr: 5e-5
 - num_train_epochs: 2
 - lora_target_modules: "all-linear"
 - lora_alpha: 8
 - lora_dropout: 0.05
+The details of the experimental conditions will be released later.
 実験条件の詳細については後日公開予定です。
 ### Evaluation Results
+The accuracies in the table below are based on the simple recovery process applied to the TSV output in Step 3.
 次の表に記載した精度は、Step 3のTSV出力に簡易なリカバリ処理を適用した上で評価を行っています。
 | dataset | UPOS | UAS | LAS |
 | ---- | ---- | ---- | ---- |
 ### Framework versions
 - TRL v0.15.2 (for training)
+- PEFT v0.14.0 (for training)
 - vLLM 0.7.2 (for inference)
 ## Citation