Update README.md
#2 by haidlir · opened

README.md CHANGED
@@ -20,11 +20,19 @@ pipeline_tag: text-generation
 - https://huggingface.co/datasets/jakartaresearch/indoqa
 
 
-**Task**:
-
-
+**Task**:
+Chat or Conversational
+
+**Input**:
+User's prompt containing chat templated text in string format
+
+**Output**:
+Model's generated text in string format
 
 **Experiment**:
-- Use
--
--
+- Use bos_token and eos_token to replace <|im_start|> and <|im_end|> in ChatML. (Inspired by: https://asmirnov.xyz/doppelganger)
+- Use left padding and left truncation to conform to max_length.
+- Set max_length = 256 in the training process, which consumes 33.7 GB of memory.
+
+**Notebook**:
+- https://drive.google.com/file/d/11FiaWxGt2HxUirZrHTNLaVmiqrUwejwV/view?usp=drive_link
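The two preprocessing ideas in the experiment list above can be sketched as follows. This is an illustrative sketch only, not code from the linked notebook: the `<s>`/`</s>` token strings and both helper names are assumptions standing in for whatever bos_token/eos_token the base model actually uses.

```python
# Sketch of the experiment's preprocessing, under assumed tokens.
# BOS/EOS stand in for the model's real bos_token / eos_token, which
# replace ChatML's <|im_start|> / <|im_end|> markers.
BOS, EOS = "<s>", "</s>"  # assumption: actual tokens depend on the base model

def chatml_to_bos_eos(messages):
    """Render ChatML-style turns, substituting bos/eos for the ChatML markers."""
    parts = []
    for m in messages:
        # ChatML shape: <|im_start|>role\ncontent<|im_end|>\n
        parts.append(f"{BOS}{m['role']}\n{m['content']}{EOS}\n")
    return "".join(parts)

def left_truncate_and_pad(ids, max_length, pad_id=0):
    """Keep the most recent tokens (left truncation) and pad on the left,
    so the prompt always ends flush against the generation position."""
    ids = ids[-max_length:]                          # left truncation
    return [pad_id] * (max_length - len(ids)) + ids  # left padding

prompt = chatml_to_bos_eos([{"role": "user", "content": "Apa ibu kota Indonesia?"}])
print(prompt)
print(left_truncate_and_pad(list(range(10)), max_length=6))
```

With a Hugging Face tokenizer, the same behavior is normally configured via `tokenizer.padding_side = "left"` and `tokenizer.truncation_side = "left"` together with `max_length=256`, rather than hand-rolled as above.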