spankevich
/

llm-course-hw1

Text Generation

model_hub_mixin

pytorch_model_hub_mixin

Model card Files Files and versions

spankevich commited on Feb 18, 2025

Commit

7816bb6

·

verified ·

1 Parent(s): 2ac340d

Update README.md

Files changed (1) hide show

README.md +2 -6

README.md CHANGED Viewed

@@ -16,16 +16,12 @@ The training resulted in a validation cross-entropy loss of 1.300, while the tra
 Here are some examples of generated anecdotes starting with the prefix "Заходит":
-"Заходит как-то мужик в магазин. Видит - бармен, а вокруг него, снимает голову. - Ну, как ты думаешь, что ли? - Да нет, сын мой!"
-"Заходит в бар и говорит: — Девушка, а что это вы так много плохая? — А как же вы хотите, что вы не видите, что вы не знаете? — А какая разница? — Потому, что вы можете? — Подумайте, что этот фильм? — Да нет, но ведь этот факт, какой-то я не могу."
 Although the cross-entropy loss is relatively low (1.17, with a vocabulary size of 1024), the actual quality of the generated anecdotes is not very good. The generated text often lacks coherence and logical structure.
 Attached are the charts for quality, learning rate, and training epochs:
 ![output.png](https://cdn-uploads.huggingface.co/production/uploads/67b0bd703230f308b6a233c4/u-hUMHSc2dszOu6Xu7stZ.png)
-This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
-- Library: [More Information Needed]
-- Docs: [More Information Needed]

 Here are some examples of generated anecdotes starting with the prefix "Заходит":
+1. "Заходит как-то мужик в магазин. Видит - бармен, а вокруг него, снимает голову. - Ну, как ты думаешь, что ли? - Да нет, сын мой!"
+2. "Заходит в бар и говорит: — Девушка, а что это вы так много плохая? — А как же вы хотите, что вы не видите, что вы не знаете? — А какая разница? — Потому, что вы можете? — Подумайте, что этот фильм? — Да нет, но ведь этот факт, какой-то я не могу."
 Although the cross-entropy loss is relatively low (1.17, with a vocabulary size of 1024), the actual quality of the generated anecdotes is not very good. The generated text often lacks coherence and logical structure.
 Attached are the charts for quality, learning rate, and training epochs:
 ![output.png](https://cdn-uploads.huggingface.co/production/uploads/67b0bd703230f308b6a233c4/u-hUMHSc2dszOu6Xu7stZ.png)