sixf0ur commited on
Commit
6a25659
·
verified ·
1 Parent(s): 78b39ef

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -3
README.md CHANGED
@@ -1,3 +1,39 @@
1
- ---
2
- license: cc-by-4.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-4.0
3
+ datasets:
4
+ - sixf0ur/ScentSet
5
+ language:
6
+ - en
7
+ tags:
8
+ - chemistry
9
+ - biology
10
+ - climate
11
+ - medical
12
+ - text-generation-inference
13
+ ---
14
+
15
+ # ScentLLaMA
16
+
17
+ A tiny LLaMA-based language model with 600k parameters, pretrained specifically on the synthetic ScentSet dataset (572k entries, ~15M tokens).
18
+ Designed exclusively to describe and classify smells and aromas.
19
+
20
+ ## Model Details
21
+
22
+ - **Parameters:** ~600,000
23
+ - **Task:** Text generation of smell descriptions
24
+ - **Training data:** ScentSet (synthetic dataset of smell descriptions)
25
+ - **Training date:** July 2025
26
+ - **License:** CC BY 4.0
27
+
28
+
29
+ ### Citation
30
+ ```json
31
+ @misc{ScentLLaMA_2025,
32
+ author = {David S.},
33
+ title = {ScentLLaMA: A tiny LLaMA Model for Smell Description Generation},
34
+ year = {2025},
35
+ publisher = {Hugging Face Models},
36
+ howpublished = {\url{https://huggingface.co/sixf0ur/ScentLLaMA}},
37
+ note = {Pretrained on the ScentSet dataset to generate natural language descriptions of smells}
38
+ }
39
+ ```