AfterRain007/cryptobertRefined

Text Classification

Sentiment Analysis


AfterRain007 committed on Feb 23, 2024

Commit 0e663aa · verified · 1 Parent(s): 44141a7

Update README.md

Files changed (1)

README.md +14 -1

@@ -12,4 +12,17 @@ tags:
 - RoBERTa
 - NLP
 - Cryptocurrency
----

+---
+# CryptoBERTRefined
+CryptoBERTRefined is a fine-tuned version of [CryptoBERT by ElKulako](https://huggingface.co/ElKulako/cryptobert) (see the base model card for its training corpus).
+# Training Process
+A total of 3,803 texts were labelled manually to fine-tune the model. The data was then augmented by back-translation through the Google Translate API, using 10 languages ('it', 'fr', 'sv', 'da', 'pt', 'id', 'pl', 'hr', 'bg', 'fi').
+# Training Corpus
+- Randomly picked texts from [Kaggle datasets](https://www.kaggle.com/datasets/kaushiksuresh147/bitcoin-tweets)
+- Labelled sentiment texts from [surgeAI](https://www.surgehq.ai/datasets/crypto-sentiment-dataset)
+# Source Code
+See [GitHub](https://github.com/AfterRain007/cryptobertRefined) for the source code used to fine-tune the CryptoBERT model into CryptoBERTRefined.
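
The back-translation augmentation described under Training Process could look roughly like the sketch below. It is not taken from the linked repository. The `translate` callable is a hypothetical stand-in for whatever Google Translate client is used, the source language `"en"` is an assumption, and dropping round trips that merely reproduce the original text is an added convenience rather than something the README states.

```python
from typing import Callable, List, Sequence

# Language codes listed in the README's Training Process section.
PIVOT_LANGUAGES = ["it", "fr", "sv", "da", "pt", "id", "pl", "hr", "bg", "fi"]

# Hypothetical signature: translate(text, src, dest) -> translated text.
Translator = Callable[[str, str, str], str]


def back_translate(
    text: str,
    translate: Translator,
    pivots: Sequence[str] = PIVOT_LANGUAGES,
    source: str = "en",  # assumption: the labelled texts are in English
) -> List[str]:
    """Return paraphrases of `text` obtained by translating it into each
    pivot language and back into the source language."""
    variants: List[str] = []
    # Track normalized texts so the original and repeated paraphrases are skipped.
    seen = {text.strip().lower()}
    for lang in pivots:
        pivot_text = translate(text, source, lang)
        round_trip = translate(pivot_text, lang, source)
        key = round_trip.strip().lower()
        if key and key not in seen:
            seen.add(key)
            variants.append(round_trip)
    return variants
```

Each returned paraphrase would keep the sentiment label of the text it was derived from, which is what makes back-translation usable as label-preserving augmentation.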