modelling101
/

CodeBERT-SO

Text Classification

text-embeddings-inference

Model card Files Files and versions

modelling101 commited on May 27, 2024

Commit

06a164b

·

verified ·

1 Parent(s): 3552e06

Update README.md

Files changed (1) hide show

README.md +23 -3

README.md CHANGED Viewed

@@ -1,3 +1,23 @@
----
-license: cc-by-4.0
----

+---
+license: cc-by-4.0
+language:
+- en
+library_name: transformers
+pipeline_tag: text-classification
+tags:
+- code
+metrics:
+- accuracy
+- f1
+---
+# CodeBERT-SO
+Repository for CodeBERT, fine-tuned on Stack Overflow snippets with respect to NL-PL pairs of 6 languages (Python, Java, JavaScript, PHP, Ruby, Go).
+## Training Objective
+This model is initialized with [CodeBERT-base](https://huggingface.co/microsoft/codebert-base) and trained to classify whether a user will drop out given their posts and code snippets.
+## Training Regime
+Training was done across 8 epochs with a batch size of 8, learning rate of 1e-5, epsilon (weight update denominator) of 1e-8.
+A random 20% sample of the entire dataset was used as the validation set.
+## Performance
+* Final validation accuracy: 0.822
+* Final validation F1: 0.809
+* Final validation loss: 0.5