Text Classification
Transformers
Safetensors
code
bert
Generated from Trainer
text-embeddings-inference
Instructions to use HuggingFaceTB/stack-edu-classifier-c with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use HuggingFaceTB/stack-edu-classifier-c with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="HuggingFaceTB/stack-edu-classifier-c")# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("HuggingFaceTB/stack-edu-classifier-c") model = AutoModelForSequenceClassification.from_pretrained("HuggingFaceTB/stack-edu-classifier-c") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -77,7 +77,15 @@ While the macro F1 scores across the 1-5 rating scale are relatively low due to
|
|
| 77 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/61c141342aac764ce1654e43/06siAXeIRjJGnlPehW562.png" width="600">
|
| 78 |
</div>
|
| 79 |
|
| 80 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 81 |
|
| 82 |
### Training hyperparameters
|
| 83 |
|
|
|
|
| 77 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/61c141342aac764ce1654e43/06siAXeIRjJGnlPehW562.png" width="600">
|
| 78 |
</div>
|
| 79 |
|
| 80 |
+
The table below shows Stack-Edu dataset statistics and MultiPL-E scores for the top 4 (in terms of size) programming languages. We use HumanEval for Python evaluation. For the ablation, we started from a mid-training checkpoint of SmolLM2 at 3T tokens which was trained primarily on web data, and perform linear annealing on 200B tokens, uniformly distributed across 15 of the most commonly used programming languages (~14B tokens each).
|
| 81 |
+
|
| 82 |
+
| Language | Before filtering (B tokens) | After filtering (B tokens) | MultiPL-E (Original → Filtered) |
|
| 83 |
+
|------------|-------------------------|---------------------|-------------------------------|
|
| 84 |
+
| Python | 50.6 | 21.8 | 20.7 → 25.6 |
|
| 85 |
+
| C++ | 69.7 | 16.0 | 16.7 → 24.8 |
|
| 86 |
+
| JavaScript | 45.3 | 11.1 | 18.2 → 22.4 |
|
| 87 |
+
| Java | 45.6 | 42.1 | 17.6 → 22.7 |
|
| 88 |
+
|
| 89 |
|
| 90 |
### Training hyperparameters
|
| 91 |
|