"""Nygaard Code Comment Classification package.

This package provides utilities to download and clean datasets, extract
features, and train and evaluate models for code comment classification
across three languages (`java`, `python`, and `pharo`).

Available submodules
--------------------
- `config`: Project paths, constants, and logging setup.
- `dataset`: Dataset download and cleaning routines.
- `features`: Feature extraction (embeddings and labels) utilities.
- `plots`: Basic plotting CLI scaffold.
- `modeling.train`: Training entry points (CatBoost and SetFit).
- `modeling.predict`: Evaluation and MLflow logging utilities.
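
Examples
--------
The submodules listed above could be loaded on demand by name, as in the
minimal sketch below. The top-level import name is not stated in this
docstring, so `PACKAGE` is a hypothetical placeholder, and `qualified_name`
and `load_submodule` are illustrative helpers, not part of the package.

```python
import importlib

# Hypothetical import name; replace with the actual top-level package name.
PACKAGE = "nygaard"

# Submodule names exactly as listed under "Available submodules".
SUBMODULES = (
    "config",
    "dataset",
    "features",
    "plots",
    "modeling.train",
    "modeling.predict",
)


def qualified_name(submodule, package=PACKAGE):
    # Build the dotted import path, rejecting names that are not listed.
    if submodule not in SUBMODULES:
        raise ValueError(f"unknown submodule: {submodule!r}")
    return f"{package}.{submodule}"


def load_submodule(submodule, package=PACKAGE):
    # Import the submodule only when it is requested and return the module.
    return importlib.import_module(qualified_name(submodule, package))
```

Under these assumptions, `load_submodule("modeling.train")` would import the
training entry points without importing the plotting or evaluation code.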
"""