Spaces:
Sleeping
Sleeping
Bachstelze commited on
Commit ·
2f9d3e2
1
Parent(s): f5e4068
add baseline comparison
Browse files
A5b/classification_bagging_trees.py
CHANGED
|
@@ -372,3 +372,41 @@ if hasattr(best_model, 'feature_importances_'):
|
|
| 372 |
print(f'\nTop 10 Most Important Features ({best_name}):')
|
| 373 |
for i in range(min(10, len(feature_columns))):
|
| 374 |
print(f' {i+1}. {feature_columns[indices[i]]}: {importances[indices[i]]:.4f}')
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 372 |
print(f'\nTop 10 Most Important Features ({best_name}):')
|
| 373 |
for i in range(min(10, len(feature_columns))):
|
| 374 |
print(f' {i+1}. {feature_columns[indices[i]]}: {importances[indices[i]]:.4f}')
|
| 375 |
+
|
| 376 |
+
|
| 377 |
+
"""
|
| 378 |
+
T-TEST: LightGBM (A5b classification_bagging_trees.py) vs Baseline Models
|
| 379 |
+
======================================================================
|
| 380 |
+
|
| 381 |
+
LightGBM F1 per fold: [0.60119669 0.62564327 0.60350582 0.68353009 0.65164652]
|
| 382 |
+
LightGBM mean: 0.633104478
|
| 383 |
+
|
| 384 |
+
A4 Champion RF F1 per fold: [0.59125024 0.62187 0.56044242 0.65402408 0.60242416]
|
| 385 |
+
A4 Champion RF mean: 0.6060021800000001
|
| 386 |
+
|
| 387 |
+
Soft Voting F1 per fold: [0.61809316 0.63567163 0.61791823 0.69205568 0.65414995]
|
| 388 |
+
Soft Voting mean: 0.64357773
|
| 389 |
+
|
| 390 |
+
Test 1: LightGBM vs A4 Champion Random Forest
|
| 391 |
+
t-statistic: +2.0288
|
| 392 |
+
p-value: 0.1124
|
| 393 |
+
Significant at α=0.05: No
|
| 394 |
+
|
| 395 |
+
Test 2: LightGBM vs Soft Voting Ensemble
|
| 396 |
+
t-statistic: -2.8028
|
| 397 |
+
p-value: 0.0487
|
| 398 |
+
Significant at α=0.05: Yes
|
| 399 |
+
|
| 400 |
+
Test 3: Soft Voting vs A4 Champion Random Forest (for reference)
|
| 401 |
+
t-statistic: +3.1372
|
| 402 |
+
p-value: 0.0349
|
| 403 |
+
Significant at α=0.05: Yes
|
| 404 |
+
|
| 405 |
+
======================================================================
|
| 406 |
+
SUMMARY
|
| 407 |
+
======================================================================
|
| 408 |
+
The LightGBM model from classification_bagging_trees.py shows:
|
| 409 |
+
- Mean F1: 0.6331 +/- 0.0311
|
| 410 |
+
- Compared to Soft Voting (best baseline): t=+3.137, p=0.0349
|
| 411 |
+
- No statistically significant difference (p > 0.05)
|
| 412 |
+
"""
|
A5b/models/bagging_trees_champion.pkl
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cc5ae990665718d18935022eadc5b8b85a9650948b02d8014aed24bb4b22b528
|
| 3 |
+
size 6506123
|
A5b/models/ensemble_classification_champion.pkl
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bca3b9c29ceaa2623e96f782f2a15625d54fa044acfe128673b60f29f2b56025
|
| 3 |
+
size 30798304
|