Bachstelze committed on
Commit 2f9d3e2 · 1 Parent(s): f5e4068

add baseline comparison

A5b/classification_bagging_trees.py CHANGED
@@ -372,3 +372,41 @@ if hasattr(best_model, 'feature_importances_'):
      print(f'\nTop 10 Most Important Features ({best_name}):')
      for i in range(min(10, len(feature_columns))):
          print(f' {i+1}. {feature_columns[indices[i]]}: {importances[indices[i]]:.4f}')
+
+
+ """
+ T-TEST: LightGBM (A5b classification_bagging_trees.py) vs Baseline Models
+ ======================================================================
+
+ LightGBM F1 per fold: [0.60119669 0.62564327 0.60350582 0.68353009 0.65164652]
+ LightGBM mean: 0.633104478
+
+ A4 Champion RF F1 per fold: [0.59125024 0.62187 0.56044242 0.65402408 0.60242416]
+ A4 Champion RF mean: 0.6060021800000001
+
+ Soft Voting F1 per fold: [0.61809316 0.63567163 0.61791823 0.69205568 0.65414995]
+ Soft Voting mean: 0.64357773
+
+ Test 1: LightGBM vs A4 Champion Random Forest
+ t-statistic: +2.0288
+ p-value: 0.1124
+ Significant at α=0.05: No
+
+ Test 2: LightGBM vs Soft Voting Ensemble
+ t-statistic: -2.8028
+ p-value: 0.0487
+ Significant at α=0.05: Yes
+
+ Test 3: Soft Voting vs A4 Champion Random Forest (for reference)
+ t-statistic: +3.1372
+ p-value: 0.0349
+ Significant at α=0.05: Yes
+
+ ======================================================================
+ SUMMARY
+ ======================================================================
+ The LightGBM model from classification_bagging_trees.py shows:
+ - Mean F1: 0.6331 +/- 0.0311
+ - Compared to Soft Voting (best baseline): t=-2.803, p=0.0487
+ - Statistically significant difference at α=0.05 (p < 0.05); Soft Voting scores higher
+ """
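The per-fold scores in the docstring above can be re-checked with a short standard-library sketch. It recomputes the LightGBM mean and spread and shows how a paired t statistic over folds is formed. The helper name `paired_t_statistic` is hypothetical, and the choice of a plain paired test is an assumption: the commit does not state which t-test variant produced the reported statistics, so the t values from this sketch need not match them.

```python
import math
import statistics

# Per-fold F1 scores copied from the docstring in this commit.
lightgbm_f1 = [0.60119669, 0.62564327, 0.60350582, 0.68353009, 0.65164652]
rf_f1 = [0.59125024, 0.62187, 0.56044242, 0.65402408, 0.60242416]
soft_voting_f1 = [0.61809316, 0.63567163, 0.61791823, 0.69205568, 0.65414995]


def paired_t_statistic(a, b):
    """Hypothetical helper: paired t statistic for two equal-length score lists."""
    diffs = [x - y for x, y in zip(a, b)]
    n = len(diffs)
    # Standard error uses the sample standard deviation (ddof=1) of the differences.
    std_err = statistics.stdev(diffs) / math.sqrt(n)
    return statistics.mean(diffs) / std_err


lightgbm_mean = statistics.mean(lightgbm_f1)
# Population standard deviation (ddof=0), the same convention as NumPy's default np.std.
lightgbm_std = statistics.pstdev(lightgbm_f1)
print(f"LightGBM mean F1: {lightgbm_mean:.4f} +/- {lightgbm_std:.4f}")

# Assumption: a plain paired test over the five folds (4 degrees of freedom).
t_vs_rf = paired_t_statistic(lightgbm_f1, rf_f1)
t_vs_voting = paired_t_statistic(lightgbm_f1, soft_voting_f1)
print(f"LightGBM vs A4 Champion RF: t = {t_vs_rf:+.4f}")
print(f"LightGBM vs Soft Voting:    t = {t_vs_voting:+.4f}")
```

The first printed line reproduces the summary's `0.6331 +/- 0.0311`, which indicates the reported spread is a population standard deviation over the five folds. The sign of each t statistic follows the order of the arguments, so a negative value for LightGBM vs Soft Voting means Soft Voting scored higher on average.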
A5b/models/bagging_trees_champion.pkl CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d2475e28a11e89c0c4544064525f6d41d7890e19c5549575ac597d64e076616e
- size 6506122
+ oid sha256:cc5ae990665718d18935022eadc5b8b85a9650948b02d8014aed24bb4b22b528
+ size 6506123
A5b/models/ensemble_classification_champion.pkl CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:93cecc19fe3e22c357af6ee6778990d7e3b518a36bbb6418a78ecb6795ef4cce
- size 30798315
+ oid sha256:bca3b9c29ceaa2623e96f782f2a15625d54fa044acfe128673b60f29f2b56025
+ size 30798304