Commit History

Honest test: AUC=0.9053 without position leak

61371b7
verified

siddhm11 committed on May 17

Honest test without position leak

e80cff2
verified

siddhm11 committed on May 17

Exa model eval metrics

33df4bd
verified

siddhm11 committed on May 17

Exa-distilled model: nDCG=0.9999, HN-AUC=0.9731, trained on 93K pairs

555d737
verified

siddhm11 committed on May 17

Final model metrics

11437ae
verified

siddhm11 committed on May 16

FINAL production model: intent-based labels, nDCG=0.8725, HN-AUC=0.8325

c8a8dc3
verified

siddhm11 committed on May 16

Add final extraction script (tested, works, needs stable compute for full run)

36ec200
verified

siddhm11 committed on May 16

Add V6 deployment guide: production-ready model, zero code changes needed

bcd1b92
verified

siddhm11 committed on May 16

Add V6 production results

6300f40
verified

siddhm11 committed on May 16

Add V6 PRODUCTION MODEL: 37-feature schema, drops into app, Hard Neg AUC=0.758

f94f80c
verified

siddhm11 committed on May 16

Add V6 eval results

73d7b62
verified

siddhm11 committed on May 15

Add V6: drop-in replacement, 37 features, cross-survey labels, HN-AUC=0.964

a3f9c6f
verified

siddhm11 committed on May 15

Update CHANGELOG: add V4 and V5 results. V5 achieves Hard Neg AUC=0.837

9856611
verified

siddhm11 committed on May 11

Add V5 eval metrics: nDCG=1.0, HN-AUC=0.837

245a85f
verified

siddhm11 committed on May 11

Add V5: graph+metadata, Hard Neg AUC=0.837 (+7.6% over V4)

a4dc0e5
verified

siddhm11 committed on May 11

Add V4 eval metrics

81f6108
verified

siddhm11 committed on May 10

Add V4 LightGBM: 25 graph features, hard_neg_auc=0.778 (+5.4% over V3)

4808c83
verified

siddhm11 committed on May 10

Update CHANGELOG: add V3 model with new eval framework

e0169cb
verified

siddhm11 committed on May 10

Add V3 eval metrics: nDCG@10=0.9494, hard_neg_auc=0.7380

6f159f9
verified

siddhm11 committed on May 10

Update ML Intern artifact metadata

d58f586
verified

siddhm11 committed on May 10

Add V3 LightGBM: trained on cross-survey authority labels, hard_neg_auc=0.738

247d191
verified

siddhm11 committed on May 10

Add eval v2 design document: explains why old eval was weak and how new one works

4c07339
verified

siddhm11 committed on May 10

Add eval v2: evaluation script with proper metrics for survey reading lists

38004cb
verified

siddhm11 committed on May 10

Add eval v2: extract survey paper reading lists from unarXive 2024

2628be1
verified

siddhm11 committed on May 10

v2 model: production_model/feature_importance_v2.csv

f2c3bd8
verified

siddhm11 committed on Apr 27

v2 model: production_model/eval_metrics_v2.json

7d4c686
verified

siddhm11 committed on Apr 27

v2 model: production_model/reranker_v2.txt

04eae30
verified

siddhm11 committed on Apr 27

add: quick loading snippet for Python users

fec1e1a
verified

siddhm11 committed on Apr 27

docs: add changelog

49c43e6
verified

siddhm11 committed on Apr 27

docs: add changelog tracking project history

bfb1eb8
verified

siddhm11 committed on Apr 27

docs: add detailed integration guide for Steps 5-8

aaf866d
verified

siddhm11 committed on Apr 27

docs: comprehensive README with production results and integration guide

0ef3e5f
verified

siddhm11 committed on Apr 27

Add feature_schema.json

d3097e4
verified

siddhm11 committed on Apr 26

Add baseline_comparison.json

3fe5813
verified

siddhm11 committed on Apr 26

Add feature_importance.csv

a9861f0
verified

siddhm11 committed on Apr 26

Add eval_metrics.json

c97aa65
verified

siddhm11 committed on Apr 26

Add reranker_v1.txt

5fe273b
verified

siddhm11 committed on Apr 26

Add comprehensive test suite

1199957
verified

siddhm11 committed on Apr 26

Add 03_train_lightgbm.py

41fda29
verified

siddhm11 committed on Apr 26

Add 02_generate_training_triples.py

616cfe6
verified

siddhm11 committed on Apr 26

Add 01_fetch_citation_edges.py

c82215c
verified

siddhm11 committed on Apr 26

Add synthetic model file

dc5b2e5
verified

siddhm11 committed on Apr 26

Add pipeline scripts, synthetic model, and test results

0e82b02
verified

siddhm11 committed on Apr 26

Add complete Phase 6 documentation

bca9583
verified

siddhm11 committed on Apr 26

initial commit

8dff027
verified

siddhm11 committed on Apr 26
