Spaces:

DaCrow13
/

Hopcroft-Skill-Classification

Sleeping

maurocarlu commited on Jan 9

Commit

7284817

1 Parent(s): d08d694

updating the testing and validation doc

Files changed (1) hide show

docs/testing_and_validation.md CHANGED Viewed

@@ -22,9 +22,6 @@ Behavioral testing evaluates the model's capabilities and robustness beyond simp
 | **Directional Tests** | 10 | **Passed** | Verify that specific changes to the input cause expected changes in the output (e.g., adding specific keywords should increase probability of related skills). |
 | **Minimum Functionality Tests** | 17 | **Passed** | Check basic capabilities and sanity checks (e.g., simple inputs produce valid outputs). |
-### Technical Notes
-- **Training Tests Excluded:** `test_model_training.py` was excluded from the run due to a missing PyTorch dependency in the environment, but the inference tests cover the model's behavior fully.
-- **Robustness:** The model demonstrates excellent consistency across all 36 behavioral scenarios.
 ### How to Regenerate
 To run the behavioral tests and generate the JSON report:

 | **Directional Tests** | 10 | **Passed** | Verify that specific changes to the input cause expected changes in the output (e.g., adding specific keywords should increase probability of related skills). |
 | **Minimum Functionality Tests** | 17 | **Passed** | Check basic capabilities and sanity checks (e.g., simple inputs produce valid outputs). |
 ### How to Regenerate
 To run the behavioral tests and generate the JSON report: