roverdevkit / data /README.md
jjreif's picture
Deploy roverdevkit @ 2676a67
b3d14e3
|
Raw
History Blame Contribute Delete
2.56 kB

Data

Small, curated datasets and citations live here. Large generated datasets (LHS samples) are git-ignored β€” see .gitignore.

Rover data

Three consumers describe the same set of rovers for verification, each holding purpose-specific values, all reconciled against one canonical facts file:

  • rovers.yaml β€” canonical published-facts reference (single source of truth). Holds only published/citable facts (mass, wheels, grousers, landing latitude, traverse/peak-solar/thermal truth, ...) with per-field provenance (value + provenance ∈ {published, derived, imputed} + source). Loaded by roverdevkit/validation/rover_facts.py. It deliberately excludes modeling-derived quantities (chassis mass, torque anchor, panel efficiency, thermal architecture, scenario duty cycles) that legitimately differ per consumer. tests/test_rover_facts.py enforces that the consumers below agree with the published/derived facts here, so the sources cannot silently drift.
  • mass_validation_set.csv β€” published-rover mass and subsystem inputs used by roverdevkit/mass/validation.py to check the bottom-up mass model. Source/provenance details live in each row's citation and imputation_notes.
  • published_traverse_data.csv β€” flown-rover traverse, peak-solar, thermal, and mission-duration truth data used by roverdevkit/validation/rover_comparison.py. Source details live in each row's citation and notes.
  • roverdevkit/validation/rover_registry.py (code, not data) β€” executable design vectors + scenarios + thermal/panel architecture consumed by the evaluator, rediscovery, surrogate sanity check, and webapp.

Other files

  • soil_simulants.csv β€” Bekker parameters (n, k_c, k_phi, cohesion, friction angle) for common lunar soil simulants: FJS-1, JSC-1A, GRC-1, plus Apollo regolith estimates.
  • validation/ β€” single-wheel testbed data digitized from published papers (Ding 2011, Iizuka & Kubota 2011, Wong's datasets). Used as held-out data to sanity-check the evaluator β€” never used for training.
  • analytical/ β€” generated LHS samples from the analytical evaluator. Git-ignored except for schema documentation.

Citation discipline

Every curated data row must carry a citation or provenance note. Prefer the canonical rovers.yaml per-field provenance for published rover facts; use the dedicated citation column where present; otherwise document sources and imputations in notes / imputation_notes. If you can't cite it, don't fit on it.