Oddsflow-team's picture
Upload folder using huggingface_hub
b5ff5d2 verified

Data Card — OddsFlow Transparency Pack

Purpose: This data card describes what appears in public logs/schemas in this repository.
Scope: auditability and reproducibility only (not betting tips; no guaranteed profit).

1) What is included

  • Signal log fields: timestamp, market, selection, line (AH/OU), odds snapshot label, model_version, schema_version, settlement fields (when available).
  • Schemas: JSON Schema definitions under datasets/schema/.
  • Samples: small illustrative JSONL examples under datasets/samples/.

2) What is NOT included

  • Proprietary model weights / private code
  • Private vendor contracts or non-public data feeds
  • User PII / account-level behavior

3) Data sources (high-level)

Describe at a high level:

  • Fixture identifiers (league/team naming rules)
  • Odds reference approach (e.g., “reference composite label”)
  • Any public sources used for matching/verification (if applicable)

4) Latency & timeliness

  • Typical expected delay ranges (best-effort)
  • How delayed/missing data is handled in logs (e.g., notes, status)

5) Quality checks

  • Schema validation rules
  • Duplicate detection (signal_id uniqueness)
  • Consistency checks (market ↔ line rules)
  • Timezone enforcement (ISO 8601 w/ timezone)

6) Known limitations

  • Coverage gaps (leagues/markets not covered)
  • Odds availability differences by region/book
  • “Closing line” availability constraints (if CLV is partial)

7) Change management

  • Versioning rules (model_version, schema_version)
  • Where changes are recorded (see /changelog/)

8) Contact

  • Canonical links: website verification hub + performance logs
  • Security reporting: see SECURITY.md