Bturtel commited on
Commit
414d74b
·
verified ·
1 Parent(s): 24f5f37

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +21 -3
README.md CHANGED
@@ -9,15 +9,22 @@ pinned: false
9
 
10
  # Lightning Rod Labs
11
 
12
- **Generate training data from real-world sources.**
13
 
14
- Go from messy historical data to verified training datasets — no labeling or annotation needed.
15
- Define your criteria in Python, and our SDK treats real-world outcomes as the label, producing high-signal supervision at scale. Models learn causal factors, not just tokens. Raw data to deployable specialized models in hours.
16
 
17
  [Website](https://lightningrod.ai/) · [SDK](https://github.com/lightning-rod-labs/lightningrod-python-sdk) · [Blog](https://blog.lightningrod.ai/)
18
 
19
  ---
20
 
 
 
 
 
 
 
 
 
21
  ## Research & Results
22
 
23
  - **[SEC Risk Prediction](https://arxiv.org/abs/2601.19189)**: Foresight learning on raw SEC filings trains a 32B model to outperform GPT-5 at predicting public company risks.
@@ -26,3 +33,14 @@ Define your criteria in Python, and our SDK treats real-world outcomes as the la
26
  - **[Foresight-32B vs. Frontier LLMs](https://blog.lightningrod.ai/p/foresight-32b-beats-frontier-llms-on-live-polymarket-predictions)**: Live demonstration beating frontier models on Polymarket predictions.
27
 
28
  Foresight-32B is consistently top-ranked on [ForecastBench](https://www.forecastbench.org/tournament/) and [ProphetArena Sports](https://www.prophetarena.co/leaderboard).
 
 
 
 
 
 
 
 
 
 
 
 
9
 
10
  # Lightning Rod Labs
11
 
12
+ **Train with Timestamps, Not Labels.**
13
 
14
+ Lightning Rod Labs automatically generates high-quality training data from your documents or public sources — no labeling or extraction required. Define your criteria in Python, and our SDK treats real-world outcomes as the label, producing high-signal supervision at scale. Models learn causal factors, not just tokens. Raw data to deployable specialized models in hours.
 
15
 
16
  [Website](https://lightningrod.ai/) · [SDK](https://github.com/lightning-rod-labs/lightningrod-python-sdk) · [Blog](https://blog.lightningrod.ai/)
17
 
18
  ---
19
 
20
+ ## How It Works
21
+
22
+ We generate grounded, model-ready training data from documents or public sources (Google News, SEC filings, market data). You define your criteria in Python, and our SDK uses the **future as the label** — turning messy, timestamped history into training signal automatically. No labeling pipelines, no extraction, no human annotation.
23
+
24
+ This approach has been used to beat frontier AIs 100x larger on prediction-market benchmarks, and has demonstrated success in financial forecasting, risk estimation, and policy prediction.
25
+
26
+ ---
27
+
28
  ## Research & Results
29
 
30
  - **[SEC Risk Prediction](https://arxiv.org/abs/2601.19189)**: Foresight learning on raw SEC filings trains a 32B model to outperform GPT-5 at predicting public company risks.
 
33
  - **[Foresight-32B vs. Frontier LLMs](https://blog.lightningrod.ai/p/foresight-32b-beats-frontier-llms-on-live-polymarket-predictions)**: Live demonstration beating frontier models on Polymarket predictions.
34
 
35
  Foresight-32B is consistently top-ranked on [ForecastBench](https://www.forecastbench.org/tournament/) and [ProphetArena Sports](https://www.prophetarena.co/leaderboard).
36
+
37
+ ---
38
+
39
+ ## Models & Datasets
40
+
41
+ | Resource | Description |
42
+ |----------|-------------|
43
+ | [Trump-Forecaster](https://huggingface.co/LightningRodLabs/Trump-Forecaster) | RL-tuned gpt-oss-120b LoRA adapter for predicting Trump administration actions. Beats GPT-5 (Brier 0.194 vs 0.200). |
44
+ | [Golf-Forecaster](https://huggingface.co/LightningRodLabs/Golf-Forecaster) | RL-tuned gpt-oss-120b LoRA adapter for predicting professional golf outcomes. Beats GPT-5.1 (Brier 0.207 vs 0.218). |
45
+ | [WWTD-2025](https://huggingface.co/datasets/LightningRodLabs/WWTD-2025) | 2,790 binary forecasting questions about U.S. policy under the Trump administration, with news context and ground-truth resolutions. |
46
+ | [GolfForecasting](https://huggingface.co/datasets/LightningRodLabs/GolfForecasting) | 4,033 binary forecasting questions about professional golf across PGA Tour, LIV Golf, LPGA, and majors. |