Update README.md
README.md
CHANGED
@@ -32,7 +32,7 @@ This approach has been used to beat frontier AIs 100x larger on prediction-marke
 - **[Outcome-based RL](https://arxiv.org/abs/2505.17989)** (TMLR): Using RL to improve LLM forecasting ability from real-world outcomes.
 - **[Foresight-32B vs. Frontier LLMs](https://blog.lightningrod.ai/p/foresight-32b-beats-frontier-llms-on-live-polymarket-predictions)**: Live demonstration beating frontier models on Polymarket predictions.
 
-Foresight-32B is consistently top-ranked on [ForecastBench](https://www.forecastbench.org/tournament/) and [ProphetArena
+Foresight-32B is consistently top-ranked on [ForecastBench](https://www.forecastbench.org/tournament/) and [ProphetArena](https://www.prophetarena.co/leaderboard), despite being 10x-100x smaller than frontier models.
 
 ---
 