Upload 3 files
- docs/benchmark-overview.md +5 -0
- docs/index.md +3 -0
- docs/legal-reasoning-tests.md +5 -0
docs/benchmark-overview.md
ADDED
@@ -0,0 +1,5 @@
+# Benchmark Overview
+
+*Date: 2025-06-30*
+
+Here we describe our dataset of statutory audit scenarios, evaluation metrics, and baseline results…
docs/index.md
ADDED
@@ -0,0 +1,3 @@
+# Welcome to Our LLM Legal Benchmarks Blog
+
+This site hosts our ongoing series benchmarking the legal reasoning capabilities of modern LLMs.
docs/legal-reasoning-tests.md
ADDED
@@ -0,0 +1,5 @@
+# Legal Reasoning Tests
+
+*Date: 2025-07-01*
+
+Diving into clause-by-clause reasoning, prompt designs, and error analysis…