Files changed (1) hide show
  1. README.md +14 -12
README.md CHANGED
@@ -7,16 +7,15 @@ sdk: docker
7
  python_version: "3.11"
8
  app_port: 7860
9
  tags:
10
- - thousand-token-wood
11
- - nemotron
12
- - fine-tuned
13
- - custom-ui
14
- - tiny-titan
15
- - self-play
16
- - rl
17
- - fighting-game
18
- - modal
19
- pinned: false
20
  ---
21
 
22
  # Duel of Nemotron ⚔️ — Hybrid Self-Play AI Fighter
@@ -34,6 +33,9 @@ This is a tiny-model-implements-the-fast-loop + fine-tuned-LLM-sets-the-directio
34
  pattern: a small CPU policy network for real-time play, a larger fine-tuned
35
  model for strategic depth.
36
 
 
 
 
37
  ## How It Works
38
 
39
  ```
@@ -45,8 +47,8 @@ Browser (React + Three.js) ──fight state──▶ Space backend (HF Space
45
  │ └──▶ Modal Nemotron (A10, cold start)
46
  │ every ~10 moves:
47
  │ returns strategic weights
48
-
49
- └──────────────────weights + reasoning────────┘
50
  ```
51
 
52
  ### Training Pipeline (on Modal A100-40GB)
 
7
  python_version: "3.11"
8
  app_port: 7860
9
  tags:
10
+ - track:wood
11
+ - sponsor:nvidia
12
+ - sponsor:modal
13
+ - achievement:offgrid
14
+ - achievement:welltuned
15
+ - achievement:offbrand
16
+ - achievement:llama
17
+ - achievement:sharing
18
+ - achievement:fieldnotes
 
19
  ---
20
 
21
  # Duel of Nemotron ⚔️ — Hybrid Self-Play AI Fighter
 
33
  pattern: a small CPU policy network for real-time play, a larger fine-tuned
34
  model for strategic depth.
35
 
36
+ https://sankalphs.blogspot.com/2026/06/duel-of-albion.html
37
+ https://x.com/sankalphs/status/2066675345080852567
38
+
39
  ## How It Works
40
 
41
  ```
 
47
  │ └──▶ Modal Nemotron (A10, cold start)
48
  │ every ~10 moves:
49
  │ returns strategic weights
50
+
51
+ └──────────────────weights + reasoning────────-----------
52
  ```
53
 
54
  ### Training Pipeline (on Modal A100-40GB)