Sync README: clarify autoregressive structure, move license to footer
Browse files
README.md
CHANGED
|
@@ -14,7 +14,7 @@ tags:
|
|
| 14 |
|
| 15 |
GPT-style transformers (GPT-2, Llama) running as RL policies in continuous-control environments.
|
| 16 |
|
| 17 |
-
|
| 18 |
|
| 19 |
```text
|
| 20 |
token → next token (LLM generation)
|
|
|
|
| 14 |
|
| 15 |
GPT-style transformers (GPT-2, Llama) running as RL policies in continuous-control environments.
|
| 16 |
|
| 17 |
+
Both LLM generation and RL interaction are autoregressive:
|
| 18 |
|
| 19 |
```text
|
| 20 |
token → next token (LLM generation)
|