AI & ML interests

None defined yet.

Recent Activity

kissin42  updated a model about 10 hours ago
ccnets/causal-gpt-rl
kissin42  updated a Space 2 days ago
ccnets/README
kissin42  published a model 2 days ago
ccnets/causal-gpt-rl
View all activity

Organization Card

Causal GPT-RL

GPT-style transformers (GPT-2, Llama) running as RL policies in continuous-control environments.

action → next state → next action      (RL rollouts)
token  → next token  → next token      (LLM generation)

Stable under self-generated rollouts — long-horizon control without the drift that has historically kept transformers from being usable as RL agents.

Get started

pip install "causal-gpt-rl[hub,mujoco]"
import gymnasium as gym
from causal_gpt_rl.inference import load_runner_from_hub, run_episodes

env = gym.make("Ant-v5")
runner = load_runner_from_hub(
    repo_id="ccnets/causal-gpt-rl",
    subfolder="ant-v5",
    device="cpu",
)
stats = run_episodes(env, runner, num_episodes=5, seed=0)

Available bundles: Ant-v5, HalfCheetah-v5, Walker2d-v5, Humanoid-v5

Released under PolyForm Noncommercial 1.0.0.

datasets 0

None public yet