Instructions to use dipta007/decomposeRL-7b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use dipta007/decomposeRL-7b with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="dipta007/decomposeRL-7b")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("dipta007/decomposeRL-7b")
model = AutoModelForCausalLM.from_pretrained("dipta007/decomposeRL-7b")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use dipta007/decomposeRL-7b with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "dipta007/decomposeRL-7b"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "dipta007/decomposeRL-7b",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/dipta007/decomposeRL-7b

SGLang

How to use dipta007/decomposeRL-7b with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "dipta007/decomposeRL-7b" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "dipta007/decomposeRL-7b",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "dipta007/decomposeRL-7b" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "dipta007/decomposeRL-7b",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use dipta007/decomposeRL-7b with Docker Model Runner:
```
docker model run hf.co/dipta007/decomposeRL-7b
```

dipta007 commited on 8 days ago

Commit

df8336a

verified ·

1 Parent(s): 257b03c

Update README

Browse files

Files changed (1) hide show

README.md +45 -16

README.md CHANGED Viewed

@@ -85,6 +85,8 @@ GRPO is supervised with a sum of seven rewards, grouped into three families:
 ## Quickstart
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -97,16 +99,8 @@ model = AutoModelForCausalLM.from_pretrained(
     device_map="auto",
 )
-evidence_doc = (
-    "The Eiffel Tower is a wrought-iron lattice tower on the Champ de Mars in Paris, "
-    "France. It is named after the engineer Gustave Eiffel, whose company designed and "
-    "built the tower from 1887 to 1889. Locally nicknamed 'La dame de fer', it was "
-    "constructed as the centerpiece of the 1889 World's Fair. The tower is 330 metres "
-    "(1,083 ft) tall."
-)
-claim = "The Eiffel Tower was completed in 1887 and stands 330 metres tall."
-user_prompt = f"""You are tasked with systematically verifying the accuracy of a claim. You will be provided with a claim to verify and an evidence document to consult.
 Here is the evidence document you should consult:
@@ -130,13 +124,37 @@ Stop immediately after the closing </verification> tag.
 Begin your verification process now."""
-messages = [{"role": "user", "content": user_prompt}]
-text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
-inputs = tokenizer([text], return_tensors="pt").to(model.device)
-# max_new_tokens matches training-time max_completion_length
-out = model.generate(**inputs, max_new_tokens=4500, temperature=0.7, do_sample=True)
-response = tokenizer.decode(out[0][inputs.input_ids.shape[1]:], skip_special_tokens=True)
 print(response)
 ```
@@ -154,9 +172,20 @@ def parse_trace(text: str):
     return [(tag, body.strip()) for tag, body in TAG_RE.findall(text)]
 def pretty_print(text: str) -> None:
     cycle_idx = 0
     pending_q = None
-    for tag, body in parse_trace(text):
         if tag == "think":
             print("─" * 78)
             print("🧠  THINK")

 ## Quickstart
+DecomposeRL expects a specific verification prompt around your `claim` + `evidence_doc`. The `build_prompt` helper below wraps them for you so you don't have to construct the full instruction block every time.
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
     device_map="auto",
 )
+PROMPT_TEMPLATE = """You are tasked with systematically verifying the accuracy of a claim. You will be provided with a claim to verify and an evidence document to consult.
 Here is the evidence document you should consult:
 Begin your verification process now."""
+def build_prompt(claim: str, evidence_doc: str) -> str:
+    """Wrap a claim + evidence document in the DecomposeRL verification prompt."""
+    return PROMPT_TEMPLATE.format(claim=claim, evidence_doc=evidence_doc)
+def verify(claim: str, evidence_doc: str, max_new_tokens: int = 4500, temperature: float = 0.7) -> str:
+    """Run the model end-to-end on a (claim, evidence_doc) pair and return the raw trace."""
+    messages = [{"role": "user", "content": build_prompt(claim, evidence_doc)}]
+    text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+    inputs = tokenizer([text], return_tensors="pt").to(model.device)
+    out = model.generate(
+        **inputs,
+        max_new_tokens=max_new_tokens,  # matches training-time max_completion_length
+        temperature=temperature,
+        do_sample=True,
+    )
+    return tokenizer.decode(out[0][inputs.input_ids.shape[1]:], skip_special_tokens=True)
+# Usage
+evidence_doc = (
+    "The Eiffel Tower is a wrought-iron lattice tower on the Champ de Mars in Paris, "
+    "France. It is named after the engineer Gustave Eiffel, whose company designed and "
+    "built the tower from 1887 to 1889. Locally nicknamed 'La dame de fer', it was "
+    "constructed as the centerpiece of the 1889 World's Fair. The tower is 330 metres "
+    "(1,083 ft) tall."
+)
+claim = "The Eiffel Tower was completed in 1887 and stands 330 metres tall."
+response = verify(claim, evidence_doc)
 print(response)
 ```
     return [(tag, body.strip()) for tag, body in TAG_RE.findall(text)]
 def pretty_print(text: str) -> None:
+    parsed = parse_trace(text)
+    tags = {tag for tag, _ in parsed}
+    if not parsed or "verification" not in tags:
+        print("⚠️  Could not parse output into the expected "
+              "think/question/answer/verification structure.")
+        print("Raw output:")
+        print("─" * 78)
+        print(text)
+        print("─" * 78)
+        return
     cycle_idx = 0
     pending_q = None
+    for tag, body in parsed:
         if tag == "think":
             print("─" * 78)
             print("🧠  THINK")