### Schema

```json
{
  "verdict": "true | false | uncertain",
  "reason": "string",
  "confidence": 0.0,
  "evidence": ["string"],
  "assumptions": ["string"],
  "next_actions": ["string"]
}
```

## Rules

- `confidence` is a heuristic value between 0.0 and 1.0
- If information is missing, the verdict must be `uncertain`
- No text outside the JSON is expected when the wrapper is used
- Stop behavior is enforced by the Modelfile
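These rules can be checked mechanically on the client side. A minimal validation sketch; the helper name `validate_reply` and the exact error messages are illustrative, not part of the model package:

```python
import json

# Keys required by the schema above
REQUIRED_KEYS = {"verdict", "reason", "confidence",
                 "evidence", "assumptions", "next_actions"}

def validate_reply(raw: str) -> dict:
    """Parse a model reply and enforce the schema rules."""
    obj = json.loads(raw)  # raises if any text leaked outside the JSON object
    missing = REQUIRED_KEYS - obj.keys()
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")
    if obj["verdict"] not in ("true", "false", "uncertain"):
        raise ValueError(f"bad verdict: {obj['verdict']!r}")
    if not 0.0 <= obj["confidence"] <= 1.0:
        raise ValueError(f"confidence out of range: {obj['confidence']}")
    return obj
```

A reply that parses but violates a rule (for example, a missing key or an out-of-range `confidence`) raises `ValueError` rather than being silently accepted.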
| 97 |
+
## How to run with Ollama
|
| 98 |
Create the model locally:
|
| 99 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 100 |
|
| 101 |
+
ollama create logic-reasoner-v2 -f Modelfile
|

## Example request

```bash
curl http://localhost:11434/api/generate -d '{
  "model": "logic-reasoner-v2",
  "stream": false,
  "prompt": "Input: DCGM exporter reports 0 GPUs across all nodes. Question: Is the system healthy?"
}'
```
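The same request can be issued from Python using only the standard library. A minimal sketch, assuming the usual Ollama `/api/generate` response envelope in which the generated text arrives in the `response` field (so the model's JSON object must be parsed out of it):

```python
import json
import urllib.request

PAYLOAD = {
    "model": "logic-reasoner-v2",
    "stream": False,
    "prompt": ("Input: DCGM exporter reports 0 GPUs across all nodes. "
               "Question: Is the system healthy?"),
}

def parse_verdict(api_body: str) -> dict:
    """Extract the model's JSON object from an /api/generate response body."""
    envelope = json.loads(api_body)        # Ollama's JSON envelope
    return json.loads(envelope["response"])  # the model emits exactly one JSON object

def ask(url: str = "http://localhost:11434/api/generate") -> dict:
    req = urllib.request.Request(
        url,
        data=json.dumps(PAYLOAD).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return parse_verdict(resp.read().decode())
```

With `stream: false`, the envelope is a single JSON document, so a plain `json.loads` on the body is sufficient; streaming responses would instead arrive as one JSON object per line.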
+
## Quantization
|
| 111 |
+
Format: GGUF
|
| 112 |
|
| 113 |
+
Quantization: Q4_K_M
|
| 114 |
|
| 115 |
+
Optimized for low-latency operational inference
|
| 116 |
|

## Provenance

This model was built and packaged as part of the LLM FUN project on NVIDIA DGX B200 infrastructure using:

- Kubernetes (RKE2)
- Ollama
- OpenWebUI

The Modelfile is a core part of the model behavior and must be used to reproduce the intended output guarantees.