File size: 3,588 Bytes
ecfa30d
 
 
 
 
 
 
 
 
 
 
 
 
 
0057f06
ecfa30d
9f44c6b
84e9bed
9f44c6b
84e9bed
9f44c6b
84e9bed
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9f44c6b
84e9bed
 
9f44c6b
ecfa30d
9f44c6b
84e9bed
 
 
 
 
 
 
 
 
 
 
 
 
9f44c6b
ecfa30d
9f44c6b
ecfa30d
9f44c6b
84e9bed
 
 
 
9f44c6b
 
 
ecfa30d
9f44c6b
84e9bed
ecfa30d
 
aa0527a
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
---
license: apache-2.0
library_name: docker
tags:
  - finc2e
  - humai
  - governance
  - mlops
  - decision-intelligence
  - dual-metrics
  - docker
  - runtime
  - might-hub
---

# FinC2E Dual-Metrics Runtime (Preview) — v1.0.0

A lightweight, **container-first runtime** for FinC2E that introduces **Dual Metrics** as a foundation for governance-grade AI systems.

This image is part of the shared **Hugging Face → NVIDIA NGC** release flow and is designed to work alongside **FinC2E-Governance**.

---

## What is Dual Metrics?

**Dual Metrics** means measuring two dimensions simultaneously:

### 1) Runtime / System Metrics
- Service health
- Startup telemetry
- Adapter pull timing
- Deployment readiness

### 2) Decision-System Readiness
- Foundations for audit-friendly execution
- Governance-aligned decision trace hooks
- Separation of runtime logic and private model weights

This runtime does **not** expose private weights.  
It securely pulls them at startup using gated access.

---

## Hugging Face Repositories

- **Runtime (this repository):**  
  `MightHubHumAI/FinC2E_DualMetrics_Runtime`

- **Adapter (private):**  
  `MightHubHumAI/FinC2E_Llama33_70B_Adapter`

- **Model Preview:**  
  `MightHubHumAI/FinC2E`

- **Governance Space:**  
  `FinC2E-Governance` / `HumAI FinC2E HQ`

---

## Image Tags

- **Stable (immutable):** `1.0.0`
- **Moving:** `latest`

For reproducible tests and mirrors (NGC / Brev), use **`1.0.0`**.

---

## Pull & Run

### Pull image
```bash
docker pull registry.huggingface.co/MightHubHumAI/FinC2E_DualMetrics_Runtime:1.0.0
Run container
docker run --rm \
  -e HF_TOKEN=YOUR_HF_TOKEN \
  -e ADAPTER_REPO=MightHubHumAI/FinC2E_Llama33_70B_Adapter \
  -p 8000:8000 \
  registry.huggingface.co/MightHubHumAI/FinC2E_DualMetrics_Runtime:1.0.0


HF_TOKEN must have read access to the private adapter repository.

Endpoints
Health
curl http://localhost:8000/health


Example response:

{"status":"ok","service":"FinC2E Runtime"}

Dual Metrics
curl http://localhost:8000/metrics


Example response:

{
  "runtime": "FinC2E_DualMetrics_Runtime",
  "adapter_repo": "MightHubHumAI/FinC2E_Llama33_70B_Adapter",
  "download_seconds": 4.242,
  "timestamp": "2025-09-16T14:30:00Z"
}

Environment Variables

HF_TOKEN (required)
Hugging Face token with permission to access the private adapter repository.

ADAPTER_REPO (optional)
Defaults to MightHubHumAI/FinC2E_Llama33_70B_Adapter.

PORT (optional)
Defaults to 8000.

Security & IP Model

Adapter weights are private and never embedded in the image.

Runtime image is public, auditable, and mirrorable.

This separation enables:

controlled access

governance and audit readiness

Hugging Face → NVIDIA NGC parity

Brev and enterprise deployment flows

Release Notes — v1.0.0

First public runtime release.

Secure adapter pull via HF_TOKEN.

/health and /metrics endpoints available.

Immutable 1.0.0 tag + rolling latest.

HF → NGC → Brev Shared Release Discipline

Hugging Face tag 1.0.0 == NVIDIA NGC tag 1.0.0.

No breaking changes under the same tag.

All future changes increment semver (1.0.1, 1.1.0, 2.0.0).

This ensures deterministic behavior across registries and deployment platforms.

Roadmap

Upcoming iterations include:

Decision trace endpoints (JSONL, audit-ready).

Model routing and orchestration hooks (Galaxy container foundations).

NVIDIA NGC packaging and listing.

Brev deployment profiles for GPU-backed environments.

License

Apache-2.0 applies to runtime code and container packaging.
Model adapters and weights are governed separately under a private license.