girish00 commited on
Commit
007a7c7
·
verified ·
1 Parent(s): 6be41ff

add dedicated endpoint cloud mode

Browse files
Files changed (1) hide show
  1. IMPLEMENTATION.md +3 -0
IMPLEMENTATION.md CHANGED
@@ -57,6 +57,9 @@ Build and run a local fine-tuning pipeline for a coding assistant model with:
57
  - `python infer_cloud.py --repo-id your-username/your-model-name --prompt "Fix this code: def add(a,b) return a+b"`
58
  - If you already logged in with `hf auth login`, the saved token can be used without setting `HF_TOKEN`.
59
  - Add `--no-local-fallback` if you want the command to fail when HF cloud serving is unavailable.
 
 
 
60
 
61
  ## One-Command Execution
62
 
 
57
  - `python infer_cloud.py --repo-id your-username/your-model-name --prompt "Fix this code: def add(a,b) return a+b"`
58
  - If you already logged in with `hf auth login`, the saved token can be used without setting `HF_TOKEN`.
59
  - Add `--no-local-fallback` if you want the command to fail when HF cloud serving is unavailable.
60
+ - For true cloud execution, deploy a Hugging Face Dedicated Inference Endpoint and call:
61
+ - `python infer_cloud.py --endpoint-url "https://your-endpoint-url.endpoints.huggingface.cloud" --prompt "Fix this code: def add(a,b) return a+b" --no-local-fallback`
62
+ - Users should set their own token with `$env:HF_TOKEN="their_huggingface_token"` before calling the endpoint.
63
 
64
  ## One-Command Execution
65