Azure AI Foundry Deployment

Deploy SAM3 to Azure AI Foundry (pending GPU quota).

Quick Deploy

./deployments/azure/deploy.sh

This will build and push the image to Azure Container Registry.

Configuration

Registry: sam3acr.azurecr.io
Image: sam3-foundry:latest
Endpoint: sam3-foundry (to be created)
Resource Group: productionline-test
Instance Type: Standard_NC6s_v3 (Tesla V100) or higher

Status

⏳ Pending GPU Quota Approval

Once GPU quota is approved, create the endpoint:

Create Endpoint (Azure Portal)

Navigate to Azure AI Foundry workspace
Go to Endpoints → Real-time endpoints
Click Create
Select Custom container
Image: sam3acr.azurecr.io/sam3-foundry:latest
Instance type: Standard_NC6s_v3 or higher
Deploy

Create Endpoint (Azure CLI)

# Create endpoint
az ml online-endpoint create \
  --name sam3-foundry \
  --resource-group productionline-test \
  --workspace-name <your-workspace>

# Create deployment
az ml online-deployment create \
  --name sam3-foundry-deployment \
  --endpoint sam3-foundry \
  --model-uri sam3acr.azurecr.io/sam3-foundry:latest \
  --instance-type Standard_NC6s_v3 \
  --instance-count 1

Testing

Once deployed, update the endpoint URL in the test script and run:

python3 scripts/test/test_api.py

For More Information

See docs/DEPLOYMENT.md for complete Azure AI Foundry deployment guide.