Spaces:

BuiMinh
/

Mit

Paused

App Files Files Community

DevLLM commited on Apr 1, 2025

Commit

3baea8e

1 Parent(s): b02e431

Add application file

Browse files

This view is limited to 50 files because it contains too many changes. See raw diff

Files changed (50) hide show

.dockerignore +12 -0
.env +197 -0
.env.ci +1 -0
.eslintignore +13 -0
.eslintrc.cjs +44 -0
.github/ISSUE_TEMPLATE/bug-report--chat-ui-.md +43 -0
.github/ISSUE_TEMPLATE/config-support.md +9 -0
.github/ISSUE_TEMPLATE/feature-request--chat-ui-.md +17 -0
.github/ISSUE_TEMPLATE/huggingchat.md +11 -0
.github/release.yml +16 -0
.github/workflows/build-docs.yml +18 -0
.github/workflows/build-image.yml +140 -0
.github/workflows/build-pr-docs.yml +20 -0
.github/workflows/deploy-prod.yml +79 -0
.github/workflows/lint-and-test.yml +52 -0
.github/workflows/trufflehog.yml +17 -0
.github/workflows/upload-pr-documentation.yml +16 -0
.gitignore +15 -0
.husky/lint-stage-config.js +4 -0
.husky/pre-commit +2 -0
.npmrc +1 -0
.prettierignore +14 -0
.prettierrc +7 -0
.vscode/launch.json +11 -0
.vscode/settings.json +11 -0
Dockerfile +95 -0
LICENSE +203 -0
PRIVACY.md +35 -0
PROMPTS.md +72 -0
chart/Chart.yaml +5 -0
chart/env/prod.yaml +677 -0
chart/templates/_helpers.tpl +22 -0
chart/templates/config.yaml +10 -0
chart/templates/deployment.yaml +81 -0
chart/templates/hpa.yaml +45 -0
chart/templates/infisical.yaml +24 -0
chart/templates/ingress.yaml +32 -0
chart/templates/network-policy.yaml +36 -0
chart/templates/service-account.yaml +13 -0
chart/templates/service-monitor.yaml +15 -0
chart/templates/service.yaml +21 -0
chart/values.yaml +67 -0
docs/source/_toctree.yml +64 -0
docs/source/configuration/common-issues.md +7 -0
docs/source/configuration/embeddings.md +105 -0
docs/source/configuration/metrics.md +9 -0
docs/source/configuration/models/multimodal.md +24 -0
docs/source/configuration/models/overview.md +147 -0
docs/source/configuration/models/providers/anthropic.md +117 -0
docs/source/configuration/models/providers/aws.md +35 -0

.dockerignore ADDED Viewed

	@@ -0,0 +1,12 @@

+Dockerfile
+.vscode/
+.idea
+.gitignore
+LICENSE
+README.md
+node_modules/
+.svelte-kit/
+.env*
+!.env
+.env.local
+db

.env ADDED Viewed

	@@ -0,0 +1,197 @@

+# Use .env.local to change these variables
+# DO NOT EDIT THIS FILE WITH SENSITIVE DATA
+### MongoDB ###
+MONGODB_URL=#your mongodb URL here, use chat-ui-db image if you don't want to set this
+MONGODB_DB_NAME=chat-ui
+MONGODB_DIRECT_CONNECTION=false
+### Endpoints config ###
+HF_API_ROOT=https://api-inference.huggingface.co/models
+# HF_TOKEN is used for a lot of things, not only for inference but also fetching tokenizers, etc.
+# We recommend using an HF_TOKEN even if you use a local endpoint.
+HF_TOKEN= #get it from https://huggingface.co/settings/token
+# API Keys for providers, you will need to specify models in the MODELS section but these keys can be kept secret
+OPENAI_API_KEY=#your openai api key here
+ANTHROPIC_API_KEY=#your anthropic api key here
+CLOUDFLARE_ACCOUNT_ID=#your cloudflare account id here
+CLOUDFLARE_API_TOKEN=#your cloudflare api token here
+COHERE_API_TOKEN=#your cohere api token here
+GOOGLE_GENAI_API_KEY=#your google genai api token here
+### Models ###
+## Models can support many different endpoints, check the documentation for more details
+MODELS=`[
+    {
+      "name": "NousResearch/Hermes-3-Llama-3.1-8B",
+      "description": "Nous Research's latest Hermes 3 release in 8B size.",
+      "promptExamples": [
+        {
+          "title": "Write an email from bullet list",
+          "prompt": "As a restaurant owner, write a professional email to the supplier to get these products every week: \n\n- Wine (x10)\n- Eggs (x24)\n- Bread (x12)"
+        }, {
+          "title": "Code a snake game",
+          "prompt": "Code a basic snake game in python, give explanations for each step."
+        }, {
+          "title": "Assist in a task",
+          "prompt": "How do I make a delicious lemon cheesecake?"
+        }
+      ]
+    }
+]`
+## Text Embedding Models used for websearch
+# Default is a model that runs locally on CPU.
+TEXT_EMBEDDING_MODELS = `[
+  {
+    "name": "Xenova/gte-small",
+    "displayName": "Xenova/gte-small",
+    "description": "Local embedding model running on the server.",
+    "chunkCharLength": 512,
+    "endpoints": [
+      { "type": "transformersjs" }
+    ]
+  }
+]`
+## Removed models, useful for migrating conversations
+# { name: string, displayName?: string, id?: string, transferTo?: string }`
+OLD_MODELS=`[]`
+## Task model
+# name of the model used for tasks such as summarizing title, creating query, etc.
+# if not set, the first model in MODELS will be used
+TASK_MODEL=
+### Authentication ###
+# Parameters to enable open id login
+OPENID_CONFIG=
+MESSAGES_BEFORE_LOGIN=# how many messages a user can send in a conversation before having to login. set to 0 to force login right away
+# if it's defined, only these emails will be allowed to use login
+ALLOWED_USER_EMAILS=`[]`
+# If it's defined, users with emails matching these domains will also be allowed to use login
+ALLOWED_USER_DOMAINS=`[]`
+# valid alternative redirect URLs for OAuth, used for HuggingChat apps
+ALTERNATIVE_REDIRECT_URLS=`[]`
+### Cookies
+# name of the cookie used to store the session
+COOKIE_NAME=hf-chat
+# specify secure behaviour for cookies
+COOKIE_SAMESITE=# can be "lax", "strict", "none" or left empty
+COOKIE_SECURE=# set to true to only allow cookies over https
+### Websearch ###
+## API Keys used to activate search with web functionality. websearch is disabled if none are defined. choose one of the following:
+YDC_API_KEY=#your docs.you.com api key here
+SERPER_API_KEY=#your serper.dev api key here
+SERPAPI_KEY=#your serpapi key here
+SERPSTACK_API_KEY=#your serpstack api key here
+SEARCHAPI_KEY=#your searchapi api key here
+USE_LOCAL_WEBSEARCH=#set to true to parse google results yourself, overrides other API keys
+SEARXNG_QUERY_URL=# where '<query>' will be replaced with query keywords see https://docs.searxng.org/dev/search_api.html eg https://searxng.yourdomain.com/search?q=<query>&engines=duckduckgo,google&format=json
+BING_SUBSCRIPTION_KEY=#your key
+## Websearch configuration
+PLAYWRIGHT_ADBLOCKER=true
+WEBSEARCH_ALLOWLIST=`[]` # if it's defined, allow websites from only this list.
+WEBSEARCH_BLOCKLIST=`[]` # if it's defined, block websites from this list.
+WEBSEARCH_JAVASCRIPT=true # CPU usage reduces by 60% on average by disabling javascript. Enable to improve website compatibility
+WEBSEARCH_TIMEOUT = 3500 # in milliseconds, determines how long to wait to load a page before timing out
+ENABLE_LOCAL_FETCH=false #set to true to allow fetches on the local network. /!\ Only enable this if you have the proper firewall rules to prevent SSRF attacks and understand the implications.
+## Public app configuration ##
+PUBLIC_APP_GUEST_MESSAGE=# a message to the guest user. If not set, no message will be shown. Only used if you have authentication enabled.
+PUBLIC_APP_NAME=Mit # name used as title throughout the app
+PUBLIC_APP_ASSETS=chatui # used to find logos & favicons in static/$PUBLIC_APP_ASSETS
+PUBLIC_APP_DESCRIPTION=# description used throughout the app
+PUBLIC_APP_DATA_SHARING=# Set to 1 to enable an option in the user settings to share conversations with model authors
+PUBLIC_APP_DISCLAIMER=# Set to 1 to show a disclaimer on login page
+PUBLIC_APP_DISCLAIMER_MESSAGE=# Message to show on the login page
+PUBLIC_ANNOUNCEMENT_BANNERS=`[
+]`
+PUBLIC_SMOOTH_UPDATES=false # set to true to enable smoothing of messages client-side, can be CPU intensive
+PUBLIC_ORIGIN=#https://huggingface.co
+PUBLIC_SHARE_PREFIX=#https://hf.co/chat
+# mostly huggingchat specific
+PUBLIC_GOOGLE_ANALYTICS_ID=#G-XXXXXXXX / Leave empty to disable
+PUBLIC_PLAUSIBLE_SCRIPT_URL=#/js/script.js / Leave empty to disable
+PUBLIC_APPLE_APP_ID=#1234567890 / Leave empty to disable
+### Feature Flags ###
+LLM_SUMMARIZATION=true # generate conversation titles with LLMs
+ENABLE_ASSISTANTS=false #set to true to enable assistants feature
+ENABLE_ASSISTANTS_RAG=false # /!\ This will let users specify arbitrary URLs that the server will then request. Make sure you have the proper firewall rules in place.
+REQUIRE_FEATURED_ASSISTANTS=false # require featured assistants to show in the list
+COMMUNITY_TOOLS=false # set to true to enable community tools
+ALLOW_IFRAME=true # Allow the app to be embedded in an iframe
+### Tools ###
+# Check out public config in `chart/env/prod.yaml` for more details
+TOOLS=`[]`
+### Rate limits ###
+# See `src/lib/server/usageLimits.ts`
+# {
+#   conversations: number, # how many conversations
+#   messages: number, # how many messages in a conversation
+#   assistants: number, # how many assistants
+#   messageLength: number, # how long can a message be before we cut it off
+#   messagesPerMinute: number, # how many messages per minute
+#   tools: number # how many tools
+# }
+USAGE_LIMITS=`{}`
+### HuggingFace specific ###
+# Let user authenticate with their HF token in the /api routes. This is only useful if you have OAuth configured with huggingface.
+USE_HF_TOKEN_IN_API=false
+## Feature flag & admin settings
+# Used for setting early access & admin flags to users
+HF_ORG_ADMIN=
+HF_ORG_EARLY_ACCESS=
+WEBHOOK_URL_REPORT_ASSISTANT=#provide slack webhook url to get notified for reports/feature requests
+### Metrics ###
+METRICS_ENABLED=false
+METRICS_PORT=5565
+LOG_LEVEL=info
+### Parquet export ###
+# Not in use anymore but useful to export conversations to a parquet file as a HuggingFace dataset
+PARQUET_EXPORT_DATASET=
+PARQUET_EXPORT_HF_TOKEN=
+ADMIN_API_SECRET=# secret to admin API calls, like computing usage stats or exporting parquet data
+### Docker build variables ###
+# These values cannot be updated at runtime
+# They need to be passed when building the docker image
+# See https://github.com/huggingface/chat-ui/main/.github/workflows/deploy-prod.yml#L44-L47
+APP_BASE="" # base path of the app, e.g. /chat, left blank as default
+PUBLIC_APP_COLOR=blue # can be any of tailwind colors: https://tailwindcss.com/docs/customizing-colors#default-color-palette
+### Body size limit for SvelteKit https://svelte.dev/docs/kit/adapter-node#Environment-variables-BODY_SIZE_LIMIT
+BODY_SIZE_LIMIT=15728640
+PUBLIC_COMMIT_SHA=
+### LEGACY parameters
+HF_ACCESS_TOKEN=#LEGACY! Use HF_TOKEN instead
+ALLOW_INSECURE_COOKIES=false # LEGACY! Use COOKIE_SECURE and COOKIE_SAMESITE instead
+PARQUET_EXPORT_SECRET=#DEPRECATED, use ADMIN_API_SECRET instead
+RATE_LIMIT= # /!\ DEPRECATED definition of messages per minute. Use USAGE_LIMITS.messagesPerMinute instead
+OPENID_CLIENT_ID=
+OPENID_CLIENT_SECRET=
+OPENID_SCOPES="openid profile" # Add "email" for some providers like Google that do not provide preferred_username
+OPENID_NAME_CLAIM="name" # Change to "username" for some providers that do not provide name
+OPENID_PROVIDER_URL=https://huggingface.co # for Google, use https://accounts.google.com
+OPENID_TOLERANCE=
+OPENID_RESOURCE=

.env.ci ADDED Viewed

	@@ -0,0 +1 @@


1	+ MONGODB_URL=mongodb://localhost:27017/

.eslintignore ADDED Viewed

	@@ -0,0 +1,13 @@

+.DS_Store
+node_modules
+/build
+/.svelte-kit
+/package
+.env
+.env.*
+!.env.example
+# Ignore files for PNPM, NPM and YARN
+pnpm-lock.yaml
+package-lock.json
+yarn.lock

.eslintrc.cjs ADDED Viewed

	@@ -0,0 +1,44 @@

+module.exports = {
+	root: true,
+	parser: "@typescript-eslint/parser",
+	extends: [
+		"eslint:recommended",
+		"plugin:@typescript-eslint/recommended",
+		"plugin:svelte/recommended",
+		"prettier",
+	],
+	plugins: ["@typescript-eslint"],
+	ignorePatterns: ["*.cjs"],
+	overrides: [
+		{
+			files: ["*.svelte"],
+			parser: "svelte-eslint-parser",
+			parserOptions: {
+				parser: "@typescript-eslint/parser",
+			},
+		},
+	],
+	parserOptions: {
+		sourceType: "module",
+		ecmaVersion: 2020,
+		extraFileExtensions: [".svelte"],
+	},
+	rules: {
+		"require-yield": "off",
+		"@typescript-eslint/no-explicit-any": "error",
+		"@typescript-eslint/no-non-null-assertion": "error",
+		"@typescript-eslint/no-unused-vars": [
+			// prevent variables with a _ prefix from being marked as unused
+			"error",
+			{
+				argsIgnorePattern: "^_",
+			},
+		],
+		"object-shorthand": ["error", "always"],
+	},
+	env: {
+		browser: true,
+		es2017: true,
+		node: true,
+	},
+};

.github/ISSUE_TEMPLATE/bug-report--chat-ui-.md ADDED Viewed

	@@ -0,0 +1,43 @@

+---
+name: Bug Report (chat-ui)
+about: Use this for confirmed issues with chat-ui
+title: ""
+labels: bug
+assignees: ""
+---
+## Bug description
+<!-- A clear and concise description of what the bug is. -->
+## Steps to reproduce
+<!-- Steps to reproduce the issue -->
+## Screenshots
+<!-- If applicable, add screenshots to help explain your problem. -->
+## Context
+### Logs
+<!-- Add any logs that are relevant to your issue. Could be browser or server logs. Wrap in code blocks. -->
+```
+// logs here if relevant
+```
+### Specs
+- **OS**:
+- **Browser**:
+- **chat-ui commit**:
+### Config
+<!-- Add the environment variables you've used to setup chat-ui, making sure to redact any secrets. -->
+## Notes
+<!-- Anything else relevant to help the issue get solved -->

.github/ISSUE_TEMPLATE/config-support.md ADDED Viewed

	@@ -0,0 +1,9 @@

+---
+name: Config Support
+about: Help with setting up chat-ui locally
+title: ""
+labels: support
+assignees: ""
+---
+**Please use the discussions on GitHub** for getting help with setting things up instead of opening an issue: https://github.com/huggingface/chat-ui/discussions

.github/ISSUE_TEMPLATE/feature-request--chat-ui-.md ADDED Viewed

	@@ -0,0 +1,17 @@

+---
+name: Feature Request (chat-ui)
+about: Suggest new features to be added to chat-ui
+title: ""
+labels: enhancement
+assignees: ""
+---
+## Describe your feature request
+<!-- Short description of what this is about -->
+## Screenshots (if relevant)
+## Implementation idea
+<!-- If you know how this should be implemented in the codebase, share your thoughts. Let us know if you feel like implementing it yourself as well! -->

.github/ISSUE_TEMPLATE/huggingchat.md ADDED Viewed

	@@ -0,0 +1,11 @@

+---
+name: HuggingChat
+about: Requests & reporting outages on HuggingChat, the hosted version of chat-ui.
+title: ""
+labels: huggingchat
+assignees: ""
+---
+**Do not use GitHub issues** for requesting models on HuggingChat or reporting issues with HuggingChat being down/overloaded.
+**Use the discussions page on the hub instead:** https://huggingface.co/spaces/huggingchat/chat-ui/discussions

.github/release.yml ADDED Viewed

	@@ -0,0 +1,16 @@

+changelog:
+  exclude:
+    labels:
+      - huggingchat
+      - CI/CD
+      - documentation
+  categories:
+    - title: Features
+      labels:
+        - enhancement
+    - title: Bugfixes
+      labels:
+        - bug
+    - title: Other changes
+      labels:
+        - "*"

.github/workflows/build-docs.yml ADDED Viewed

	@@ -0,0 +1,18 @@

+name: Build documentation
+on:
+  push:
+    branches:
+      - main
+      - v*-release
+jobs:
+  build:
+    uses: huggingface/doc-builder/.github/workflows/build_main_documentation.yml@main
+    with:
+      commit_sha: ${{ github.sha }}
+      package: chat-ui
+      additional_args: --not_python_module
+    secrets:
+      token: ${{ secrets.HUGGINGFACE_PUSH }}
+      hf_token: ${{ secrets.HF_DOC_BUILD_PUSH }}

.github/workflows/build-image.yml ADDED Viewed

	@@ -0,0 +1,140 @@

+name: Build and Publish Image
+permissions:
+  packages: write
+on:
+  push:
+    branches:
+      - "main"
+  pull_request:
+    branches:
+      - "*"
+    paths:
+      - "Dockerfile"
+      - "entrypoint.sh"
+  workflow_dispatch:
+  release:
+    types: [published, edited]
+jobs:
+  build-and-publish-image-with-db:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+      - name: Extract package version
+        id: package-version
+        run: |
+          VERSION=$(jq -r .version package.json)
+          echo "VERSION=$VERSION" >> $GITHUB_OUTPUT
+          MAJOR=$(echo $VERSION | cut -d '.' -f1)
+          echo "MAJOR=$MAJOR" >> $GITHUB_OUTPUT
+          MINOR=$(echo $VERSION | cut -d '.' -f1).$(echo $VERSION | cut -d '.' -f2)
+          echo "MINOR=$MINOR" >> $GITHUB_OUTPUT
+      - name: Docker metadata
+        id: meta
+        uses: docker/metadata-action@v5
+        with:
+          images: |
+            ghcr.io/huggingface/chat-ui-db
+          tags: |
+            type=raw,value=${{ steps.package-version.outputs.VERSION }},enable=${{github.event_name == 'release'}}
+            type=raw,value=${{ steps.package-version.outputs.MAJOR }},enable=${{github.event_name == 'release'}}
+            type=raw,value=${{ steps.package-version.outputs.MINOR }},enable=${{github.event_name == 'release'}}
+            type=raw,value=latest,enable={{is_default_branch}}
+            type=sha,enable={{is_default_branch}}
+      - name: Set up QEMU
+        uses: docker/setup-qemu-action@v3
+      - name: Set up Docker Buildx
+        uses: docker/setup-buildx-action@v3
+      - name: Login to GitHub Container Registry
+        if: github.event_name != 'pull_request'
+        uses: docker/login-action@v3
+        with:
+          registry: ghcr.io
+          username: ${{ github.repository_owner }}
+          password: ${{ secrets.GITHUB_TOKEN }}
+      - name: Inject slug/short variables
+        uses: rlespinasse/github-slug-action@v4.5.0
+      - name: Build and Publish Docker Image with DB
+        uses: docker/build-push-action@v5
+        with:
+          context: .
+          file: Dockerfile
+          push: ${{ github.event_name != 'pull_request' }}
+          tags: ${{ steps.meta.outputs.tags }}
+          labels: ${{ steps.meta.outputs.labels }}
+          platforms: linux/amd64,linux/arm64
+          cache-from: type=gha
+          cache-to: type=gha,mode=max
+          build-args: |
+            INCLUDE_DB=true
+            PUBLIC_COMMIT_SHA=${{ env.GITHUB_SHA_SHORT }}
+  build-and-publish-image-nodb:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+      - name: Extract package version
+        id: package-version
+        run: |
+          VERSION=$(jq -r .version package.json)
+          echo "VERSION=$VERSION" >> $GITHUB_OUTPUT
+          MAJOR=$(echo $VERSION | cut -d '.' -f1)
+          echo "MAJOR=$MAJOR" >> $GITHUB_OUTPUT
+          MINOR=$(echo $VERSION | cut -d '.' -f1).$(echo $VERSION | cut -d '.' -f2)
+          echo "MINOR=$MINOR" >> $GITHUB_OUTPUT
+      - name: Docker metadata
+        id: meta
+        uses: docker/metadata-action@v5
+        with:
+          images: |
+            ghcr.io/huggingface/chat-ui
+          tags: |
+            type=raw,value=${{ steps.package-version.outputs.VERSION }},enable=${{github.event_name == 'release'}}
+            type=raw,value=${{ steps.package-version.outputs.MAJOR }},enable=${{github.event_name == 'release'}}
+            type=raw,value=${{ steps.package-version.outputs.MINOR }},enable=${{github.event_name == 'release'}}
+            type=raw,value=latest,enable={{is_default_branch}}
+            type=sha,enable={{is_default_branch}}
+      - name: Set up QEMU
+        uses: docker/setup-qemu-action@v3
+      - name: Set up Docker Buildx
+        uses: docker/setup-buildx-action@v3
+      - name: Login to GitHub Container Registry
+        if: github.event_name != 'pull_request'
+        uses: docker/login-action@v3
+        with:
+          registry: ghcr.io
+          username: ${{ github.repository_owner }}
+          password: ${{ secrets.GITHUB_TOKEN }}
+      - name: Inject slug/short variables
+        uses: rlespinasse/github-slug-action@v4.5.0
+      - name: Build and Publish Docker Image without DB
+        uses: docker/build-push-action@v5
+        with:
+          context: .
+          file: Dockerfile
+          push: ${{ github.event_name != 'pull_request' }}
+          tags: ${{ steps.meta.outputs.tags }}
+          labels: ${{ steps.meta.outputs.labels }}
+          platforms: linux/amd64,linux/arm64
+          cache-from: type=gha
+          cache-to: type=gha,mode=max
+          build-args: |
+            INCLUDE_DB=false
+            PUBLIC_COMMIT_SHA=${{ env.GITHUB_SHA_SHORT }}

.github/workflows/build-pr-docs.yml ADDED Viewed

	@@ -0,0 +1,20 @@

+name: Build PR Documentation
+on:
+  pull_request:
+    paths:
+      - "docs/source/**"
+      - ".github/workflows/build-pr-docs.yml"
+concurrency:
+  group: ${{ github.workflow }}-${{ github.head_ref || github.run_id }}
+  cancel-in-progress: true
+jobs:
+  build:
+    uses: huggingface/doc-builder/.github/workflows/build_pr_documentation.yml@main
+    with:
+      commit_sha: ${{ github.event.pull_request.head.sha }}
+      pr_number: ${{ github.event.number }}
+      package: chat-ui
+      additional_args: --not_python_module

.github/workflows/deploy-prod.yml ADDED Viewed

	@@ -0,0 +1,79 @@

+name: Deploy to k8s
+on:
+  # run this workflow manually from the Actions tab
+  workflow_dispatch:
+jobs:
+  build-and-publish-huggingchat-image:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+      - name: Login to Registry
+        uses: docker/login-action@v3
+        with:
+          username: ${{ secrets.DOCKERHUB_USERNAME }}
+          password: ${{ secrets.DOCKERHUB_PASSWORD }}
+      - name: Docker metadata
+        id: meta
+        uses: docker/metadata-action@v5
+        with:
+          images: |
+            huggingface/chat-ui
+          tags: |
+            type=raw,value=latest,enable={{is_default_branch}}
+            type=sha,enable={{is_default_branch}}
+      - name: Set up Docker Buildx
+        uses: docker/setup-buildx-action@v3
+      - name: Inject slug/short variables
+        uses: rlespinasse/github-slug-action@v4.5.0
+      - name: Build and Publish HuggingChat image
+        uses: docker/build-push-action@v5
+        with:
+          context: .
+          file: Dockerfile
+          push: ${{ github.event_name != 'pull_request' }}
+          tags: ${{ steps.meta.outputs.tags }}
+          labels: ${{ steps.meta.outputs.labels }}
+          platforms: linux/amd64
+          cache-to: type=gha,mode=max,scope=amd64
+          cache-from: type=gha,scope=amd64
+          provenance: false
+          build-args: |
+            INCLUDE_DB=false
+            APP_BASE=/chat
+            PUBLIC_APP_COLOR=yellow
+            PUBLIC_COMMIT_SHA=${{ env.GITHUB_SHA_SHORT }}
+  deploy:
+    name: Deploy on prod
+    runs-on: ubuntu-latest
+    needs: ["build-and-publish-huggingchat-image"]
+    steps:
+      - name: Inject slug/short variables
+        uses: rlespinasse/github-slug-action@v4.5.0
+      - name: Gen values
+        run: |
+          VALUES=$(cat <<-END
+          image:
+            tag: "sha-${{ env.GITHUB_SHA_SHORT }}"
+          END
+          )
+          echo "VALUES=$(echo "$VALUES" | yq -o=json | jq tostring)" >> $GITHUB_ENV
+      - name: Deploy on infra-deployments
+        uses: aurelien-baudet/workflow-dispatch@v2
+        with:
+          workflow: Update application single value
+          repo: huggingface/infra-deployments
+          wait-for-completion: true
+          wait-for-completion-interval: 10s
+          display-workflow-run-url-interval: 10s
+          ref: refs/heads/main
+          token: ${{ secrets.GIT_TOKEN_INFRA_DEPLOYMENT }}
+          inputs: '{"path": "hub/chat-ui/chat-ui.yaml", "value": ${{ env.VALUES }}, "url": "${{ github.event.head_commit.url }}"}'

.github/workflows/lint-and-test.yml ADDED Viewed

	@@ -0,0 +1,52 @@

+name: Lint and test
+on:
+  pull_request:
+  push:
+    branches:
+      - main
+jobs:
+  lint:
+    runs-on: ubuntu-latest
+    timeout-minutes: 10
+    steps:
+      - uses: actions/checkout@v3
+      - uses: actions/setup-node@v3
+        with:
+          node-version: "20"
+          cache: "npm"
+      - run: |
+          npm install ci
+      - name: "Checking lint/format errors"
+        run: |
+          npm run lint
+      - name: "Checking type errors"
+        run: |
+          npm run check
+  test:
+    runs-on: ubuntu-latest
+    timeout-minutes: 10
+    steps:
+      - uses: actions/checkout@v3
+      - uses: actions/setup-node@v3
+        with:
+          node-version: "20"
+          cache: "npm"
+      - run: |
+          npm ci
+      - name: "Tests"
+        run: |
+          npm run test
+  build-check:
+    runs-on: ubuntu-latest
+    timeout-minutes: 10
+    steps:
+      - uses: actions/checkout@v3
+      - name: Build Docker image
+        run: docker build --secret id=DOTENV_LOCAL,src=.env.ci -t chat-ui:latest .

.github/workflows/trufflehog.yml ADDED Viewed

	@@ -0,0 +1,17 @@

+on:
+  push:
+name: Secret Leaks
+jobs:
+  trufflehog:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Checkout code
+        uses: actions/checkout@v4
+        with:
+          fetch-depth: 0
+      - name: Secret Scanning
+        uses: trufflesecurity/trufflehog@main
+        with:
+          extra_args: --results=verified,unknown

.github/workflows/upload-pr-documentation.yml ADDED Viewed

	@@ -0,0 +1,16 @@

+name: Upload PR Documentation
+on:
+  workflow_run:
+    workflows: ["Build PR Documentation"]
+    types:
+      - completed
+jobs:
+  build:
+    uses: huggingface/doc-builder/.github/workflows/upload_pr_documentation.yml@main
+    with:
+      package_name: chat-ui
+    secrets:
+      hf_token: ${{ secrets.HF_DOC_BUILD_PUSH }}
+      comment_bot_token: ${{ secrets.COMMENT_BOT_TOKEN }}

.gitignore ADDED Viewed

	@@ -0,0 +1,15 @@

+.DS_Store
+node_modules
+/build
+/.svelte-kit
+/package
+.env
+.env.*
+vite.config.js.timestamp-*
+vite.config.ts.timestamp-*
+SECRET_CONFIG
+.idea
+!.env.ci
+!.env
+gcp-*.json
+db

.husky/lint-stage-config.js ADDED Viewed

	@@ -0,0 +1,4 @@

+export default {
+	"*.{js,jsx,ts,tsx}": ["prettier --write", "eslint --fix", "eslint"],
+	"*.json": ["prettier --write"],
+};

.husky/pre-commit ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ set -e
2	+ npx lint-staged --config ./.husky/lint-stage-config.js

.npmrc ADDED Viewed

	@@ -0,0 +1 @@


1	+ engine-strict=true

.prettierignore ADDED Viewed

	@@ -0,0 +1,14 @@

+.DS_Store
+node_modules
+/build
+/.svelte-kit
+/package
+/chart
+.env
+.env.*
+!.env.example
+# Ignore files for PNPM, NPM and YARN
+pnpm-lock.yaml
+package-lock.json
+yarn.lock

.prettierrc ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+	"useTabs": true,
+	"trailingComma": "es5",
+	"printWidth": 100,
+	"plugins": ["prettier-plugin-svelte", "prettier-plugin-tailwindcss"],
+	"overrides": [{ "files": "*.svelte", "options": { "parser": "svelte" } }]
+}

.vscode/launch.json ADDED Viewed

	@@ -0,0 +1,11 @@

+{
+	"version": "0.2.0",
+	"configurations": [
+		{
+			"command": "npm run dev",
+			"name": "Run development server",
+			"request": "launch",
+			"type": "node-terminal"
+		}
+	]
+}

.vscode/settings.json ADDED Viewed

	@@ -0,0 +1,11 @@

+{
+	"editor.formatOnSave": true,
+	"editor.defaultFormatter": "esbenp.prettier-vscode",
+	"editor.codeActionsOnSave": {
+		"source.fixAll": "explicit"
+	},
+	"eslint.validate": ["javascript", "svelte"],
+	"[svelte]": {
+		"editor.defaultFormatter": "esbenp.prettier-vscode"
+	}
+}

Dockerfile ADDED Viewed

	@@ -0,0 +1,95 @@

+# syntax=docker/dockerfile:1
+ARG INCLUDE_DB=false
+FROM node:20-slim AS base
+ENV PLAYWRIGHT_SKIP_BROWSER_GC=1
+# install dotenv-cli
+RUN npm install -g dotenv-cli
+# switch to a user that works for spaces
+RUN userdel -r node
+RUN useradd -m -u 1000 user
+USER user
+ENV HOME=/home/user \
+	PATH=/home/user/.local/bin:$PATH
+WORKDIR /app
+# add a .env.local if the user doesn't bind a volume to it
+RUN touch /app/.env.local
+RUN npm i --no-package-lock --no-save playwright@1.47.0
+USER root
+RUN apt-get update
+RUN apt-get install gnupg curl -y
+RUN npx playwright install --with-deps chromium
+RUN chown -R 1000:1000 /home/user/.npm
+USER user
+COPY --chown=1000 .env /app/.env
+COPY --chown=1000 entrypoint.sh /app/entrypoint.sh
+COPY --chown=1000 gcp-*.json /app/
+COPY --chown=1000 package.json /app/package.json
+COPY --chown=1000 package-lock.json /app/package-lock.json
+RUN chmod +x /app/entrypoint.sh
+FROM node:20 AS builder
+WORKDIR /app
+COPY --link --chown=1000 package-lock.json package.json ./
+ARG APP_BASE=
+ARG PUBLIC_APP_COLOR=blue
+ENV BODY_SIZE_LIMIT=15728640
+RUN --mount=type=cache,target=/app/.npm \
+        npm set cache /app/.npm && \
+        npm ci
+COPY --link --chown=1000 . .
+RUN git config --global --add safe.directory /app && \
+    npm run build
+# mongo image
+FROM mongo:7 AS mongo
+# image to be used if INCLUDE_DB is false
+FROM base AS local_db_false
+# image to be used if INCLUDE_DB is true
+FROM base AS local_db_true
+# copy mongo from the other stage
+COPY --from=mongo /usr/bin/mongo* /usr/bin/
+ENV MONGODB_URL=mongodb://localhost:27017
+USER root
+RUN mkdir -p /data/db
+RUN chown -R 1000:1000 /data/db
+USER user
+# final image
+FROM local_db_${INCLUDE_DB} AS final
+# build arg to determine if the database should be included
+ARG INCLUDE_DB=false
+ENV INCLUDE_DB=${INCLUDE_DB}
+# svelte requires APP_BASE at build time so it must be passed as a build arg
+ARG APP_BASE=
+# tailwind requires the primary theme to be known at build time so it must be passed as a build arg
+ARG PUBLIC_APP_COLOR=blue
+ARG PUBLIC_COMMIT_SHA=
+ENV PUBLIC_COMMIT_SHA=${PUBLIC_COMMIT_SHA}
+ENV BODY_SIZE_LIMIT=15728640
+#import the build & dependencies
+COPY --from=builder --chown=1000 /app/build /app/build
+COPY --from=builder --chown=1000 /app/node_modules /app/node_modules
+CMD ["/bin/bash", "-c", "/app/entrypoint.sh"]

LICENSE ADDED Viewed

	@@ -0,0 +1,203 @@

+Copyright 2018- The Hugging Face team. All rights reserved.
+                                 Apache License
+                           Version 2.0, January 2004
+                        http://www.apache.org/licenses/
+   TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
+   1. Definitions.
+      "License" shall mean the terms and conditions for use, reproduction,
+      and distribution as defined by Sections 1 through 9 of this document.
+      "Licensor" shall mean the copyright owner or entity authorized by
+      the copyright owner that is granting the License.
+      "Legal Entity" shall mean the union of the acting entity and all
+      other entities that control, are controlled by, or are under common
+      control with that entity. For the purposes of this definition,
+      "control" means (i) the power, direct or indirect, to cause the
+      direction or management of such entity, whether by contract or
+      otherwise, or (ii) ownership of fifty percent (50%) or more of the
+      outstanding shares, or (iii) beneficial ownership of such entity.
+      "You" (or "Your") shall mean an individual or Legal Entity
+      exercising permissions granted by this License.
+      "Source" form shall mean the preferred form for making modifications,
+      including but not limited to software source code, documentation
+      source, and configuration files.
+      "Object" form shall mean any form resulting from mechanical
+      transformation or translation of a Source form, including but
+      not limited to compiled object code, generated documentation,
+      and conversions to other media types.
+      "Work" shall mean the work of authorship, whether in Source or
+      Object form, made available under the License, as indicated by a
+      copyright notice that is included in or attached to the work
+      (an example is provided in the Appendix below).
+      "Derivative Works" shall mean any work, whether in Source or Object
+      form, that is based on (or derived from) the Work and for which the
+      editorial revisions, annotations, elaborations, or other modifications
+      represent, as a whole, an original work of authorship. For the purposes
+      of this License, Derivative Works shall not include works that remain
+      separable from, or merely link (or bind by name) to the interfaces of,
+      the Work and Derivative Works thereof.
+      "Contribution" shall mean any work of authorship, including
+      the original version of the Work and any modifications or additions
+      to that Work or Derivative Works thereof, that is intentionally
+      submitted to Licensor for inclusion in the Work by the copyright owner
+      or by an individual or Legal Entity authorized to submit on behalf of
+      the copyright owner. For the purposes of this definition, "submitted"
+      means any form of electronic, verbal, or written communication sent
+      to the Licensor or its representatives, including but not limited to
+      communication on electronic mailing lists, source code control systems,
+      and issue tracking systems that are managed by, or on behalf of, the
+      Licensor for the purpose of discussing and improving the Work, but
+      excluding communication that is conspicuously marked or otherwise
+      designated in writing by the copyright owner as "Not a Contribution."
+      "Contributor" shall mean Licensor and any individual or Legal Entity
+      on behalf of whom a Contribution has been received by Licensor and
+      subsequently incorporated within the Work.
+   2. Grant of Copyright License. Subject to the terms and conditions of
+      this License, each Contributor hereby grants to You a perpetual,
+      worldwide, non-exclusive, no-charge, royalty-free, irrevocable
+      copyright license to reproduce, prepare Derivative Works of,
+      publicly display, publicly perform, sublicense, and distribute the
+      Work and such Derivative Works in Source or Object form.
+   3. Grant of Patent License. Subject to the terms and conditions of
+      this License, each Contributor hereby grants to You a perpetual,
+      worldwide, non-exclusive, no-charge, royalty-free, irrevocable
+      (except as stated in this section) patent license to make, have made,
+      use, offer to sell, sell, import, and otherwise transfer the Work,
+      where such license applies only to those patent claims licensable
+      by such Contributor that are necessarily infringed by their
+      Contribution(s) alone or by combination of their Contribution(s)
+      with the Work to which such Contribution(s) was submitted. If You
+      institute patent litigation against any entity (including a
+      cross-claim or counterclaim in a lawsuit) alleging that the Work
+      or a Contribution incorporated within the Work constitutes direct
+      or contributory patent infringement, then any patent licenses
+      granted to You under this License for that Work shall terminate
+      as of the date such litigation is filed.
+   4. Redistribution. You may reproduce and distribute copies of the
+      Work or Derivative Works thereof in any medium, with or without
+      modifications, and in Source or Object form, provided that You
+      meet the following conditions:
+      (a) You must give any other recipients of the Work or
+          Derivative Works a copy of this License; and
+      (b) You must cause any modified files to carry prominent notices
+          stating that You changed the files; and
+      (c) You must retain, in the Source form of any Derivative Works
+          that You distribute, all copyright, patent, trademark, and
+          attribution notices from the Source form of the Work,
+          excluding those notices that do not pertain to any part of
+          the Derivative Works; and
+      (d) If the Work includes a "NOTICE" text file as part of its
+          distribution, then any Derivative Works that You distribute must
+          include a readable copy of the attribution notices contained
+          within such NOTICE file, excluding those notices that do not
+          pertain to any part of the Derivative Works, in at least one
+          of the following places: within a NOTICE text file distributed
+          as part of the Derivative Works; within the Source form or
+          documentation, if provided along with the Derivative Works; or,
+          within a display generated by the Derivative Works, if and
+          wherever such third-party notices normally appear. The contents
+          of the NOTICE file are for informational purposes only and
+          do not modify the License. You may add Your own attribution
+          notices within Derivative Works that You distribute, alongside
+          or as an addendum to the NOTICE text from the Work, provided
+          that such additional attribution notices cannot be construed
+          as modifying the License.
+      You may add Your own copyright statement to Your modifications and
+      may provide additional or different license terms and conditions
+      for use, reproduction, or distribution of Your modifications, or
+      for any such Derivative Works as a whole, provided Your use,
+      reproduction, and distribution of the Work otherwise complies with
+      the conditions stated in this License.
+   5. Submission of Contributions. Unless You explicitly state otherwise,
+      any Contribution intentionally submitted for inclusion in the Work
+      by You to the Licensor shall be under the terms and conditions of
+      this License, without any additional terms or conditions.
+      Notwithstanding the above, nothing herein shall supersede or modify
+      the terms of any separate license agreement you may have executed
+      with Licensor regarding such Contributions.
+   6. Trademarks. This License does not grant permission to use the trade
+      names, trademarks, service marks, or product names of the Licensor,
+      except as required for reasonable and customary use in describing the
+      origin of the Work and reproducing the content of the NOTICE file.
+   7. Disclaimer of Warranty. Unless required by applicable law or
+      agreed to in writing, Licensor provides the Work (and each
+      Contributor provides its Contributions) on an "AS IS" BASIS,
+      WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
+      implied, including, without limitation, any warranties or conditions
+      of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
+      PARTICULAR PURPOSE. You are solely responsible for determining the
+      appropriateness of using or redistributing the Work and assume any
+      risks associated with Your exercise of permissions under this License.
+   8. Limitation of Liability. In no event and under no legal theory,
+      whether in tort (including negligence), contract, or otherwise,
+      unless required by applicable law (such as deliberate and grossly
+      negligent acts) or agreed to in writing, shall any Contributor be
+      liable to You for damages, including any direct, indirect, special,
+      incidental, or consequential damages of any character arising as a
+      result of this License or out of the use or inability to use the
+      Work (including but not limited to damages for loss of goodwill,
+      work stoppage, computer failure or malfunction, or any and all
+      other commercial damages or losses), even if such Contributor
+      has been advised of the possibility of such damages.
+   9. Accepting Warranty or Additional Liability. While redistributing
+      the Work or Derivative Works thereof, You may choose to offer,
+      and charge a fee for, acceptance of support, warranty, indemnity,
+      or other liability obligations and/or rights consistent with this
+      License. However, in accepting such obligations, You may act only
+      on Your own behalf and on Your sole responsibility, not on behalf
+      of any other Contributor, and only if You agree to indemnify,
+      defend, and hold each Contributor harmless for any liability
+      incurred by, or claims asserted against, such Contributor by reason
+      of your accepting any such warranty or additional liability.
+   END OF TERMS AND CONDITIONS
+   APPENDIX: How to apply the Apache License to your work.
+      To apply the Apache License to your work, attach the following
+      boilerplate notice, with the fields enclosed by brackets "[]"
+      replaced with your own identifying information. (Don't include
+      the brackets!)  The text should be enclosed in the appropriate
+      comment syntax for the file format. We also recommend that a
+      file or class name and description of purpose be included on the
+      same "printed page" as the copyright notice for easier
+      identification within third-party archives.
+   Copyright [yyyy] [name of copyright owner]
+   Licensed under the Apache License, Version 2.0 (the "License");
+   you may not use this file except in compliance with the License.
+   You may obtain a copy of the License at
+       http://www.apache.org/licenses/LICENSE-2.0
+   Unless required by applicable law or agreed to in writing, software
+   distributed under the License is distributed on an "AS IS" BASIS,
+   WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+   See the License for the specific language governing permissions and
+   limitations under the License.

PRIVACY.md ADDED Viewed

	@@ -0,0 +1,35 @@

+## Privacy
+> Last updated: Feb 14, 2025
+Users of HuggingChat are authenticated through their HF user account.
+We endorse Privacy by Design. As such, your conversations are private to you and will not be shared with anyone, including model authors, for any purpose, including for research or model training purposes.
+You conversation data will only be stored to let you access past conversations. You can click on the Delete icon to delete any past conversation at any moment.
+🗓 Please also consult huggingface.co's main privacy policy at <https://huggingface.co/privacy>. To exercise any of your legal privacy rights, please send an email to <privacy@huggingface.co>.
+## About available LLMs
+The goal of this app is to showcase that it is now possible to build an open source alternative to ChatGPT. 💪
+We aim to always provide a diverse set of state of the art open LLMs, hence we rotate the available models over time. Discuss available models and request new ones on the [models discussion page](https://huggingface.co/spaces/huggingchat/chat-ui/discussions/372).
+Check the [models](https://huggingface.co/chat/models/) page for an up-to-date list of the best available LLMs.
+## Technical details
+[![chat-ui](https://img.shields.io/github/stars/huggingface/chat-ui)](https://github.com/huggingface/chat-ui)
+The app is completely open source, and further development takes place on the [huggingface/chat-ui](https://github.com/huggingface/chat-ui) GitHub repo. We're always open to contributions!
+You can find the production configuration for HuggingChat [here](https://github.com/huggingface/chat-ui/blob/main/chart/env/prod.yaml).
+The inference backend is running the optimized [text-generation-inference](https://github.com/huggingface/text-generation-inference) on HuggingFace's Inference API infrastructure.
+It is possible to deploy a copy of this app to a Space and customize it (swap model, add some UI elements, or store user messages according to your own Terms and conditions). You can also 1-click deploy your own instance using the [Chat UI Spaces Docker template](https://huggingface.co/new-space?template=huggingchat/chat-ui-template).
+We welcome any feedback on this app: please participate to the public discussion at <https://huggingface.co/spaces/huggingchat/chat-ui/discussions>
+<a target="_blank" href="https://huggingface.co/spaces/huggingchat/chat-ui/discussions"><img src="https://huggingface.co/datasets/huggingface/badges/raw/main/open-a-discussion-xl.svg" title="open a discussion"></a>

PROMPTS.md ADDED Viewed

	@@ -0,0 +1,72 @@

+# Prompt templates
+> [!WARNING]
+> We now recommend using the `tokenizer` field to get the chat template directly from the hub. Just set it to your model id on the hub to automatically get the template.
+These are the templates used to format the conversation history for different models used in HuggingChat. Set them in your `.env.local` [like so](https://github.com/huggingface/chat-ui#chatprompttemplate).
+## Llama 2
+```env
+<s>[INST] <<SYS>>\n{{preprompt}}\n<</SYS>>\n\n{{#each messages}}{{#ifUser}}{{content}} [/INST] {{/ifUser}}{{#ifAssistant}}{{content}} </s><s>[INST] {{/ifAssistant}}{{/each}}
+```
+## CodeLlama
+```env
+<s>[INST] <<SYS>>\n{{preprompt}}\n<</SYS>>\n\n{{#each messages}}{{#ifUser}}{{content}} [/INST] {{/ifUser}}{{#ifAssistant}}{{content}} </s><s>[INST] {{/ifAssistant}}{{/each}}
+```
+## Falcon
+```env
+System: {{preprompt}}\nUser:{{#each messages}}{{#ifUser}}{{content}}\nFalcon:{{/ifUser}}{{#ifAssistant}}{{content}}\nUser:{{/ifAssistant}}{{/each}}
+```
+## Mistral
+```env
+<s>{{#each messages}}{{#ifUser}}[INST] {{#if @first}}{{#if @root.preprompt}}{{@root.preprompt}}\n{{/if}}{{/if}} {{content}} [/INST]{{/ifUser}}{{#ifAssistant}}{{content}}</s> {{/ifAssistant}}{{/each}}
+```
+## Zephyr
+```env
+<|system|>\n{{preprompt}}</s>\n{{#each messages}}{{#ifUser}}<|user|>\n{{content}}</s>\n<|assistant|>\n{{/ifUser}}{{#ifAssistant}}{{content}}</s>\n{{/ifAssistant}}{{/each}}
+```
+## IDEFICS
+```env
+{{#each messages}}{{#ifUser}}User: {{content}}{{/ifUser}}<end_of_utterance>\nAssistant: {{#ifAssistant}}{{content}}\n{{/ifAssistant}}{{/each}}
+```
+## OpenChat
+```env
+<s>{{#each messages}}{{#ifUser}}GPT4 User: {{#if @first}}{{#if @root.preprompt}}{{@root.preprompt}}\n{{/if}}{{/if}}{{content}}<|end_of_turn|>GPT4 Assistant: {{/ifUser}}{{#ifAssistant}}{{content}}<|end_of_turn|>{{/ifAssistant}}{{/each}}
+```
+## Mixtral
+```env
+<s> {{#each messages}}{{#ifUser}}[INST]{{#if @first}}{{#if @root.preprompt}}{{@root.preprompt}}\n{{/if}}{{/if}} {{content}} [/INST]{{/ifUser}}{{#ifAssistant}} {{content}}</s> {{/ifAssistant}}{{/each}}
+```
+## ChatML
+```env
+{{#if @root.preprompt}}<|im_start|>system\n{{@root.preprompt}}<|im_end|>\n{{/if}}{{#each messages}}{{#ifUser}}<|im_start|>user\n{{content}}<|im_end|>\n<|im_start|>assistant\n{{/ifUser}}{{#ifAssistant}}{{content}}<|im_end|>\n{{/ifAssistant}}{{/each}}
+```
+## CodeLlama 70B
+```env
+<s>{{#if @root.preprompt}}Source: system\n\n {{@root.preprompt}} <step> {{/if}}{{#each messages}}{{#ifUser}}Source: user\n\n {{content}} <step> {{/ifUser}}{{#ifAssistant}}Source: assistant\n\n {{content}} <step> {{/ifAssistant}}{{/each}}Source: assistant\nDestination: user\n\n ``
+```
+## Gemma
+```env
+{{#each messages}}{{#ifUser}}<start_of_turn>user\n{{#if @first}}{{#if @root.preprompt}}{{@root.preprompt}}\n{{/if}}{{/if}}{{content}}<end_of_turn>\n<start_of_turn>model\n{{/ifUser}}{{#ifAssistant}}{{content}}<end_of_turn>\n{{/ifAssistant}}{{/each}}
+```

chart/Chart.yaml ADDED Viewed

	@@ -0,0 +1,5 @@

+apiVersion: v2
+name: chat-ui
+version: 0.0.1-latest
+type: application
+icon: https://huggingface.co/front/assets/huggingface_logo-noborder.svg

chart/env/prod.yaml ADDED Viewed

	@@ -0,0 +1,677 @@

+image:
+  repository: huggingface
+  name: chat-ui
+nodeSelector:
+  role-huggingchat: "true"
+tolerations:
+  - key: "huggingface.co/huggingchat"
+    operator: "Equal"
+    value: "true"
+    effect: "NoSchedule"
+serviceAccount:
+  enabled: true
+  create: true
+  name: huggingchat-prod
+ingress:
+  path: "/chat"
+  annotations:
+    alb.ingress.kubernetes.io/healthcheck-path: "/healthcheck"
+    alb.ingress.kubernetes.io/listen-ports: "[{\"HTTP\": 80}, {\"HTTPS\": 443}]"
+    alb.ingress.kubernetes.io/group.name: "hub-prod"
+    alb.ingress.kubernetes.io/scheme: "internet-facing"
+    alb.ingress.kubernetes.io/ssl-redirect: "443"
+    alb.ingress.kubernetes.io/tags: "Env=prod,Project=hub,Terraform=true"
+    alb.ingress.kubernetes.io/target-node-labels: "role-hub-utils=true"
+    kubernetes.io/ingress.class: "alb"
+envVars:
+  ADDRESS_HEADER: 'X-Forwarded-For'
+  ALTERNATIVE_REDIRECT_URLS: '["huggingchat://login/callback"]'
+  APP_BASE: "/chat"
+  ALLOW_IFRAME: "false"
+  COMMUNITY_TOOLS: "true"
+  COOKIE_SAMESITE: "lax"
+  COOKIE_SECURE: "true"
+  ENABLE_ASSISTANTS: "true"
+  ENABLE_ASSISTANTS_RAG: "true"
+  METRICS_PORT: 5565
+  LOG_LEVEL: "debug"
+  METRICS_ENABLED: "true"
+  MODELS: >
+    [
+      {
+        "name": "meta-llama/Llama-3.3-70B-Instruct",
+        "id": "meta-llama/Llama-3.3-70B-Instruct",
+        "description": "Ideal for everyday use. A fast and extremely capable model matching closed source models' capabilities. Now with the latest Llama 3.3 weights!",
+        "modelUrl": "https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct",
+        "websiteUrl": "https://llama.meta.com/",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/meta-logo.png",
+        "tools": true,
+        "preprompt": "",
+        "parameters": {
+          "stop": ["<|endoftext|>", "<|eot_id|>"],
+          "temperature": 0.6,
+          "max_new_tokens": 1024,
+          "truncate": 7167
+        },
+        "promptExamples": [
+          {
+            "title": "Write an email from bullet list",
+            "prompt": "As a restaurant owner, write a professional email to the supplier to get these products every week: \n\n- Wine (x10)\n- Eggs (x24)\n- Bread (x12)"
+          },
+          {
+            "title": "Code a snake game",
+            "prompt": "Code a basic snake game in python, give explanations for each step."
+          },
+          {
+            "title": "Assist in a task",
+            "prompt": "How do I make a delicious lemon cheesecake?"
+          }
+        ]
+      },
+      {
+        "name": "Qwen/Qwen2.5-72B-Instruct",
+        "description": "The latest Qwen open model with improved role-playing, long text generation and structured data understanding.",
+        "modelUrl": "https://huggingface.co/Qwen/Qwen2.5-72B-Instruct",
+        "websiteUrl": "https://qwenlm.github.io/blog/qwen2.5/",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/qwen-logo.png",
+        "preprompt": "You are Qwen, created by Alibaba Cloud. You are a helpful assistant.",
+        "parameters": {
+          "stop": ["<|endoftext|>", "<|im_end|>"],
+          "temperature": 0.6,
+          "truncate": 28672,
+          "max_new_tokens": 3072
+        },
+        "tools": true,
+        "promptExamples": [
+          {
+            "title": "Write an email from bullet list",
+            "prompt": "As a restaurant owner, write a professional email to the supplier to get these products every week: \n\n- Wine (x10)\n- Eggs (x24)\n- Bread (x12)"
+          },
+          {
+            "title": "Code a snake game",
+            "prompt": "Code a basic snake game in python, give explanations for each step."
+          },
+          {
+            "title": "Assist in a task",
+            "prompt": "How do I make a delicious lemon cheesecake?"
+          }
+        ]
+      },
+      {
+        "name": "CohereForAI/c4ai-command-r-plus-08-2024",
+        "description": "Cohere's largest language model, optimized for conversational interaction and tool use. Now with the 2024 update!",
+        "modelUrl": "https://huggingface.co/CohereForAI/c4ai-command-r-plus-08-2024",
+        "websiteUrl": "https://docs.cohere.com/docs/command-r-plus",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/cohere-logo.png",
+        "tools": true,
+        "parameters": {
+          "stop": ["<|END_OF_TURN_TOKEN|>", "<|im_end|>"],
+          "truncate": 28672,
+          "max_new_tokens": 2048,
+          "temperature": 0.3
+        },
+        "promptExamples": [
+          {
+            "title": "Generate a mouse portrait",
+            "prompt": "Generate the portrait of a scientific mouse in its laboratory."
+          },
+          {
+            "title": "Review a pull request",
+            "prompt": "Review this pull request: https://github.com/huggingface/chat-ui/pull/1131/files"
+          },
+          {
+            "title": "Code a snake game",
+            "prompt": "Code a basic snake game in python, give explanations for each step."
+          }
+        ]
+      },
+      {
+        "name": "deepseek-ai/DeepSeek-R1-Distill-Qwen-32B",
+        "modelUrl": "https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B",
+        "websiteUrl": "https://deepseek.com/",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/deepseek-logo.png",
+        "description": "The first reasoning model from DeepSeek, distilled into a 32B dense model. Outperforms o1-mini on multiple benchmarks.",
+        "reasoning": {
+          "type": "tokens",
+          "beginToken": "",
+          "endToken": "</think>"
+        },
+        "promptExamples": [
+          {
+            "title": "Rs in strawberry",
+            "prompt": "how many R in strawberry?"
+          },
+          {
+            "title": "Larger number",
+            "prompt": "9.11 or 9.9 which number is larger?"
+          },
+          {
+            "title": "Measuring 6 liters",
+            "prompt": "I have a 6- and a 12-liter jug. I want to measure exactly 6 liters."
+          }
+        ],
+        "endpoints": [
+          {
+            "type": "openai",
+            "baseURL": "https://internal.api-inference.huggingface.co/models/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B/v1"
+          }
+        ]
+      },
+      {
+        "name": "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",
+        "modelUrl": "https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",
+        "websiteUrl": "https://www.nvidia.com/",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/nvidia-logo.png",
+        "description": "Nvidia's latest Llama fine-tune, topping alignment benchmarks and optimized for instruction following.",
+        "parameters": {
+          "stop": ["<|eot_id|>", "<|im_end|>"],
+          "temperature": 0.5,
+          "truncate": 28672,
+          "max_new_tokens": 2048
+        },
+        "promptExamples": [
+          {
+            "title": "Rs in strawberry",
+            "prompt": "how many R in strawberry?"
+          },
+          {
+            "title": "Larger number",
+            "prompt": "9.11 or 9.9 which number is larger?"
+          },
+          {
+            "title": "Measuring 6 liters",
+            "prompt": "I have a 6- and a 12-liter jug. I want to measure exactly 6 liters."
+          }
+        ],
+        "endpoints": [
+          {
+            "type": "openai",
+            "baseURL": "https://internal.api-inference.huggingface.co/models/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF/v1"
+          }
+        ]
+      },
+      {
+        "name": "Qwen/QwQ-32B",
+        "preprompt": "You are a helpful and harmless assistant. You are Qwen developed by Alibaba. You should think step-by-step.",
+        "modelUrl": "https://huggingface.co/Qwen/QwQ-32B",
+        "websiteUrl": "https://qwenlm.github.io/blog/qwq-32b/",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/qwen-logo.png",
+        "description": "QwQ is the latest reasoning model released by the Qwen team, approaching the capabilities of R1 in benchmarks.",
+        "reasoning": {
+          "type": "tokens",
+          "beginToken": "",
+          "endToken": "</think>"
+        },
+        "promptExamples": [
+          {
+            "title": "Rs in strawberry",
+            "prompt": "how many R in strawberry?"
+          },
+          {
+            "title": "Larger number",
+            "prompt": "9.11 or 9.9 which number is larger?"
+          },
+          {
+            "title": "Measuring 6 liters",
+            "prompt": "I have a 6- and a 12-liter jug. I want to measure exactly 6 liters."
+          }
+        ],
+        "endpoints": [
+          {
+            "type": "openai",
+            "baseURL": "https://atv7xs1nxxtx2wl0.us-east-1.aws.endpoints.huggingface.cloud/v1"
+          }
+        ]
+      },
+      {
+        "name": "Qwen/Qwen2.5-Coder-32B-Instruct",
+        "description": "Qwen's latest coding model, in its biggest size yet. SOTA on many coding benchmarks.",
+        "modelUrl": "https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct",
+        "websiteUrl": "https://qwenlm.github.io/blog/qwen2.5-coder-family/",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/qwen-logo.png",
+        "preprompt": "You are Qwen, created by Alibaba Cloud. You are a helpful assistant.",
+        "parameters": {
+          "stop": ["<|im_end|>", "<|endoftext|>"],
+          "temperature": 0.6,
+          "truncate": 28672,
+          "max_new_tokens": 3072
+        },
+        "promptExamples": [
+          {
+            "title": "To-do list web app",
+            "prompt": "Create a simple to-do list application where users can:\n- Add new tasks.\n- Mark tasks as complete.\n- Delete completed tasks.\nThe tasks should persist in the browser's local storage so that they remain available even after a page reload.\n"
+          },
+          {
+            "title": "Create a REST API",
+            "prompt": "Build a simple REST API using Node.js, TypeScript and Express:\n- POST /items: Accepts a JSON body with name and quantity and adds a new item.\n- GET /items: Returns a list of all items.\n- PUT /items/:id: Updates the name or quantity of an item by its id.\n- DELETE /items/:id: Removes an item by its id.\nUse an in-memory array as the data store (no need for a database). Include basic error handling (e.g., item not found)."
+          },
+          {
+            "title": "Simple website",
+            "prompt": "Generate a snazzy static landing page for a local coffee shop using HTML and CSS. You can use tailwind using <script src='https://cdn.tailwindcss.com'></script>."
+          }
+        ],
+        "endpoints": [
+          {
+            "type": "openai",
+            "baseURL": "https://internal.api-inference.huggingface.co/models/Qwen/Qwen2.5-Coder-32B-Instruct/v1"
+          }
+        ]
+      },
+      {
+        "name": "google/gemma-3-27b-it",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/google-logo.png",
+        "multimodal": true,
+        "description": "Google's latest open model with great multilingual performance, supports image inputs natively.",
+        "websiteUrl": "https://blog.google/technology/developers/gemma-3/",
+        "promptExamples": [
+          {
+            "title": "Write an email from bullet list",
+            "prompt": "As a restaurant owner, write a professional email to the supplier to get these products every week: \n\n- Wine (x10)\n- Eggs (x24)\n- Bread (x12)"
+          },
+          {
+            "title": "Code a snake game",
+            "prompt": "Code a basic snake game in python, give explanations for each step."
+          },
+          {
+            "title": "Assist in a task",
+            "prompt": "How do I make a delicious lemon cheesecake?"
+          }
+        ],
+        "endpoints": [
+          {
+            "type": "openai",
+            "baseURL": "https://wp0d3hn6s3k8jk22.us-east-1.aws.endpoints.huggingface.cloud/v1",
+            "multimodal": {
+              "image": {
+                "maxSizeInMB": 10,
+                "maxWidth": 560,
+                "maxHeight": 560,
+                "supportedMimeTypes": ["image/jpeg"],
+                "preferredMimeType": "image/jpeg"
+              }
+            }
+          }
+        ]
+      },
+      {
+        "name": "meta-llama/Llama-3.2-11B-Vision-Instruct",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/meta-logo.png",
+        "description": "The latest multimodal model from Meta! Supports image inputs natively.",
+        "websiteUrl": "https://llama.com/",
+        "multimodal": true,
+        "parameters": {
+          "stop": ["<|eot_id|>", "<|im_end|>"],
+          "temperature": 0.6,
+          "truncate": 14336,
+          "max_new_tokens": 1536
+        },
+        "promptExamples": [
+          {
+            "title": "Write an email from bullet list",
+            "prompt": "As a restaurant owner, write a professional email to the supplier to get these products every week: \n\n- Wine (x10)\n- Eggs (x24)\n- Bread (x12)"
+          },
+          {
+            "title": "Code a snake game",
+            "prompt": "Code a basic snake game in python, give explanations for each step."
+          },
+          {
+            "title": "Assist in a task",
+            "prompt": "How do I make a delicious lemon cheesecake?"
+          }
+        ],
+        "endpoints": [
+          {
+            "type": "openai",
+            "baseURL": "https://internal.api-inference.huggingface.co/models/meta-llama/Llama-3.2-11B-Vision-Instruct/v1",
+            "multimodal": {
+              "image": {
+                "maxSizeInMB": 10,
+                "maxWidth": 560,
+                "maxHeight": 560,
+                "supportedMimeTypes": ["image/png", "image/jpeg", "image/webp"],
+                "preferredMimeType": "image/webp"
+              }
+            }
+          }
+        ]
+      },
+      {
+        "name": "NousResearch/Hermes-3-Llama-3.1-8B",
+        "description": "Nous Research's latest Hermes 3 release in 8B size. Follows instruction closely.",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/nous-logo.png",
+        "websiteUrl": "https://nousresearch.com/",
+        "modelUrl": "https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B",
+        "promptExamples": [
+          {
+            "title": "Write an email from bullet list",
+            "prompt": "As a restaurant owner, write a professional email to the supplier to get these products every week: \n\n- Wine (x10)\n- Eggs (x24)\n- Bread (x12)"
+          },
+          {
+            "title": "Code a snake game",
+            "prompt": "Code a basic snake game in python, give explanations for each step."
+          },
+          {
+            "title": "Assist in a task",
+            "prompt": "How do I make a delicious lemon cheesecake?"
+          }
+        ],
+        "parameters": {
+          "stop": ["<|im_end|>"],
+          "temperature": 0.6,
+          "truncate": 14336,
+          "max_new_tokens": 1536
+        }
+      },
+      {
+        "name": "mistralai/Mistral-Nemo-Instruct-2407",
+        "displayName": "mistralai/Mistral-Nemo-Instruct-2407",
+        "description": "A small model with good capabilities in language understanding and commonsense reasoning.",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/mistral-logo.png",
+        "websiteUrl": "https://mistral.ai/news/mistral-nemo/",
+        "modelUrl": "https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407",
+        "preprompt": "",
+        "parameters": {
+          "stop": ["</s>"],
+          "temperature": 0.6,
+          "truncate": 14336,
+          "max_new_tokens": 1536
+        },
+        "promptExamples": [
+          {
+            "title": "Write an email from bullet list",
+            "prompt": "As a restaurant owner, write a professional email to the supplier to get these products every week: \n\n- Wine (x10)\n- Eggs (x24)\n- Bread (x12)"
+          },
+          {
+            "title": "Code a snake game",
+            "prompt": "Code a basic snake game in python, give explanations for each step."
+          },
+          {
+            "title": "Assist in a task",
+            "prompt": "How do I make a delicious lemon cheesecake?"
+          }
+        ]
+      },
+      {
+        "name": "microsoft/Phi-3.5-mini-instruct",
+        "description": "One of the best small models (3.8B parameters), super fast for simple tasks.",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/microsoft-logo.png",
+        "modelUrl": "https://huggingface.co/microsoft/Phi-3.5-mini-instruct",
+        "websiteUrl": "https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/discover-the-new-multi-lingual-high-quality-phi-3-5-slms/ba-p/4225280/",
+        "preprompt": "",
+        "parameters": {
+          "stop": ["<|end|>", "<|endoftext|>", "<|assistant|>"],
+          "temperature": 0.6,
+          "truncate": 28672,
+          "max_new_tokens": 3072
+        },
+        "promptExamples": [
+          {
+            "title": "Write an email from bullet list",
+            "prompt": "As a restaurant owner, write a professional email to the supplier to get these products every week: \n\n- Wine (x10)\n- Eggs (x24)\n- Bread (x12)"
+          },
+          {
+            "title": "Code a snake game",
+            "prompt": "Code a basic snake game in python, give explanations for each step."
+          },
+          {
+            "title": "Assist in a task",
+            "prompt": "How do I make a delicious lemon cheesecake?"
+          }
+        ]
+      },
+      {
+        "name": "internal/task",
+        "tokenizer" : "NousResearch/Hermes-3-Llama-3.1-8B",
+        "unlisted": true,
+        "tools" : true,
+        "endpoints": [
+          {
+            "type": "openai",
+            "baseURL": "https://internal.api-inference.huggingface.co/models/NousResearch/Hermes-3-Llama-3.1-8B/v1"
+          }
+        ],
+        "parameters": {
+          "temperature": 0.1,
+          "max_new_tokens": 256
+        },
+      }
+    ]
+  NODE_ENV: "prod"
+  NODE_LOG_STRUCTURED_DATA: true
+  OLD_MODELS: >
+    [
+      { "name": "bigcode/starcoder" },
+      { "name": "OpenAssistant/oasst-sft-6-llama-30b-xor" },
+      { "name": "HuggingFaceH4/zephyr-7b-alpha" },
+      { "name": "openchat/openchat_3.5" },
+      { "name": "openchat/openchat-3.5-1210" },
+      { "name": "tiiuae/falcon-180B-chat" },
+      { "name": "codellama/CodeLlama-34b-Instruct-hf" },
+      { "name": "google/gemma-7b-it" },
+      { "name": "meta-llama/Llama-2-70b-chat-hf" },
+      { "name": "codellama/CodeLlama-70b-Instruct-hf" },
+      { "name": "openchat/openchat-3.5-0106" },
+      { "name": "meta-llama/Meta-Llama-3-70B-Instruct" },
+      { "name": "meta-llama/Meta-Llama-3.1-405B-Instruct-FP8" },
+      {
+        "name": "CohereForAI/c4ai-command-r-plus",
+        "transferTo": "CohereForAI/c4ai-command-r-plus-08-2024"
+      },
+      {
+        "name": "01-ai/Yi-1.5-34B-Chat",
+        "transferTo": "CohereForAI/c4ai-command-r-plus-08-2024"
+      },
+      {
+        "name": "mistralai/Mixtral-8x7B-Instruct-v0.1",
+        "transferTo": "mistralai/Mistral-Nemo-Instruct-2407"
+      },
+      {
+        "name": "NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO",
+        "transferTo": "NousResearch/Hermes-3-Llama-3.1-8B"
+      },
+      {
+        "name": "mistralai/Mistral-7B-Instruct-v0.3",
+        "transferTo": "mistralai/Mistral-Nemo-Instruct-2407"
+      },
+      {
+        "name": "microsoft/Phi-3-mini-4k-instruct",
+        "transferTo": "microsoft/Phi-3.5-mini-instruct"
+      },
+      {
+        "name": "meta-llama/Meta-Llama-3.1-70B-Instruct",
+        "transferTo": "meta-llama/Llama-3.3-70B-Instruct"
+      },
+      {
+        "name": "Qwen/QwQ-32B-Preview",
+        "transferTo": "Qwen/QwQ-32B"
+      }
+    ]
+  PUBLIC_ORIGIN: "https://huggingface.co"
+  PUBLIC_SHARE_PREFIX: "https://hf.co/chat"
+  PUBLIC_ANNOUNCEMENT_BANNERS: >
+    [
+      {
+        "title": "DeepSeek R1 is now available!",
+        "linkTitle": "Try it out!",
+        "linkHref": "https://huggingface.co/chat/models/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B"
+      }
+    ]
+  PUBLIC_APP_NAME: "HuggingChat"
+  PUBLIC_APP_ASSETS: "huggingchat"
+  PUBLIC_APP_COLOR: "yellow"
+  PUBLIC_APP_DESCRIPTION: "Making the community's best AI chat models available to everyone."
+  PUBLIC_APP_DISCLAIMER_MESSAGE: "Disclaimer: AI is an area of active research with known problems such as biased generation and misinformation. Do not use this application for high-stakes decisions or advice."
+  PUBLIC_APP_GUEST_MESSAGE: "Sign in with a free Hugging Face account to continue using HuggingChat."
+  PUBLIC_APP_DATA_SHARING: 0
+  PUBLIC_APP_DISCLAIMER: 1
+  PUBLIC_PLAUSIBLE_SCRIPT_URL: "/js/script.js"
+  REQUIRE_FEATURED_ASSISTANTS: "true"
+  TASK_MODEL: "internal/task"
+  TEXT_EMBEDDING_MODELS: >
+    [{
+      "name": "bge-base-en-v1-5-sxa",
+      "displayName": "bge-base-en-v1-5-sxa",
+      "chunkCharLength": 512,
+      "endpoints": [{
+        "type": "tei",
+        "url": "https://huggingchat-tei.hf.space/"
+      }]
+    }]
+  WEBSEARCH_BLOCKLIST: '["youtube.com", "twitter.com"]'
+  XFF_DEPTH: '2'
+  TOOLS: >
+    [
+      {
+        "_id": "000000000000000000000001",
+        "displayName": "Image Generation",
+        "description": "Use this tool to generate images based on a prompt.",
+        "color": "yellow",
+        "icon": "camera",
+        "baseUrl": "black-forest-labs/FLUX.1-schnell",
+        "name": "image_generation",
+        "endpoint": "/infer",
+        "inputs": [
+          {
+            "name": "prompt",
+            "description": "A prompt to generate an image from",
+            "paramType": "required",
+            "type": "str"
+          },
+          { "name": "seed", "paramType": "fixed", "value": "0", "type": "float" },
+          {
+            "name": "randomize_seed",
+            "paramType": "fixed",
+            "value": "true",
+            "type": "bool"
+          },
+          {
+            "name": "width",
+            "description": "numeric value between 256 and 2048",
+            "paramType": "optional",
+            "default": 1024,
+            "type": "float"
+          },
+          {
+            "name": "height",
+            "description": "numeric value between 256 and 2048",
+            "paramType": "optional",
+            "default": 1024,
+            "type": "float"
+          },
+          {
+            "name": "num_inference_steps",
+            "paramType": "fixed",
+            "value": "4",
+            "type": "float"
+          }
+        ],
+        "outputComponent": "image",
+        "outputComponentIdx": 0,
+        "showOutput": true
+      },
+      {
+        "_id": "000000000000000000000002",
+        "displayName": "Document Parser",
+        "description": "Use this tool to parse any document and get its content in markdown format.",
+        "color": "yellow",
+        "icon": "cloud",
+        "baseUrl": "huggingchat/document-parser",
+        "name": "document_parser",
+        "endpoint": "/predict",
+        "inputs": [
+          {
+            "name": "document",
+            "description": "Filename of the document to parse",
+            "paramType": "required",
+            "type": "file",
+            "mimeTypes": 'application/*'
+          },
+          {
+            "name": "filename",
+            "paramType": "fixed",
+            "value": "document.pdf",
+            "type": "str"
+          }
+        ],
+        "outputComponent": "textbox",
+        "outputComponentIdx": 0,
+        "showOutput": false,
+        "isHidden": true
+      },
+      {
+        "_id": "000000000000000000000003",
+        "name": "edit_image",
+        "baseUrl": "multimodalart/cosxl",
+        "endpoint": "/run_edit",
+        "inputs": [
+          {
+            "name": "image",
+            "description": "The image path to be edited",
+            "paramType": "required",
+            "type": "file",
+            "mimeTypes": 'image/*'
+          },
+          {
+            "name": "prompt",
+            "description": "The prompt with which to edit the image",
+            "paramType": "required",
+            "type": "str"
+          },
+          {
+            "name": "negative_prompt",
+            "paramType": "fixed",
+            "value": "",
+            "type": "str"
+          },
+          {
+            "name": "guidance_scale",
+            "paramType": "fixed",
+            "value": 6.5,
+            "type": "float"
+          },
+          {
+            "name": "steps",
+            "paramType": "fixed",
+            "value": 30,
+            "type": "float"
+          }
+        ],
+        "outputComponent": "image",
+        "showOutput": true,
+        "displayName": "Image Editor",
+        "color": "green",
+        "icon": "camera",
+        "description": "This tool lets you edit images",
+        "outputComponentIdx": 0
+      }
+    ]
+  HF_ORG_ADMIN: '644171cfbd0c97265298aa99'
+  HF_ORG_EARLY_ACCESS: '5e67bd5b1009063689407478'
+  HF_API_ROOT: 'https://internal.api-inference.huggingface.co/models'
+infisical:
+  enabled: true
+  env: "prod-us-east-1"
+autoscaling:
+  enabled: true
+  minReplicas: 12
+  maxReplicas: 30
+  targetMemoryUtilizationPercentage: "50"
+  targetCPUUtilizationPercentage: "50"
+resources:
+  requests:
+    cpu: 2
+    memory: 4Gi
+  limits:
+    cpu: 4
+    memory: 8Gi
+monitoring:
+  enabled: true

chart/templates/_helpers.tpl ADDED Viewed

	@@ -0,0 +1,22 @@

+{{- define "name" -}}
+{{- default $.Release.Name | trunc 63 | trimSuffix "-" -}}
+{{- end -}}
+{{- define "app.name" -}}
+chat-ui
+{{- end -}}
+{{- define "labels.standard" -}}
+release: {{ $.Release.Name | quote }}
+heritage: {{ $.Release.Service | quote }}
+chart: "{{ include "name" . }}"
+app: "{{ include "app.name" . }}"
+{{- end -}}
+{{- define "labels.resolver" -}}
+release: {{ $.Release.Name | quote }}
+heritage: {{ $.Release.Service | quote }}
+chart: "{{ include "name" . }}"
+app: "{{ include "app.name" . }}-resolver"
+{{- end -}}

chart/templates/config.yaml ADDED Viewed

	@@ -0,0 +1,10 @@

+apiVersion: v1
+kind: ConfigMap
+metadata:
+  labels: {{ include "labels.standard" . | nindent 4 }}
+  name: {{ include "name" . }}
+  namespace: {{ .Release.Namespace }}
+data:
+  {{- range $key, $value := $.Values.envVars }}
+  {{ $key }}: {{ $value | quote }}
+  {{- end }}

chart/templates/deployment.yaml ADDED Viewed

	@@ -0,0 +1,81 @@

+apiVersion: apps/v1
+kind: Deployment
+metadata:
+  labels: {{ include "labels.standard" . | nindent 4 }}
+  name: {{ include "name" . }}
+  namespace: {{ .Release.Namespace }}
+  {{- if .Values.infisical.enabled }}
+  annotations:
+    secrets.infisical.com/auto-reload: "true"
+  {{- end }}
+spec:
+  progressDeadlineSeconds: 600
+  {{- if not $.Values.autoscaling.enabled }}
+  replicas: {{ .Values.replicas }}
+  {{- end }}
+  revisionHistoryLimit: 10
+  selector:
+    matchLabels: {{ include "labels.standard" . | nindent 6 }}
+  strategy:
+    rollingUpdate:
+      maxSurge: 25%
+      maxUnavailable: 25%
+    type: RollingUpdate
+  template:
+    metadata:
+      labels: {{ include "labels.standard" . | nindent 8 }}
+      annotations:
+        checksum/config: {{ include (print $.Template.BasePath "/config.yaml") . | sha256sum }}
+        {{- if $.Values.envVars.NODE_LOG_STRUCTURED_DATA }}
+        co.elastic.logs/json.expand_keys: "true"
+        {{- end }}
+    spec:
+      {{- if .Values.serviceAccount.enabled }}
+      serviceAccountName: "{{ .Values.serviceAccount.name | default (include "name" .) }}"
+      {{- end }}
+      containers:
+        - name: chat-ui
+          image: "{{ .Values.image.repository }}/{{ .Values.image.name }}:{{ .Values.image.tag }}"
+          imagePullPolicy: {{ .Values.image.pullPolicy }}
+          readinessProbe:
+            failureThreshold: 30
+            periodSeconds: 10
+            httpGet:
+              path: {{ $.Values.envVars.APP_BASE | default "" }}/healthcheck
+              port: {{ $.Values.envVars.APP_PORT | default 3000 | int }}
+          livenessProbe:
+            failureThreshold: 30
+            periodSeconds: 10
+            httpGet:
+              path: {{ $.Values.envVars.APP_BASE | default "" }}/healthcheck
+              port: {{ $.Values.envVars.APP_PORT | default 3000 | int }}
+          ports:
+            - containerPort: {{ $.Values.envVars.APP_PORT | default 3000 | int }}
+              name: http
+              protocol: TCP
+            {{- if $.Values.monitoring.enabled }}
+            - containerPort: {{ $.Values.envVars.METRICS_PORT | default 5565 | int }}
+              name: metrics
+              protocol: TCP
+            {{- end }}
+          resources: {{ toYaml .Values.resources | nindent 12 }}
+          {{- with $.Values.extraEnv }}
+          env:
+            {{- toYaml . | nindent 14 }}
+          {{- end }}
+          envFrom:
+            - configMapRef:
+                name: {{ include "name" . }}
+          {{- if $.Values.infisical.enabled }}
+            - secretRef:
+                name: {{ include "name" $ }}-secs
+          {{- end }}
+          {{- with $.Values.extraEnvFrom }}
+            {{- toYaml . | nindent 14 }}
+          {{- end }}
+      nodeSelector: {{ toYaml .Values.nodeSelector | nindent 8 }}
+      tolerations: {{ toYaml .Values.tolerations | nindent 8 }}
+      volumes:
+        - name: config
+          configMap:
+            name: {{ include "name" . }}

chart/templates/hpa.yaml ADDED Viewed

	@@ -0,0 +1,45 @@

+{{- if $.Values.autoscaling.enabled }}
+apiVersion: autoscaling/v2
+kind: HorizontalPodAutoscaler
+metadata:
+  labels: {{ include "labels.standard" . | nindent 4 }}
+  name: {{ include "name" . }}
+  namespace: {{ .Release.Namespace }}
+spec:
+  scaleTargetRef:
+    apiVersion: apps/v1
+    kind: Deployment
+    name: {{ include "name" . }}
+  minReplicas: {{ $.Values.autoscaling.minReplicas }}
+  maxReplicas: {{ $.Values.autoscaling.maxReplicas }}
+  metrics:
+    {{- if ne "" $.Values.autoscaling.targetMemoryUtilizationPercentage }}
+    - type: Resource
+      resource:
+        name: memory
+        target:
+          type: Utilization
+          averageUtilization: {{ $.Values.autoscaling.targetMemoryUtilizationPercentage | int }}
+    {{- end }}
+    {{- if ne "" $.Values.autoscaling.targetCPUUtilizationPercentage }}
+    - type: Resource
+      resource:
+        name: cpu
+        target:
+          type: Utilization
+          averageUtilization: {{ $.Values.autoscaling.targetCPUUtilizationPercentage | int }}
+    {{- end }}
+  behavior:
+    scaleDown:
+      stabilizationWindowSeconds: 600
+      policies:
+        - type: Percent
+          value: 10
+          periodSeconds: 60
+    scaleUp:
+      stabilizationWindowSeconds: 0
+      policies:
+        - type: Pods
+          value: 1
+          periodSeconds: 30
+{{- end }}

chart/templates/infisical.yaml ADDED Viewed

	@@ -0,0 +1,24 @@

+{{- if .Values.infisical.enabled }}
+apiVersion: secrets.infisical.com/v1alpha1
+kind: InfisicalSecret
+metadata:
+  name: {{ include "name" $ }}-infisical-secret
+  namespace: {{ $.Release.Namespace }}
+spec:
+  authentication:
+    universalAuth:
+      credentialsRef:
+        secretName: {{ .Values.infisical.operatorSecretName | quote }}
+        secretNamespace: {{ .Values.infisical.operatorSecretNamespace | quote }}
+      secretsScope:
+        envSlug: {{ .Values.infisical.env | quote }}
+        projectSlug: {{ .Values.infisical.project | quote }}
+        secretsPath: /
+  hostAPI: {{ .Values.infisical.url | quote }}
+  managedSecretReference:
+    creationPolicy: Owner
+    secretName: {{ include "name" $ }}-secs
+    secretNamespace: {{ .Release.Namespace | quote }}
+    secretType: Opaque
+  resyncInterval: {{ .Values.infisical.resyncInterval }}
+{{- end }}

chart/templates/ingress.yaml ADDED Viewed

	@@ -0,0 +1,32 @@

+{{- if $.Values.ingress.enabled }}
+apiVersion: networking.k8s.io/v1
+kind: Ingress
+metadata:
+  annotations: {{ toYaml .Values.ingress.annotations | nindent 4 }}
+  labels: {{ include "labels.standard" . | nindent 4 }}
+  name: {{ include "name" . }}
+  namespace: {{ .Release.Namespace }}
+spec:
+  {{ if $.Values.ingress.className }}
+  ingressClassName: {{ .Values.ingress.className }}
+  {{ end }}
+  {{- with .Values.ingress.tls }}
+  tls:
+    - hosts:
+        - {{ $.Values.domain | quote }}
+      {{- with .secretName }}
+      secretName: {{ . }}
+      {{- end }}
+  {{- end }}
+  rules:
+    - host: {{ .Values.domain }}
+      http:
+        paths:
+          - backend:
+              service:
+                name: {{ include "name" . }}
+                port:
+                  name: http
+            path: {{ $.Values.ingress.path | default "/" }}
+            pathType: Prefix
+{{- end }}

chart/templates/network-policy.yaml ADDED Viewed

	@@ -0,0 +1,36 @@

+{{- if $.Values.networkPolicy.enabled }}
+apiVersion: networking.k8s.io/v1
+kind: NetworkPolicy
+metadata:
+  name: {{ include "name" . }}
+  namespace: {{ .Release.Namespace }}
+spec:
+  egress:
+    - ports:
+        - port: 53
+          protocol: UDP
+      to:
+        - namespaceSelector:
+            matchLabels:
+              kubernetes.io/metadata.name: kube-system
+          podSelector:
+            matchLabels:
+              k8s-app: kube-dns
+    - to:
+        {{- range $ip := .Values.networkPolicy.allowedBlocks }}
+        - ipBlock:
+            cidr: {{ $ip | quote }}
+        {{- end }}
+    - to:
+        - ipBlock:
+            cidr: 0.0.0.0/0
+            except:
+              - 10.0.0.0/8
+              - 172.16.0.0/12
+              - 192.168.0.0/16
+              - 169.254.169.254/32
+  podSelector:
+    matchLabels: {{ include "labels.standard" . | nindent 6 }}
+  policyTypes:
+    - Egress
+{{- end }}

chart/templates/service-account.yaml ADDED Viewed

	@@ -0,0 +1,13 @@

+{{- if and .Values.serviceAccount.enabled .Values.serviceAccount.create }}
+apiVersion: v1
+kind: ServiceAccount
+automountServiceAccountToken: {{ .Values.serviceAccount.automountServiceAccountToken }}
+metadata:
+  name: "{{ .Values.serviceAccount.name | default (include "name" .) }}"
+  namespace: {{ .Release.Namespace }}
+  labels: {{ include "labels.standard" . | nindent 4 }}
+  {{- with .Values.serviceAccount.annotations }}
+  annotations:
+    {{- toYaml . | nindent 4 }}
+  {{- end }}
+{{- end }}

chart/templates/service-monitor.yaml ADDED Viewed

	@@ -0,0 +1,15 @@

+{{- if $.Values.monitoring.enabled }}
+apiVersion: monitoring.coreos.com/v1
+kind: ServiceMonitor
+metadata:
+  labels: {{ include "labels.standard" . | nindent 4 }}
+  name: {{ include "name" . }}
+  namespace: {{ .Release.Namespace }}
+spec:
+  selector:
+    matchLabels: {{ include "labels.standard" . | nindent 6 }}
+  endpoints:
+    - port: metrics
+      path: /metrics
+      interval: 15s
+{{- end }}

chart/templates/service.yaml ADDED Viewed

	@@ -0,0 +1,21 @@

+apiVersion: v1
+kind: Service
+metadata:
+  name: "{{ include "name" . }}"
+  annotations: {{ toYaml .Values.service.annotations | nindent 4 }}
+  namespace: {{ .Release.Namespace }}
+  labels: {{ include "labels.standard" . | nindent 4 }}
+spec:
+  ports:
+  - name: http
+    port: 80
+    protocol: TCP
+    targetPort: http
+  {{- if $.Values.monitoring.enabled }}
+  - name: metrics
+    port: 5565
+    protocol: TCP
+    targetPort: metrics
+  {{- end }}
+  selector: {{ include "labels.standard" . | nindent 4 }}
+  type: {{.Values.service.type}}

chart/values.yaml ADDED Viewed

	@@ -0,0 +1,67 @@

+image:
+  repository: ghcr.io/huggingface
+  name: chat-ui
+  tag: 0.0.0-latest
+  pullPolicy: IfNotPresent
+replicas: 3
+domain: huggingface.co
+networkPolicy:
+  enabled: false
+  allowedBlocks: []
+service:
+  type: NodePort
+  annotations: { }
+serviceAccount:
+  enabled: false
+  create: false
+  name: ""
+  automountServiceAccountToken: true
+  annotations: { }
+ingress:
+  enabled: true
+  path: "/"
+  annotations: { }
+  # className: "nginx"
+  tls: { }
+    # secretName: XXX
+resources:
+  requests:
+    cpu: 2
+    memory: 4Gi
+  limits:
+    cpu: 2
+    memory: 4Gi
+nodeSelector: {}
+tolerations: []
+envVars: { }
+infisical:
+  enabled: false
+  env: ""
+  project: "huggingchat-v2-a1"
+  url: ""
+  resyncInterval: 60
+  operatorSecretName: "huggingchat-operator-secrets"
+  operatorSecretNamespace: "hub-utils"
+# Allow to environment injections on top or instead of infisical
+extraEnvFrom: []
+extraEnv: []
+autoscaling:
+  enabled: false
+  minReplicas: 1
+  maxReplicas: 2
+  targetMemoryUtilizationPercentage: ""
+  targetCPUUtilizationPercentage: ""
+monitoring:
+  enabled: false

docs/source/_toctree.yml ADDED Viewed

	@@ -0,0 +1,64 @@

+- local: index
+  title: 🤗 Chat UI
+- title: Installation
+  sections:
+    - local: installation/local
+      title: Local
+    - local: installation/spaces
+      title: Spaces
+    - local: installation/docker
+      title: Docker
+    - local: installation/helm
+      title: Helm
+- title: Configuration
+  sections:
+    - local: configuration/overview
+      title: Overview
+    - local: configuration/theming
+      title: Theming
+    - local: configuration/open-id
+      title: OpenID
+    - local: configuration/web-search
+      title: Web Search
+    - local: configuration/metrics
+      title: Metrics
+    - local: configuration/embeddings
+      title: Text Embedding Models
+    - title: Models
+      sections:
+        - local: configuration/models/overview
+          title: Overview
+        - local: configuration/models/multimodal
+          title: Multimodal
+        - local: configuration/models/tools
+          title: Tools
+        - title: Providers
+          sections:
+            - local: configuration/models/providers/anthropic
+              title: Anthropic
+            - local: configuration/models/providers/aws
+              title: AWS
+            - local: configuration/models/providers/cloudflare
+              title: Cloudflare
+            - local: configuration/models/providers/cohere
+              title: Cohere
+            - local: configuration/models/providers/google
+              title: Google
+            - local: configuration/models/providers/langserve
+              title: Langserve
+            - local: configuration/models/providers/llamacpp
+              title: Llama.cpp
+            - local: configuration/models/providers/ollama
+              title: Ollama
+            - local: configuration/models/providers/openai
+              title: OpenAI
+            - local: configuration/models/providers/tgi
+              title: TGI
+    - local: configuration/common-issues
+      title: Common Issues
+- title: Developing
+  sections:
+    - local: developing/architecture
+      title: Architecture
+    - local: developing/copy-huggingchat
+      title: Copy HuggingChat

docs/source/configuration/common-issues.md ADDED Viewed

	@@ -0,0 +1,7 @@

+# Common Issues
+## 403：You don't have access to this conversation
+Most likely you are running chat-ui over HTTP. The recommended option is to setup something like NGINX to handle HTTPS and proxy the requests to chat-ui. If you really need to run over HTTP you can add `ALLOW_INSECURE_COOKIES=true` to your `.env.local`.
+Make sure to set your `PUBLIC_ORIGIN` in your `.env.local` to the correct URL as well.

docs/source/configuration/embeddings.md ADDED Viewed

	@@ -0,0 +1,105 @@

+# Text Embedding Models
+By default (for backward compatibility), when `TEXT_EMBEDDING_MODELS` environment variable is not defined, [transformers.js](https://huggingface.co/docs/transformers.js) embedding models will be used for embedding tasks, specifically, the [Xenova/gte-small](https://huggingface.co/Xenova/gte-small) model.
+You can customize the embedding model by setting `TEXT_EMBEDDING_MODELS` in your `.env.local` file where the required fields are `name`, `chunkCharLength` and `endpoints`.
+Supported text embedding backends are: [`transformers.js`](https://huggingface.co/docs/transformers.js), [`TEI`](https://github.com/huggingface/text-embeddings-inference) and [`OpenAI`](https://platform.openai.com/docs/guides/embeddings). `transformers.js` models run locally as part of `chat-ui`, whereas `TEI` models run in a different environment & accessed through an API endpoint. `openai` models are accessed through the [OpenAI API](https://platform.openai.com/docs/guides/embeddings).
+When more than one embedding models are supplied in `.env.local` file, the first will be used by default, and the others will only be used on LLM's which configured `embeddingModel` to the name of the model.
+## Transformers.js
+The Transformers.js backend uses local CPU for the embedding which can be quite slow. If possible, consider using TEI or OpenAI embeddings instead if you use web search frequently, as performance will improve significantly.
+```ini
+TEXT_EMBEDDING_MODELS = `[
+  {
+    "name": "Xenova/gte-small",
+    "displayName": "Xenova/gte-small",
+    "description": "locally running embedding",
+    "chunkCharLength": 512,
+    "endpoints": [
+      { "type": "transformersjs" }
+    ]
+  }
+]`
+```
+## Text Embeddings Inference (TEI)
+> Text Embeddings Inference (TEI) is a comprehensive toolkit designed for efficient deployment and serving of open source text embeddings models. It enables high-performance extraction for the most popular models, including FlagEmbedding, Ember, GTE, and E5.
+Some recommended models at the time of writing (May 2024) are `Snowflake/snowflake-arctic-embed-m` and `BAAI/bge-large-en-v1.5`. You may run TEI locally with GPU support via Docker:
+`docker run --gpus all -p 8080:80 -v tei-data:/data --name tei ghcr.io/huggingface/text-embeddings-inference:1.2 --model-id YOUR/HF_MODEL`
+You can then hook this up to your Chat UI instance with the following configuration.
+```ini
+TEXT_EMBEDDING_MODELS=`[
+  {
+    "name": "YOUR/HF_MODEL",
+    "displayName": "YOUR/HF_MODEL",
+    "preQuery": "Check the model documentation for the preQuery. Not all models have one",
+    "prePassage": "Check the model documentation for the prePassage. Not all models have one",
+    "chunkCharLength": 512,
+    "endpoints": [{
+      "type": "tei",
+      "url": "http://127.0.0.1:8080/"
+    }]
+  }
+]`
+```
+Examples for `Snowflake/snowflake-arctic-embed-m` and `BAAI/bge-large-en-v1.5`:
+```ini
+TEXT_EMBEDDING_MODELS=`[
+  {
+    "name": "Snowflake/snowflake-arctic-embed-m",
+    "displayName": "Snowflake/snowflake-arctic-embed-m",
+    "preQuery": "Represent this sentence for searching relevant passages: ",
+    "chunkCharLength": 512,
+    "endpoints": [{
+      "type": "tei",
+      "url": "http://127.0.0.1:8080/"
+    }]
+  },{
+    "name": "BAAI/bge-large-en-v1.5",
+    "displayName": "BAAI/bge-large-en-v1.5",
+    "chunkCharLength": 512,
+    "endpoints": [{
+      "type": "tei",
+      "url": "http://127.0.0.1:8080/"
+    }]
+  }
+]`
+```
+## OpenAI
+It's also possible to host your own OpenAI API compatible embedding models. [`Infinity`](https://github.com/michaelfeil/infinity) is one example. You may run it locally with Docker:
+`docker run -it --gpus all -v infinity-data:/app/.cache -p 7997:7997 michaelf34/infinity:latest v2 --model-id nomic-ai/nomic-embed-text-v1 --port 7997`
+You can then hook this up to your Chat UI instance with the following configuration.
+```ini
+TEXT_EMBEDDING_MODELS=`[
+  {
+    "name": "nomic-ai/nomic-embed-text-v1",
+    "displayName": "nomic-ai/nomic-embed-text-v1",
+    "chunkCharLength": 512,
+    "model": {
+      "name": "nomic-ai/nomic-embed-text-v1"
+    },
+    "endpoints": [
+      {
+        "type": "openai",
+        "url": "https://127.0.0.1:7997/embeddings"
+      }
+    ]
+  }
+]`
+```

docs/source/configuration/metrics.md ADDED Viewed

	@@ -0,0 +1,9 @@

+# Metrics
+The server can expose prometheus metrics on port `5565` but is off by default. You may enable the metrics server with `METRICS_ENABLED=true` and change the port with `METRICS_PORT=1234`.
+<Tip>
+In development with `npm run dev`, the metrics server does not shutdown gracefully due to Sveltekit not providing hooks for restart. It's recommended to disable the metrics server in this case.
+</Tip>

docs/source/configuration/models/multimodal.md ADDED Viewed

	@@ -0,0 +1,24 @@

+# Multimodal
+We currently support [IDEFICS](https://huggingface.co/blog/idefics) (hosted on [TGI](./providers/tgi)), OpenAI and Anthropic Claude 3 as multimodal models. You can enable it by setting `multimodal: true` in your `MODELS` configuration. For IDEFICS, you must have a [PRO HF Api token](https://huggingface.co/settings/tokens). For OpenAI, see the [OpenAI section](./providers/openai). For Anthropic, see the [Anthropic section](./providers/anthropic).
+```ini
+MODELS=`[
+  {
+    "name": "HuggingFaceM4/idefics-80b-instruct",
+    "multimodal" : true,
+    "description": "IDEFICS is the new multimodal model by Hugging Face.",
+    "preprompt": "",
+    "chatPromptTemplate" : "{{#each messages}}{{#ifUser}}User: {{content}}{{/ifUser}}<end_of_utterance>\nAssistant: {{#ifAssistant}}{{content}}\n{{/ifAssistant}}{{/each}}",
+    "parameters": {
+      "temperature": 0.1,
+      "top_p": 0.95,
+      "repetition_penalty": 1.2,
+      "top_k": 12,
+      "truncate": 1000,
+      "max_new_tokens": 1024,
+      "stop": ["<end_of_utterance>", "User:", "\nUser:"]
+    }
+  }
+]`
+```

docs/source/configuration/models/overview.md ADDED Viewed

	@@ -0,0 +1,147 @@

+# Models Overview
+You can customize the parameters passed to the model or even use a new model by updating the `MODELS` variable in your `.env.local`. The default one can be found in `.env` and looks like this :
+```ini
+MODELS=`[
+  {
+    "name": "mistralai/Mistral-7B-Instruct-v0.2",
+    "displayName": "mistralai/Mistral-7B-Instruct-v0.2",
+    "description": "Mistral 7B is a new Apache 2.0 model, released by Mistral AI that outperforms Llama2 13B in benchmarks.",
+    "websiteUrl": "https://mistral.ai/news/announcing-mistral-7b/",
+    "preprompt": "",
+    "chatPromptTemplate" : "<s>{{#each messages}}{{#ifUser}}[INST] {{#if @first}}{{#if @root.preprompt}}{{@root.preprompt}}\n{{/if}}{{/if}}{{content}} [/INST]{{/ifUser}}{{#ifAssistant}}{{content}}</s>{{/ifAssistant}}{{/each}}",
+    "parameters": {
+      "temperature": 0.3,
+      "top_p": 0.95,
+      "repetition_penalty": 1.2,
+      "top_k": 50,
+      "truncate": 3072,
+      "max_new_tokens": 1024,
+      "stop": ["</s>"]
+    },
+    "promptExamples": [
+      {
+        "title": "Write an email from bullet list",
+        "prompt": "As a restaurant owner, write a professional email to the supplier to get these products every week: \n\n- Wine (x10)\n- Eggs (x24)\n- Bread (x12)"
+      }, {
+        "title": "Code a snake game",
+        "prompt": "Code a basic snake game in python, give explanations for each step."
+      }, {
+        "title": "Assist in a task",
+        "prompt": "How do I make a delicious lemon cheesecake?"
+      }
+    ]
+  }
+]`
+```
+You can change things like the parameters, or customize the preprompt to better suit your needs. You can also add more models by adding more objects to the array, with different preprompts for example.
+## Chat Prompt Template
+When querying the model for a chat response, the `chatPromptTemplate` template is used. `messages` is an array of chat messages, it has the format `[{ content: string }, ...]`. To identify if a message is a user message or an assistant message the `ifUser` and `ifAssistant` block helpers can be used.
+The following is the default `chatPromptTemplate`, although newlines and indentiation have been added for readability. You can find the prompts used in production for HuggingChat [here](https://github.com/huggingface/chat-ui/blob/main/PROMPTS.md). The templating language used is [Handlebars](https://www.npmjs.com/package/handlebars).
+```handlebars
+{{preprompt}}
+{{#each messages}}
+	{{#ifUser}}{{@root.userMessageToken}}{{content}}{{@root.userMessageEndToken}}{{/ifUser}}
+	{{#ifAssistant
+	}}{{@root.assistantMessageToken}}{{content}}{{@root.assistantMessageEndToken}}{{/ifAssistant}}
+{{/each}}
+{{assistantMessageToken}}
+```
+## Custom endpoint authorization
+### Basic and Bearer
+Custom endpoints may require authorization, depending on how you configure them. Authentication will usually be set either with `Basic` or `Bearer`.
+For `Basic` we will need to generate a base64 encoding of the username and password.
+`echo -n "USER:PASS" | base64`
+> VVNFUjpQQVNT
+For `Bearer` you can use a token, which can be grabbed from [here](https://huggingface.co/settings/tokens).
+You can then add the generated information and the `authorization` parameter to your `.env.local`.
+```ini
+"endpoints": [
+  {
+    "url": "https://HOST:PORT",
+    "authorization": "Basic VVNFUjpQQVNT",
+  }
+]
+```
+Please note that if `HF_TOKEN` is also set or not empty, it will take precedence.
+## Models hosted on multiple custom endpoints
+If the model being hosted will be available on multiple servers/instances add the `weight` parameter to your `.env.local`. The `weight` will be used to determine the probability of requesting a particular endpoint.
+```ini
+"endpoints": [
+  {
+    "url": "https://HOST:PORT",
+    "weight": 1
+  },
+  {
+    "url": "https://HOST:PORT",
+    "weight": 2
+  }
+  ...
+]
+```
+## Client Certificate Authentication (mTLS)
+Custom endpoints may require client certificate authentication, depending on how you configure them. To enable mTLS between Chat UI and your custom endpoint, you will need to set the `USE_CLIENT_CERTIFICATE` to `true`, and add the `CERT_PATH` and `KEY_PATH` parameters to your `.env.local`. These parameters should point to the location of the certificate and key files on your local machine. The certificate and key files should be in PEM format. The key file can be encrypted with a passphrase, in which case you will also need to add the `CLIENT_KEY_PASSWORD` parameter to your `.env.local`.
+If you're using a certificate signed by a private CA, you will also need to add the `CA_PATH` parameter to your `.env.local`. This parameter should point to the location of the CA certificate file on your local machine.
+If you're using a self-signed certificate, e.g. for testing or development purposes, you can set the `REJECT_UNAUTHORIZED` parameter to `false` in your `.env.local`. This will disable certificate validation, and allow Chat UI to connect to your custom endpoint.
+## Specific Embedding Model
+A model can use any of the embedding models defined under `TEXT_EMBEDDING_MODELS`, (currently used when web searching). By default it will use the first embedding model, but it can be changed with the field `embeddingModel`:
+```ini
+TEXT_EMBEDDING_MODELS = `[
+  {
+    "name": "Xenova/gte-small",
+    "chunkCharLength": 512,
+    "endpoints": [
+      {"type": "transformersjs"}
+    ]
+  },
+  {
+    "name": "intfloat/e5-base-v2",
+    "chunkCharLength": 768,
+    "endpoints": [
+      {"type": "tei", "url": "http://127.0.0.1:8080/", "authorization": "Basic VVNFUjpQQVNT"},
+      {"type": "tei", "url": "http://127.0.0.1:8081/"}
+    ]
+  }
+]`
+MODELS=`[
+  {
+      "name": "Ollama Mistral",
+      "chatPromptTemplate": "...",
+      "embeddingModel": "intfloat/e5-base-v2"
+      "parameters": {
+        ...
+      },
+      "endpoints": [
+        ...
+      ]
+  }
+]`
+```

docs/source/configuration/models/providers/anthropic.md ADDED Viewed

	@@ -0,0 +1,117 @@

+# Anthropic
+| Feature                     | Available |
+| --------------------------- | --------- |
+| [Tools](../tools)           | No        |
+| [Multimodal](../multimodal) | Yes       |
+We also support Anthropic models (including multimodal ones via `multmodal: true`) through the official SDK. You may provide your API key via the `ANTHROPIC_API_KEY` env variable, or alternatively, through the `endpoints.apiKey` as per the following example.
+```ini
+MODELS=`[
+  {
+      "name": "claude-3-haiku-20240307",
+      "displayName": "Claude 3 Haiku",
+      "description": "Fastest and most compact model for near-instant responsiveness",
+      "multimodal": true,
+      "parameters": {
+        "max_new_tokens": 4096,
+      },
+      "endpoints": [
+        {
+          "type": "anthropic",
+          // optionals
+          "apiKey": "sk-ant-...",
+          "baseURL": "https://api.anthropic.com",
+          "defaultHeaders": {},
+          "defaultQuery": {}
+        }
+      ]
+  },
+  {
+      "name": "claude-3-sonnet-20240229",
+      "displayName": "Claude 3 Sonnet",
+      "description": "Ideal balance of intelligence and speed",
+      "multimodal": true,
+      "parameters": {
+        "max_new_tokens": 4096,
+      },
+      "endpoints": [
+        {
+          "type": "anthropic",
+          // optionals
+          "apiKey": "sk-ant-...",
+          "baseURL": "https://api.anthropic.com",
+          "defaultHeaders": {},
+          "defaultQuery": {}
+        }
+      ]
+  },
+  {
+      "name": "claude-3-opus-20240229",
+      "displayName": "Claude 3 Opus",
+      "description": "Most powerful model for highly complex tasks",
+      "multimodal": true,
+      "parameters": {
+         "max_new_tokens": 4096
+      },
+      "endpoints": [
+        {
+          "type": "anthropic",
+          // optionals
+          "apiKey": "sk-ant-...",
+          "baseURL": "https://api.anthropic.com",
+          "defaultHeaders": {},
+          "defaultQuery": {}
+        }
+      ]
+  }
+]`
+```
+## VertexAI
+We also support using Anthropic models running on Vertex AI. Authentication is done using Google Application Default Credentials. Project ID can be provided through the `endpoints.projectId` as per the following example:
+```ini
+MODELS=`[
+  {
+      "name": "claude-3-haiku@20240307",
+      "displayName": "Claude 3 Haiku",
+      "description": "Fastest, most compact model for near-instant responsiveness",
+      "multimodal": true,
+      "parameters": {
+         "max_new_tokens": 4096
+      },
+      "endpoints": [
+        {
+          "type": "anthropic-vertex",
+          "region": "us-central1",
+          "projectId": "gcp-project-id",
+          // optionals
+          "defaultHeaders": {},
+          "defaultQuery": {}
+        }
+      ]
+  },
+  {
+      "name": "claude-3-sonnet@20240229",
+      "displayName": "Claude 3 Sonnet",
+      "description": "Ideal balance of intelligence and speed",
+      "multimodal": true,
+      "parameters": {
+        "max_new_tokens": 4096,
+      },
+      "endpoints": [
+        {
+          "type": "anthropic-vertex",
+          "region": "us-central1",
+          "projectId": "gcp-project-id",
+          // optionals
+          "defaultHeaders": {},
+          "defaultQuery": {}
+        }
+      ]
+  },
+]`
+```

docs/source/configuration/models/providers/aws.md ADDED Viewed

	@@ -0,0 +1,35 @@

+# Amazon Web Services (AWS)
+| Feature                     | Available |
+| --------------------------- | --------- |
+| [Tools](../tools)           | No        |
+| [Multimodal](../multimodal) | No        |
+You may specify your Amazon SageMaker instance as an endpoint for Chat UI:
+```ini
+MODELS=`[{
+  "name": "your-model",
+  "displayName": "Your Model",
+  "description": "Your description",
+  "parameters": {
+     "max_new_tokens": 4096
+  },
+  "endpoints": [
+    {
+      "type" : "aws",
+      "service" : "sagemaker"
+      "url": "",
+      "accessKey": "",
+      "secretKey" : "",
+      "sessionToken": "",
+      "region": "",
+      "weight": 1
+    }
+  ]
+}]`
+```
+You can also set `"service": "lambda"` to use a lambda instance.
+You can get the `accessKey` and `secretKey` from your AWS user, under programmatic access.