Buckets:
| <meta charset="utf-8" /><meta name="hf:doc:metadata" content="{"title":"Hugging Face Generative AI Services (HUGS)","local":"hugging-face-generative-ai-services-hugs","sections":[{"title":"Key Features","local":"key-features","sections":[],"depth":2},{"title":"Why HUGS?","local":"why-hugs","sections":[{"title":"Built for Open Models","local":"built-for-open-models","sections":[],"depth":3}],"depth":2},{"title":"Getting Started","local":"getting-started","sections":[],"depth":2},{"title":"More Resources","local":"more-resources","sections":[],"depth":2}],"depth":1}"> | |
| <link href="/docs/hugs/pr_13/en/_app/immutable/assets/0.e3b0c442.css" rel="modulepreload"> | |
| <link rel="modulepreload" href="/docs/hugs/pr_13/en/_app/immutable/entry/start.e16d698a.js"> | |
| <link rel="modulepreload" href="/docs/hugs/pr_13/en/_app/immutable/chunks/scheduler.b108d059.js"> | |
| <link rel="modulepreload" href="/docs/hugs/pr_13/en/_app/immutable/chunks/singletons.c6602da5.js"> | |
| <link rel="modulepreload" href="/docs/hugs/pr_13/en/_app/immutable/chunks/paths.cdca596f.js"> | |
| <link rel="modulepreload" href="/docs/hugs/pr_13/en/_app/immutable/entry/app.befdf950.js"> | |
| <link rel="modulepreload" href="/docs/hugs/pr_13/en/_app/immutable/chunks/index.008de539.js"> | |
| <link rel="modulepreload" href="/docs/hugs/pr_13/en/_app/immutable/nodes/0.c68b1b10.js"> | |
| <link rel="modulepreload" href="/docs/hugs/pr_13/en/_app/immutable/nodes/15.2e0d0e44.js"> | |
| <link rel="modulepreload" href="/docs/hugs/pr_13/en/_app/immutable/chunks/EditOnGithub.d1c48e3d.js"><!-- HEAD_svelte-u9bgzb_START --><meta name="hf:doc:metadata" content="{"title":"Hugging Face Generative AI Services (HUGS)","local":"hugging-face-generative-ai-services-hugs","sections":[{"title":"Key Features","local":"key-features","sections":[],"depth":2},{"title":"Why HUGS?","local":"why-hugs","sections":[{"title":"Built for Open Models","local":"built-for-open-models","sections":[],"depth":3}],"depth":2},{"title":"Getting Started","local":"getting-started","sections":[],"depth":2},{"title":"More Resources","local":"more-resources","sections":[],"depth":2}],"depth":1}"><!-- HEAD_svelte-u9bgzb_END --> <p></p> <h1 class="relative group"><a id="hugging-face-generative-ai-services-hugs" class="header-link block pr-1.5 text-lg no-hover:hidden with-hover:absolute with-hover:p-1.5 with-hover:opacity-0 with-hover:group-hover:opacity-100 with-hover:right-full" href="#hugging-face-generative-ai-services-hugs"><span><svg class="" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" aria-hidden="true" role="img" width="1em" height="1em" preserveAspectRatio="xMidYMid meet" viewBox="0 0 256 256"><path d="M167.594 88.393a8.001 8.001 0 0 1 0 11.314l-67.882 67.882a8 8 0 1 1-11.314-11.315l67.882-67.881a8.003 8.003 0 0 1 11.314 0zm-28.287 84.86l-28.284 28.284a40 40 0 0 1-56.567-56.567l28.284-28.284a8 8 0 0 0-11.315-11.315l-28.284 28.284a56 56 0 0 0 79.196 79.197l28.285-28.285a8 8 0 1 0-11.315-11.314zM212.852 43.14a56.002 56.002 0 0 0-79.196 0l-28.284 28.284a8 8 0 1 0 11.314 11.314l28.284-28.284a40 40 0 0 1 56.568 56.567l-28.285 28.285a8 8 0 0 0 11.315 11.314l28.284-28.284a56.065 56.065 0 0 0 0-79.196z" fill="currentColor"></path></svg></span></a> <span>Hugging Face Generative AI Services (HUGS)</span></h1> <p data-svelte-h="svelte-1uzhp1c"><img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hugs/hugs-banner.png" alt="HUGS Banner"></p> <blockquote data-svelte-h="svelte-5ttw4z"><p>Optimized, zero-configuration inference microservices for open AI models</p></blockquote> <p data-svelte-h="svelte-1ddt8st">Hugging Face Generative AI Services (HUGS) are optimized, zero-configuration inference microservices designed to simplify and accelerate the development of AI applications with open models. Built on open-source Hugging Face technologies such as Text Generation Inference or Transformers. HUGS provides the best solution for efficiently building Generative AI Applications with open models and are optimized for a variety of hardware accelerators, including NVIDIA GPUs, AMD GPUs, AWS Inferentia, and Google TPUs (soon).</p> <h2 class="relative group"><a id="key-features" class="header-link block pr-1.5 text-lg no-hover:hidden with-hover:absolute with-hover:p-1.5 with-hover:opacity-0 with-hover:group-hover:opacity-100 with-hover:right-full" href="#key-features"><span><svg class="" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" aria-hidden="true" role="img" width="1em" height="1em" preserveAspectRatio="xMidYMid meet" viewBox="0 0 256 256"><path d="M167.594 88.393a8.001 8.001 0 0 1 0 11.314l-67.882 67.882a8 8 0 1 1-11.314-11.315l67.882-67.881a8.003 8.003 0 0 1 11.314 0zm-28.287 84.86l-28.284 28.284a40 40 0 0 1-56.567-56.567l28.284-28.284a8 8 0 0 0-11.315-11.315l-28.284 28.284a56 56 0 0 0 79.196 79.197l28.285-28.285a8 8 0 1 0-11.315-11.314zM212.852 43.14a56.002 56.002 0 0 0-79.196 0l-28.284 28.284a8 8 0 1 0 11.314 11.314l28.284-28.284a40 40 0 0 1 56.568 56.567l-28.285 28.285a8 8 0 0 0 11.315 11.314l28.284-28.284a56.065 56.065 0 0 0 0-79.196z" fill="currentColor"></path></svg></span></a> <span>Key Features</span></h2> <ul data-svelte-h="svelte-1svq8vt"><li><strong>Zero-configuration Deployment</strong>: Automatically loads optimal settings based on your hardware environment.</li> <li><strong>Optimized Hardware Inference Engines</strong>: Built on <a href="https://github.com/huggingface/text-generation-inference" rel="nofollow">Hugging Face’s Text Generation Inference (TGI)</a>, optimized for a variety of hardware.</li> <li><strong>Hardware Flexibility</strong>: Optimized for various accelerators, including NVIDIA GPUs, AMD GPUs, AWS Inferentia, and Google TPUs</li> <li><strong>Built for Open Models</strong>: Compatible with a wide range of popular open AI models, including LLMs, Multimodal Models, and Embedding Models.</li> <li><strong>Industry Standardized APIs</strong>: Easily deployable using Kubernetes and standardized on the OpenAI API.</li> <li><strong>Security and Control</strong>: Deploy HUGS within your own infrastructure for enhanced security and data control.</li> <li><strong>Enterprise Compliance</strong>: Minimizes compliance risks by including necessary licenses and terms of services.</li></ul> <h2 class="relative group"><a id="why-hugs" class="header-link block pr-1.5 text-lg no-hover:hidden with-hover:absolute with-hover:p-1.5 with-hover:opacity-0 with-hover:group-hover:opacity-100 with-hover:right-full" href="#why-hugs"><span><svg class="" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" aria-hidden="true" role="img" width="1em" height="1em" preserveAspectRatio="xMidYMid meet" viewBox="0 0 256 256"><path d="M167.594 88.393a8.001 8.001 0 0 1 0 11.314l-67.882 67.882a8 8 0 1 1-11.314-11.315l67.882-67.881a8.003 8.003 0 0 1 11.314 0zm-28.287 84.86l-28.284 28.284a40 40 0 0 1-56.567-56.567l28.284-28.284a8 8 0 0 0-11.315-11.315l-28.284 28.284a56 56 0 0 0 79.196 79.197l28.285-28.285a8 8 0 1 0-11.315-11.314zM212.852 43.14a56.002 56.002 0 0 0-79.196 0l-28.284 28.284a8 8 0 1 0 11.314 11.314l28.284-28.284a40 40 0 0 1 56.568 56.567l-28.285 28.285a8 8 0 0 0 11.315 11.314l28.284-28.284a56.065 56.065 0 0 0 0-79.196z" fill="currentColor"></path></svg></span></a> <span>Why HUGS?</span></h2> <p data-svelte-h="svelte-160yj32">Enterprises often struggle with their model-serving infrastructure in terms of performance, engineering complexity, and compliance when using open models. Early-stage startups and large enterprises have built POCs using models not because they want to use closed models with black box APIs but because building their AI with open models takes more work.</p> <p data-svelte-h="svelte-13ab7wg">HUGS are optimized, zero-configuration inference microservices designed to simplify and accelerate the development of AI models. With HUGS, we want to make switching from a closed-source API to a self-hosted open model easy.</p> <p data-svelte-h="svelte-1f3j6je">HUGS deliver endpoints compatible with the OpenAI API, so you don’t need to change your code when transitioning your POC to production with your model and infra. They automatically deliver maximum hardware efficiency. | |
| HUGS make it easy to keep your applications at the cutting edge of Generative AI by offering updates when new battle-tested open models become available.</p> <h3 class="relative group"><a id="built-for-open-models" class="header-link block pr-1.5 text-lg no-hover:hidden with-hover:absolute with-hover:p-1.5 with-hover:opacity-0 with-hover:group-hover:opacity-100 with-hover:right-full" href="#built-for-open-models"><span><svg class="" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" aria-hidden="true" role="img" width="1em" height="1em" preserveAspectRatio="xMidYMid meet" viewBox="0 0 256 256"><path d="M167.594 88.393a8.001 8.001 0 0 1 0 11.314l-67.882 67.882a8 8 0 1 1-11.314-11.315l67.882-67.881a8.003 8.003 0 0 1 11.314 0zm-28.287 84.86l-28.284 28.284a40 40 0 0 1-56.567-56.567l28.284-28.284a8 8 0 0 0-11.315-11.315l-28.284 28.284a56 56 0 0 0 79.196 79.197l28.285-28.285a8 8 0 1 0-11.315-11.314zM212.852 43.14a56.002 56.002 0 0 0-79.196 0l-28.284 28.284a8 8 0 1 0 11.314 11.314l28.284-28.284a40 40 0 0 1 56.568 56.567l-28.285 28.285a8 8 0 0 0 11.315 11.314l28.284-28.284a56.065 56.065 0 0 0 0-79.196z" fill="currentColor"></path></svg></span></a> <span>Built for Open Models</span></h3> <p data-svelte-h="svelte-15f75na">Compatible with a wide range of popular open AI models, including:</p> <ul data-svelte-h="svelte-1qzz1dc"><li>LLMs: Llama, Gemma, Mistral, Mixtral, Qwen, Deepseek (soon), T5 (soon), Yi (soon), Phi (soon), Command R (soon)</li> <li>(Soon) Multimodal Models: Idefics, Llava</li> <li>(Soon) Embedding Models: BGE, GTE, Mixbread, Arctic, Jina, Nomic</li></ul> <h2 class="relative group"><a id="getting-started" class="header-link block pr-1.5 text-lg no-hover:hidden with-hover:absolute with-hover:p-1.5 with-hover:opacity-0 with-hover:group-hover:opacity-100 with-hover:right-full" href="#getting-started"><span><svg class="" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" aria-hidden="true" role="img" width="1em" height="1em" preserveAspectRatio="xMidYMid meet" viewBox="0 0 256 256"><path d="M167.594 88.393a8.001 8.001 0 0 1 0 11.314l-67.882 67.882a8 8 0 1 1-11.314-11.315l67.882-67.881a8.003 8.003 0 0 1 11.314 0zm-28.287 84.86l-28.284 28.284a40 40 0 0 1-56.567-56.567l28.284-28.284a8 8 0 0 0-11.315-11.315l-28.284 28.284a56 56 0 0 0 79.196 79.197l28.285-28.285a8 8 0 1 0-11.315-11.314zM212.852 43.14a56.002 56.002 0 0 0-79.196 0l-28.284 28.284a8 8 0 1 0 11.314 11.314l28.284-28.284a40 40 0 0 1 56.568 56.567l-28.285 28.285a8 8 0 0 0 11.315 11.314l28.284-28.284a56.065 56.065 0 0 0 0-79.196z" fill="currentColor"></path></svg></span></a> <span>Getting Started</span></h2> <p data-svelte-h="svelte-1a9w6hv">To start using HUGS, you have several options. You can access HUGS as part of your Hugging Face Enterprise subscription, through Cloud Service Provider (CSP) marketplaces. Currently, you can find HUGS on Amazon Web Services (AWS) and Google Cloud Platform (GCP), and soon on Microsoft Azure. HUGS are also natively available inside DigitalOcean GPU Droplet.</p> <p data-svelte-h="svelte-m91r06">For detailed instructions on deployment and usage:</p> <ul data-svelte-h="svelte-loq3ok"><li><a href="https://huggingface.co/enterprise" rel="nofollow">Hugging Face Enterprise</a></li> <li>Amazon Web Services (AWS)<ul><li><a href="./how-to/cloud/aws.mdx">AWS with NVIDIA GPUs</a></li> <li><a href="./how-to/cloud/aws-neuron.mdx">AWS with Inferentia & Trainium</a></li></ul></li> <li><a href="./how-to/cloud/digital-ocean">DigitalOcean</a></li> <li><a href="./how-to/cloud/gcp">Google Cloud Platform (GCP)</a></li> <li><a href="./how-to/cloud/azure">Microsoft Azure</a> (coming soon)</li></ul> <h2 class="relative group"><a id="more-resources" class="header-link block pr-1.5 text-lg no-hover:hidden with-hover:absolute with-hover:p-1.5 with-hover:opacity-0 with-hover:group-hover:opacity-100 with-hover:right-full" href="#more-resources"><span><svg class="" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" aria-hidden="true" role="img" width="1em" height="1em" preserveAspectRatio="xMidYMid meet" viewBox="0 0 256 256"><path d="M167.594 88.393a8.001 8.001 0 0 1 0 11.314l-67.882 67.882a8 8 0 1 1-11.314-11.315l67.882-67.881a8.003 8.003 0 0 1 11.314 0zm-28.287 84.86l-28.284 28.284a40 40 0 0 1-56.567-56.567l28.284-28.284a8 8 0 0 0-11.315-11.315l-28.284 28.284a56 56 0 0 0 79.196 79.197l28.285-28.285a8 8 0 1 0-11.315-11.314zM212.852 43.14a56.002 56.002 0 0 0-79.196 0l-28.284 28.284a8 8 0 1 0 11.314 11.314l28.284-28.284a40 40 0 0 1 56.568 56.567l-28.285 28.285a8 8 0 0 0 11.315 11.314l28.284-28.284a56.065 56.065 0 0 0 0-79.196z" fill="currentColor"></path></svg></span></a> <span>More Resources</span></h2> <ul data-svelte-h="svelte-1lie4ja"><li><a href="https://discuss.huggingface.co/" rel="nofollow">Community Forum</a></li> <li><a href="https://huggingface.co/contact/sales?from=hugs" rel="nofollow">Enterprise Support</a></li></ul> <p data-svelte-h="svelte-ka1olc">Experience the power of open models with the simplicity of HUGS. Start building your AI applications faster and more efficiently today!</p> <a class="!text-gray-400 !no-underline text-sm flex items-center not-prose mt-4" href="https://github.com/huggingface/hugs-docs/blob/main/docs/source/index.mdx" target="_blank"><span data-svelte-h="svelte-1kd6by1"><</span> <span data-svelte-h="svelte-x0xyl0">></span> <span data-svelte-h="svelte-1dajgef"><span class="underline ml-1.5">Update</span> on GitHub</span></a> <p></p> | |
| <script> | |
| { | |
| __sveltekit_is8v4k = { | |
| assets: "/docs/hugs/pr_13/en", | |
| base: "/docs/hugs/pr_13/en", | |
| env: {} | |
| }; | |
| const element = document.currentScript.parentElement; | |
| const data = [null,null]; | |
| Promise.all([ | |
| import("/docs/hugs/pr_13/en/_app/immutable/entry/start.e16d698a.js"), | |
| import("/docs/hugs/pr_13/en/_app/immutable/entry/app.befdf950.js") | |
| ]).then(([kit, app]) => { | |
| kit.start(app, element, { | |
| node_ids: [0, 15], | |
| data, | |
| form: null, | |
| error: null | |
| }); | |
| }); | |
| } | |
| </script> | |
Xet Storage Details
- Size:
- 15.1 kB
- Xet hash:
- 67bcc1e223f667f9942a206ffd71cf6495ca8eaba15b646e14ca42ed36713499
·
Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.