mishig/chat-ui: src/lib/server/models.ts

Commit History

refactor: new API & universal load functions (pt. 2) (#1847)

7d7a53f
unverified

nsarrazin committed on Jun 5, 2025

Revert "refactor: new API & universal load functions (#1743)"

91e839b

nsarrazin committed on Jun 5, 2025

refactor: new API & universal load functions (#1743)

35a395c
unverified

nsarrazin committed on Jun 5, 2025

fix: only import node-llama-cpp if needed and skip for huggingchat image

d8e426c

nsarrazin committed on May 16, 2025

New `InferenceClient` endpoint type (#1813)

e08e6dd
unverified

nsarrazin committed on May 13, 2025

feat: allow storing env variable in DB (#1802)

31daf3d
unverified

nsarrazin committed on Apr 24, 2025

feat: allow strings in `MODELS` env var (#1791)

6a449cd
unverified

nsarrazin committed on Apr 11, 2025

feat: add a local endpoint type for inference directly from chat-ui (#1778)

4e9a7a9
unverified

nsarrazin committed on Apr 4, 2025

fix: handle no beginToken for token based reasoning models (#1713)

6647bbf
unverified

nsarrazin committed on Feb 12, 2025

chores(svelte): migration to svelte 5 (#1685)

21b8785
unverified

nsarrazin committed on Feb 7, 2025

feat: improve tool calling & add tools to qwen 2.5 72b (#1615)

f7aef71
unverified

nsarrazin committed on Dec 6, 2024

feat: UI for advanced reasoning models (#1605)

764ecdf
unverified

nsarrazin committed on Dec 2, 2024

Fix/oai parameters 1552 (#1557)

52dfa8c
unverified

evalstate committed on Nov 11, 2024

fix: only show playground button for models that are available

bad5446
unverified

nsarrazin committed on Nov 5, 2024

feat: Add a model config option to disable system prompts (#1539)

d4c7ddb
unverified

Mounayer, nsarrazin committed on Oct 29, 2024

fix: use custom tool template only for huggingchat

b36dfab
unverified

nsarrazin committed on Oct 25, 2024

feat: use model id as fallback for finding a tokenizer (#1535)

2b554f9
unverified

nsarrazin committed on Oct 21, 2024

feat(models): add `nvidia/Llama-3.1-Nemotron-70B-Instruct-HF` (#1527)

d852d1b
unverified

nsarrazin committed on Oct 16, 2024

[vertex] Add PDF/plein texts support (#1520)

a5e332d
unverified

goupilew committed on Oct 15, 2024

feat: add link to API playground for compatible models (#1488)

baa5d2f
unverified

nsarrazin committed on Sep 25, 2024

feat(conv): let user switch models on conversations with deprecated models (#1462)

68972f0
unverified

nsarrazin committed on Sep 11, 2024

feat(models): add transferTo property on old models (#1453)

a1ae528
unverified

nsarrazin committed on Sep 10, 2024

fix(continue): fix continue feature (#1459)

4f61809
unverified

nsarrazin committed on Sep 10, 2024

Add support for Anthropic models via AWS Bedrock (#1413)

b6274e8
unverified

ABarLT, nsarrazin committed on Aug 26, 2024

feat(assistants): use community tools in assistants (#1421)

3bb3aef
unverified

nsarrazin committed on Aug 21, 2024

chores(deps): update Transformers to latest version and remove chat templates for all models (#1414)

f3064dd
unverified

nsarrazin committed on Aug 16, 2024

Community tools (#1250)

b8228c1
unverified

nsarrazin committed on Aug 12, 2024

fix(logs): improve logging

71f3ffb
unverified

nsarrazin committed on Jul 30, 2024

feat(models): Llama 3.1 support (#1355)

79b41de
unverified

nsarrazin committed on Jul 23, 2024

Add Google Gemini API Support (#1330)

6f1638a
unverified

toandev committed on Jul 18, 2024

Refactor logger error messages for consistency. (#1257)

fa0afa9
unverified

Denzel Mendez committed on Jun 10, 2024

Function calling (#996)

faa93d9
unverified

nsarrazin, victor (HF Staff), Liam Dyer, Mishig committed on May 23, 2024

Generic Multimodal Support Fixed (#1147)

b4690c8
unverified

Liam Dyer, nsarrazin, Mishig committed on May 17, 2024

Revert "Generic Multimodal Support" (#1146)

2a40544
unverified

Mishig committed on May 15, 2024

Generic Multimodal Support (#1021)

f493225
unverified

Liam Dyer, nsarrazin, Mishig committed on May 15, 2024

feat: add support for anthropic on vertex (#958)

76d1477
unverified

Jun Siang Cheah, nsarrazin committed on May 7, 2024

Move vars to dynamic, add metrics (#1085)

0e5ff83
unverified

nsarrazin, rtrm (HF Staff), victor (HF Staff) committed on May 3, 2024

Use pino for logs (#1086)

bb6f8cb
unverified

nsarrazin committed on Apr 30, 2024

Kill app on tokenizer load fail (#1044)

4b62530
unverified

nsarrazin committed on Apr 21, 2024

Add logs to tokenizer fetch (#1030)

ac3fd05
unverified

nsarrazin committed on Apr 18, 2024

Add langserve endpoint (#1009)

a9e4746
unverified

Antonio Ramos antoniora committed on Apr 16, 2024

Move default template so it doesn't override tokenizer (#987)

0598c3f
unverified

nsarrazin committed on Apr 8, 2024

Set ChatML as default chat prompt template (#985)

97f9784
unverified

nsarrazin committed on Apr 8, 2024

Add support for cohere endpoints (#976)

1aafdd3
unverified

nsarrazin committed on Apr 4, 2024

Use jinja template for chat formatting (#730) (#744)

3d931f5
unverified

nsarrazin committed on Apr 4, 2024

Implement Cloudflare Workers AI endpoint (#907) (#972)

ac2d8ff
unverified

nsarrazin committed on Apr 3, 2024

Google Vertex API support (#950)

48f1340
unverified

madppiper committed on Apr 2, 2024

Count system prompt tokens (#850)

eb2ef82
unverified

Mishig, nsarrazin committed on Mar 22, 2024

Anthropic Endpoint Support (#923)

1db019c
unverified

Liam Dyer committed on Mar 15, 2024

Add gemma model to prod config (#854)

d3928c1
unverified

nsarrazin committed on Feb 21, 2024
