trying new models that could accept the HuggingFaceEndpoint inference API 6de20f5 teofizzy committed 9 days ago
switched to Qwen 32B-Instruct due to inference provider support 81d32dd teofizzy committed 10 days ago
changed to use Hugging Face serverless endpoint with local CPU as a fallback f8266e7 teofizzy committed 10 days ago
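The fallback commit above describes a remote-first, local-CPU-second pattern. A minimal sketch of that pattern, with hypothetical stand-in callables in place of the real serverless and local clients (the actual client code is not shown in this log):

```python
def generate_with_fallback(prompt, remote_fn, local_fn):
    """Try the remote serverless endpoint first; fall back to local CPU inference."""
    try:
        return remote_fn(prompt)
    except Exception:
        # Remote endpoint unavailable (rate limit, cold start, network error):
        # degrade gracefully to the slower local CPU model.
        return local_fn(prompt)

# Hypothetical stand-ins for the real remote/local inference clients:
def remote(prompt):
    raise ConnectionError("serverless endpoint unreachable")

def local(prompt):
    return f"[local-cpu] {prompt}"

print(generate_with_fallback("hello", remote, local))  # → [local-cpu] hello
```

In practice `remote_fn` would wrap something like a `HuggingFaceEndpoint` call and `local_fn` a locally loaded model; catching a narrower exception type than `Exception` would be preferable once the remote client's failure modes are known.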
changed default Ollama network port to match Hugging Face default bba20c4 teofizzy committed 22 days ago
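The port-change commit can be sketched as a small config helper. Ollama's built-in default port is 11434 and it honors the real `OLLAMA_HOST` environment variable; the assumption here is that the "huggingface default" means 7860, the default app port on Hugging Face Spaces:

```python
import os

def ollama_base_url(default_port=7860):
    """Build the Ollama base URL, letting OLLAMA_HOST override host:port.

    default_port=7860 is an assumption (Hugging Face Spaces app port);
    Ollama itself ships with 11434 when no override is set.
    """
    host = os.environ.get("OLLAMA_HOST") or f"127.0.0.1:{default_port}"
    return f"http://{host}"
```

With `OLLAMA_HOST` unset this yields `http://127.0.0.1:7860`; setting `OLLAMA_HOST=0.0.0.0:11434` restores Ollama's own default.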