Marco310 commited on
Commit
e127d7d
·
1 Parent(s): 684c8a3

# ⚡ Model Registry Optimization (Fast Mode)

Browse files

## Changes
- **Removed Llama 3.3 70B**: Deprecated due to stability issues in structured output (Ref: agno-agi/agno#4090).
- **Added Qwen 2.5 32B (`qwen-2.5-32b`)**: New default for Fast Mode. Chosen for its superior performance in JSON generation and logic reasoning at lower latency.
- **Added GPT-OSS 20B (`openai/gpt-oss-20b`)**: Lightweight alternative for ultra-fast data retrieval tasks.
-- update README.md

## Impact
- Improves "Fast Mode" stability for Tool Calling (Scout/Navigator).
- Reduces latency for intermediate reasoning steps.

Files changed (1) hide show
  1. app.py +1 -1
app.py CHANGED
@@ -584,7 +584,7 @@ class LifeFlowAI:
584
  def main():
585
  app = LifeFlowAI()
586
  demo = app.build_interface()
587
- demo.launch(server_name="0.0.0.0", server_port=8080, share=True, show_error=True)
588
  #7860
589
  if __name__ == "__main__":
590
  main()
 
584
  def main():
585
  app = LifeFlowAI()
586
  demo = app.build_interface()
587
+ demo.launch(server_name="0.0.0.0", server_port=7860, share=True, show_error=True)
588
  #7860
589
  if __name__ == "__main__":
590
  main()