BeFM

Sleeping

App Files Files Community

BeFM

Commit History

Update default prompt and fix chat interface inputs

d298fc0

Jn-Huang commited on 26 days ago

update messages

77a4a60

Jn-Huang commited on 26 days ago

Customize BeFM UI and defaults

58ffc70

Jn-Huang commited on 26 days ago

Make code compatible with Gradio 6.0 structured content format

38dedc7

Jn-Huang commited on Dec 1, 2025

Fix Gradio ChatInterface: remove lambda wrapper, add lazy loading, make public

4cc1531

Jn-Huang commited on Dec 1, 2025

Switch to transformers version - vLLM uses too much memory on T4 GPU

fc3b3a2

Jn-Huang commited on Dec 1, 2025

Reduce vLLM GPU memory utilization to 0.7 to avoid OOM on T4 GPU

0600d50

Jn-Huang commited on Dec 1, 2025

Switch to vLLM for faster inference with lazy loading and multi-turn fix

89babab

Jn-Huang commited on Dec 1, 2025

Fix multi-turn conversation: handle dict history format

eaaeae1

Jn-Huang commited on Dec 1, 2025

Fix bugs: use token param, apply Llama 3.1 chat template, decode only new tokens

1a77428

Jn-Huang commited on Dec 1, 2025

Add Be.FM-8B chat interface with PEFT adapter

f6fde6f

Jn-Huang commited on Dec 1, 2025

initial commit

8e51924
verified

JinHuang1203 commited on Dec 1, 2025

Commit History

Update default prompt and fix chat interface inputs d298fc0

update messages 77a4a60

Customize BeFM UI and defaults 58ffc70

Make code compatible with Gradio 6.0 structured content format 38dedc7

Fix Gradio ChatInterface: remove lambda wrapper, add lazy loading, make public 4cc1531

Switch to transformers version - vLLM uses too much memory on T4 GPU fc3b3a2

Reduce vLLM GPU memory utilization to 0.7 to avoid OOM on T4 GPU 0600d50

Switch to vLLM for faster inference with lazy loading and multi-turn fix 89babab

Fix multi-turn conversation: handle dict history format eaaeae1

Fix bugs: use token param, apply Llama 3.1 chat template, decode only new tokens 1a77428

Add Be.FM-8B chat interface with PEFT adapter f6fde6f

initial commit 8e51924 verified

Update default prompt and fix chat interface inputs

d298fc0

update messages

77a4a60

Customize BeFM UI and defaults

58ffc70

Make code compatible with Gradio 6.0 structured content format

38dedc7

Fix Gradio ChatInterface: remove lambda wrapper, add lazy loading, make public

4cc1531

Switch to transformers version - vLLM uses too much memory on T4 GPU

fc3b3a2

Reduce vLLM GPU memory utilization to 0.7 to avoid OOM on T4 GPU

0600d50

Switch to vLLM for faster inference with lazy loading and multi-turn fix

89babab

Fix multi-turn conversation: handle dict history format

eaaeae1

Fix bugs: use token param, apply Llama 3.1 chat template, decode only new tokens

1a77428

Add Be.FM-8B chat interface with PEFT adapter

f6fde6f

initial commit

8e51924
verified