How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="yasserrmd/LFM2-700M-gguf",
	filename="",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

LFM2‑350M β€’ Quantized Version (GGUF)

Quantized GGUF version of the LiquidAI/LFM2-700M model.

  • βœ… Format: GGUF
  • βœ… Use with: liquid_llama.cpp
  • βœ… Supported precisions: Q4_0, Q4_K, etc.

Download

wget https://huggingface.co/yasserrmd/LFM2-700M-gguf/resolve/main/lfm2-700m.Q4_K.gguf

(Adjust filename for other quant formats like Q4_0, if available.)

Notes

  • Only compatible with liquid_llama.cpp (not llama.cpp).
  • Replace Q4_K with your chosen quant version.
Downloads last month
30
GGUF
Model size
0.7B params
Architecture
lfm2
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

32-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for yasserrmd/LFM2-700M-gguf

Quantized
(21)
this model