llmware
/

phi-4-mini-gguf

Model card Files Files and versions

phi-4-mini-gguf / README.md

doberst's picture

Update README.md

206abe3 verified 7 months ago

|

history blame contribute delete

973 Bytes

	---
	license: apache-2.0
	inference: false
	base_model: microsoft/phi-4-mini-instruct
	base_model_relation: quantized
	tags: [green, llmware-chat, p4, gguf,emerald]
	---

	# phi-4-mini-instruct-gguf

	phi-4-mini-instruct-gguf is a GGUF Q4_K_M int4 quantized version of [phi-4-mini-instruct](https://huggingface.co/microsoft/Phi-4-mini-instruct), providing a very fast inference implementation, optimized for AI PCs.

	This model will run on an AI PC with at least 16 GB of memory.

	### Model Description

	- Developed by: Microsoft
	- Model type: phi-4-mini
	- Parameters: 4 billion
	- Model Parent: Microsoft/Phi-4-Mini-Instruct
	- Language(s) (NLP): English
	- License: Apache 2.0
	- Uses: Chat, general-purpose LLM
	- Quantization: int4


	## Model Card Contact

	[llmware on github](https://www.github.com/llmware-ai/llmware)

	[llmware on hf](https://www.huggingface.co/llmware)

	[llmware website](https://www.llmware.ai)