Update README.md

Browse files

Files changed (1) hide show

README.md +57 -17

README.md CHANGED Viewed

@@ -1,23 +1,63 @@
 ---
-library_name: mlx
-license: other
-license_name: lfm1.0
-license_link: LICENSE
 language:
 - en
-- ar
-- zh
-- fr
-- de
-- ja
-- ko
-- es
-pipeline_tag: text-generation
 tags:
-- liquid
-- lfm2
-- edge
-- moe
 - mlx
-base_model: LiquidAI/LFM2-8B-A1B
 ---

 ---
+model-index:
+- name: LFM2-8B-A1B — MLX (Apple Silicon), **8-bit**
+  results: []
+license: apache-2.0
 language:
 - en
 tags:
 - mlx
+- apple-silicon
+- text-generation
+- 8bit
+- quantized
+- 8b
+- MoE
+- Mixture of Experts
+pipeline_tag: text-generation
+library_name: mlx
+---
+# LFM2-8B-A1B — **MLX 8-bit** (Apple Silicon)
+**Maintainer / Publisher:** [**Susant Achary**](https://huggingface.co/Susant-Achary)
+This repository provides an **Apple-Silicon-optimized MLX build** of **LFM2-8B-A1B** with **8-bit** weight quantization.
+The goal is a **drop-in, on-device** experience on M-series Macs with **maximal fidelity** among quantized variants while keeping load times small and setup simple.
+> Source model: `mlx-community/LFM2-8B-A1B-8bit-MLX` (Apache-2.0).
+> Format: **MLX** (Metal/MPS), ready for `mlx_lm.generate`.
+---
+## 🔎 Model at a glance
+- **Type:** 8B-parameter decoder-only language model (dense Transformer family).
+- **This build:** **8-bit** quantized **MLX** weights for fast, Apple-native inference.
+- **Typical uses:** instruction following, summarization, drafting, QA, basic code/text utilities.
+> If you need a smaller RAM footprint on older/lower-RAM Macs, consider lower-bit MLX builds (4/5/6-bit). If you want the **closest behavior to FP16** while staying in MLX, **8-bit** is the right choice.
+---
+## 📦 Files in this repo
+- `config.json` (MLX config)
+- `mlx_model*.safetensors` (**8-bit** sharded weights)
+- `tokenizer.json`, `tokenizer_config.json`
+- `model_index.json` and basic metadata
+All assets are arranged for **direct loading** via `mlx_lm`.
 ---
+## 🚀 Quickstart (CLI — MLX)
+**Deterministic generation**
+```bash
+python -m mlx_lm.generate \
+  --model mlx-community/LFM2-8B-A1B-8bit-MLX \
+  --prompt "Summarize the following notes into 5 bullet points:\n<your text>" \
+  --max-tokens 256 \
+  --temperature 0.0 \
+  --device mps \
+  --seed 0