--- license: mit library_name: mlx pipeline_tag: text-generation tags: - transformers - mlx base_model: meituan-longcat/LongCat-Flash-Chat --- # mlx-community/LongCat-Flash-Chat-4bit This model [mlx-community/LongCat-Flash-Chat-4bit](https://huggingface.co/mlx-community/LongCat-Flash-Chat-4bit) was converted to MLX format from [meituan-longcat/LongCat-Flash-Chat](https://huggingface.co/meituan-longcat/LongCat-Flash-Chat) using mlx-lm version **0.27.0**. ## Use with mlx ```bash pip install mlx-lm ``` ```python from mlx_lm import load, generate model, tokenizer = load("mlx-community/LongCat-Flash-Chat-4bit") prompt = "hello" if tokenizer.chat_template is not None: messages = [{"role": "user", "content": prompt}] prompt = tokenizer.apply_chat_template( messages, add_generation_prompt=True ) response = generate(model, tokenizer, prompt=prompt, verbose=True) ```