MolmoWeb-4B / README.md
ravilution's picture
Initial upload: MolmoWeb-4B with HF/vLLM compatibility patches
b6e1d00 verified
---
license: apache-2.0
base_model:
- allenai/MolmoWeb-4B
language:
- en
pipeline_tag: image-text-to-text
library_name: transformers
tags:
- multimodal
- molmo
- molmo2
- web-agent
- full-precision
- vllm-compatible
---
# ravilution/MolmoWeb-4B
This is a **full-precision** Hugging Face– and **vLLM-compatible** release of [`allenai/MolmoWeb-4B`](https://huggingface.co/allenai/MolmoWeb-4B), a vision-based web agent model by Ai2 capable of navigating and interacting with web browsers.
It follows the same idea as [`ravilution/MolmoWeb-8B-8bit-mlx`](https://huggingface.co/ravilution/MolmoWeb-8B-8bit-mlx): a personal Hub copy with a clear description and practical loading notes—here for the **4B** dense checkpoint rather than an MLX quantization.
> **Note:** This is a **4B** parameter model (four `safetensors` shards). A few **post-download patches** were applied locally so tokenization and generation metadata match what downstream stacks (including vLLM) expect: `eos_token_id` / `bos_token_id` / `pad_token_id`, `transformers_version` in `config.json` and `generation_config.json`, and the tokenizer pretokenizer regex (Mistral-style `(?i:...)` fix). Patches are **idempotent** if you re-run them on a fresh download.
Refer to the [original model card](https://huggingface.co/allenai/MolmoWeb-4B) for benchmarks, architecture, training data, and intended use.
## Use with Transformers
```bash
pip install -U transformers accelerate torch pillow
```
```python
from transformers import AutoProcessor, AutoModelForImageTextToText
import torch
model_id = "ravilution/MolmoWeb-4B"
model = AutoModelForImageTextToText.from_pretrained(
model_id,
trust_remote_code=True,
torch_dtype=torch.float32,
attn_implementation="sdpa",
device_map="auto",
)
processor = AutoProcessor.from_pretrained(
model_id,
trust_remote_code=True,
padding_side="left",
)
```
## Provenance
- **Upstream weights:** `allenai/MolmoWeb-4B`
- **Changes on top:** compatibility patches only (config / generation_config / tokenizer metadata as above); **no** retraining or architectural edits.
## License
Apache 2.0 — see the [original model](https://huggingface.co/allenai/MolmoWeb-4B) for details. Please review Ai2’s Responsible Use Guidelines for intended use.