File size: 2,289 Bytes
b6e1d00
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
---
license: apache-2.0
base_model:
- allenai/MolmoWeb-4B
language:
- en
pipeline_tag: image-text-to-text
library_name: transformers
tags:
- multimodal
- molmo
- molmo2
- web-agent
- full-precision
- vllm-compatible
---

# ravilution/MolmoWeb-4B

This is a **full-precision** Hugging Face– and **vLLM-compatible** release of [`allenai/MolmoWeb-4B`](https://huggingface.co/allenai/MolmoWeb-4B), a vision-based web agent model by Ai2 capable of navigating and interacting with web browsers.

It follows the same idea as [`ravilution/MolmoWeb-8B-8bit-mlx`](https://huggingface.co/ravilution/MolmoWeb-8B-8bit-mlx): a personal Hub copy with a clear description and practical loading notes—here for the **4B** dense checkpoint rather than an MLX quantization.

> **Note:** This is a **4B** parameter model (four `safetensors` shards). A few **post-download patches** were applied locally so tokenization and generation metadata match what downstream stacks (including vLLM) expect: `eos_token_id` / `bos_token_id` / `pad_token_id`, `transformers_version` in `config.json` and `generation_config.json`, and the tokenizer pretokenizer regex (Mistral-style `(?i:...)` fix). Patches are **idempotent** if you re-run them on a fresh download.

Refer to the [original model card](https://huggingface.co/allenai/MolmoWeb-4B) for benchmarks, architecture, training data, and intended use.

## Use with Transformers

```bash
pip install -U transformers accelerate torch pillow
```

```python
from transformers import AutoProcessor, AutoModelForImageTextToText
import torch

model_id = "ravilution/MolmoWeb-4B"

model = AutoModelForImageTextToText.from_pretrained(
    model_id,
    trust_remote_code=True,
    torch_dtype=torch.float32,
    attn_implementation="sdpa",
    device_map="auto",
)
processor = AutoProcessor.from_pretrained(
    model_id,
    trust_remote_code=True,
    padding_side="left",
)
```

## Provenance

- **Upstream weights:** `allenai/MolmoWeb-4B`
- **Changes on top:** compatibility patches only (config / generation_config / tokenizer metadata as above); **no** retraining or architectural edits.

## License

Apache 2.0 — see the [original model](https://huggingface.co/allenai/MolmoWeb-4B) for details. Please review Ai2’s Responsible Use Guidelines for intended use.