Running 230 FineVision: Open Data is All You Need π 230 A new open-source dataset for training VLMs