--- license: llama3 base_model: meta-llama/Llama-3.1-8B-Instruct tags: - biology - protein - molecule - dna - rna - multimodal - structure-grounded --- # Cuttlefish **Cuttlefish** is a unified all-atom multimodal LLM that grounds language reasoning in geometric cues while scaling structural tokens with structural complexity. Built on Llama-3.1-8B-Instruct, it extends the base LLM with a graph encoder and a Scaling-Aware Patching connector for processing proteins, molecules, DNA, and RNA structures. ## Quick start ```python from huggingface_hub import snapshot_download # Download model local_dir = snapshot_download("zihaojing/Cuttlefish") # Run inference (requires cuttlefish codebase) # python src/runner/inference.py --config configs/inference/octopus_8B_s3_v1_5.yaml ``` ## Input format Cuttlefish accepts a unified parquet schema with structural graph columns: | Field | Description | |---|---| | `modality` | `"molecule"`, `"protein"`, `"dna"`, or `"rna"` | | `node_feat` | Atom/node features (N × d) | | `pos` | 3D coordinates in Å (N × 3) | | `edge_index` | Spatial graph edges in COO (2 × E) | | `messages` | Chat-style instruction with `` token | The `` placeholder in the user message is replaced by the encoded structural tokens at inference time. ## Training details - **Base model**: Llama-3.1-8B-Instruct - **Encoder**: [Cuttlefish-Encoder](https://huggingface.co/zihaojing/Cuttlefish-Encoder) (pretrained on all-atom graph data) - **SFT data**: [Cuttlefish-SFT-Data](https://huggingface.co/datasets/zihaojing/Cuttlefish-SFT-Data) - **Training stages**: 2-stage SFT — connector training then full LLM fine-tuning with LoRA ## Related resources | Resource | Link | |---|---| | Cuttlefish-Encoder | [zihaojing/Cuttlefish-Encoder](https://huggingface.co/zihaojing/Cuttlefish-Encoder) | | SFT instruction data | [zihaojing/Cuttlefish-SFT-Data](https://huggingface.co/datasets/zihaojing/Cuttlefish-SFT-Data) | | Encoder pretraining data | [zihaojing/Cuttlefish-Encoder-Data](https://huggingface.co/datasets/zihaojing/Cuttlefish-Encoder-Data) |