---
language: en
license: mit
tags:
- physics
- simulation
- language-model
- icml2026
---
# PhysicsLM
Anonymous submission for ICML 2026: **"PhysicsLM: Autoregressive Language Modeling of 2D Rigid Body Dynamics"**

PhysicsLM fine-tunes LFM2-350M (LiquidAI) with LoRA on 900K 2D rigid-body physics scenes. The model learns to predict the next simulation state, written as structured decimal text.
## Model details
- **Base model**: LiquidAI/LFM2-350M
- **Fine-tuning**: LoRA (r=32, alpha=64), 5-stage curriculum on PhysicsScenes
- **Task**: Next-frame physics prediction (autoregressive text generation)
- **Format**: structured decimal text encoding of 2D object states
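
The LoRA settings above (r=32, alpha=64) can be illustrated with a minimal sketch of the standard LoRA update, in which a frozen weight `W` is adjusted by a low-rank product scaled by `alpha / r`. The layer sizes, the random initialization, and the helper names `matmul` and `lora_merged_weight` are assumptions made for illustration; the card does not state which layers of LFM2-350M are adapted.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_merged_weight(w, a, b, r, alpha):
    """Return W + (alpha / r) * B @ A, the effective weight after LoRA.

    w: d_out x d_in frozen base weight
    a: r x d_in trainable down-projection
    b: d_out x r trainable up-projection
    """
    scale = alpha / r
    delta = matmul(b, a)
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


# Rank and alpha as listed in the model card.
R, ALPHA = 32, 64
# Toy layer sizes, chosen only for illustration (not the real model dimensions).
D_IN, D_OUT = 4, 3

rng = random.Random(0)
w = [[rng.gauss(0, 1) for _ in range(D_IN)] for _ in range(D_OUT)]
a = [[rng.gauss(0, 0.01) for _ in range(D_IN)] for _ in range(R)]
b = [[0.0] * R for _ in range(D_OUT)]  # B starts at zero, so the update starts at zero

merged = lora_merged_weight(w, a, b, R, ALPHA)
print(ALPHA / R)    # scaling factor applied to the low-rank update
print(merged == w)  # the merged weight equals W while B is still all zeros
```

With r=32 and alpha=64 the low-rank update is scaled by a factor of 2 before it is added to the frozen weight.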
## Results (seen scenarios)
| Category | PhysicsLM RMSE (px) | Copy-last RMSE | Linear extrapolation RMSE |
|----------|---------------------|----------------|---------------------------|
| Stacking | 2.60 | 6.72 | 0.06 |
| Constraint | 1.35 | 4.99 | 0.06 |
| Collision | 5.37 | 7.69 | 0.09 |
| Ramp | 18.85 | ... | 0.19 |
| Minigame | 36.14 | ... | 0.09 |
| Complex | 109.57 | ... | 0.04 |

Out-of-distribution (OOD) evaluation: 0.94 px RMSE on near-distribution scenes and 24.79 px RMSE on novel OOD scenes. Parse failure rate: 0.0%.
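
For readers unfamiliar with the two baselines in the table, the sketch below shows one plausible way to compute them. It assumes that a frame is a list of `(x, y)` positions in pixels and that RMSE is averaged over every coordinate of every object; the card does not specify the exact error definition, state variables, or prediction horizon, so the function names and data layout here are illustrative only.

```python
import math


def rmse(pred, target):
    """Root-mean-square error over all coordinates of all objects in a frame."""
    sq = [(p - t) ** 2 for pf, tf in zip(pred, target) for p, t in zip(pf, tf)]
    return math.sqrt(sum(sq) / len(sq))


def copy_last(history):
    """Baseline: predict that every object stays where it was in the last frame."""
    return [tuple(obj) for obj in history[-1]]


def linear_extrapolation(history):
    """Baseline: continue at constant velocity, x_t = 2 * x_{t-1} - x_{t-2}."""
    prev, last = history[-2], history[-1]
    return [
        tuple(2 * l - p for l, p in zip(last_obj, prev_obj))
        for last_obj, prev_obj in zip(last, prev)
    ]


# Toy example: one object moving at a constant (3, 4) px per frame.
history = [[(0.0, 0.0)], [(3.0, 4.0)]]
target = [(6.0, 8.0)]

print(rmse(copy_last(history), target))             # off by one frame of motion
print(rmse(linear_extrapolation(history), target))  # 0.0 for constant velocity
```

In this toy case linear extrapolation is exact because the velocity is constant, while copy-last is off by one frame of displacement.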
## Usage
```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "anonsubmiticml2026/PhysicsLM"

# Load the tokenizer and the model in bfloat16 on the GPU.
# Passing device_map requires the `accelerate` package.
tok = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,
    device_map="cuda",
)
# See paper for text encoding format
```
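
Once `tok` and `model` are loaded, next-frame prediction can be chained into a multi-step rollout. The following is a rough sketch, not the authors' inference code: it assumes each frame is a single line of text and that a prompt is the most recent frames joined by newlines, which may differ from the encoding described in the paper. The names `rollout`, `make_hf_predictor`, and the `context` parameter are hypothetical.

```python
def rollout(predict_next, frames, steps, context=4):
    """Autoregressively extend a list of text-encoded frames.

    predict_next: callable mapping a prompt string to the next frame's text
    frames: already observed frames, each one a string
    steps: number of future frames to generate
    context: number of most recent frames placed in each prompt (hypothetical)
    """
    frames = list(frames)
    for _ in range(steps):
        prompt = "\n".join(frames[-context:]) + "\n"
        frames.append(predict_next(prompt).strip())
    return frames


def make_hf_predictor(tok, model, max_new_tokens=128):
    """Wrap a loaded tokenizer and model as a greedy `predict_next` callable."""

    def predict_next(prompt):
        inputs = tok(prompt, return_tensors="pt").to(model.device)
        out = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
        # Keep only the newly generated tokens, then cut at the first line break
        # (assumes one frame per line, which is not confirmed by the card).
        new_tokens = out[0, inputs["input_ids"].shape[1]:]
        text = tok.decode(new_tokens, skip_special_tokens=True)
        return text.strip().split("\n")[0]

    return predict_next


# Stand-in predictor so the sketch runs without downloading the model:
# it treats each frame as an integer and returns the next one.
def dummy_predictor(prompt):
    return str(int(prompt.strip().split("\n")[-1]) + 1)


print(rollout(dummy_predictor, ["1", "2"], steps=3))
```

To use the real model, replace `dummy_predictor` with `make_hf_predictor(tok, model)` and pass frames encoded in the format the paper describes.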
## Dataset
- **Training data**: [anonsubmiticml2026/PhysicsScenes](https://huggingface.co/datasets/anonsubmiticml2026/PhysicsScenes)
- **Code**: [anonsubmiticml2026/physics-llm-paper](https://huggingface.co/anonsubmiticml2026/physics-llm-paper)