Instructions to use EnjiXiong/AIAA4051-FinalProject-PPNL with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use EnjiXiong/AIAA4051-FinalProject-PPNL with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="EnjiXiong/AIAA4051-FinalProject-PPNL")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("EnjiXiong/AIAA4051-FinalProject-PPNL", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use EnjiXiong/AIAA4051-FinalProject-PPNL with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "EnjiXiong/AIAA4051-FinalProject-PPNL" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "EnjiXiong/AIAA4051-FinalProject-PPNL", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/EnjiXiong/AIAA4051-FinalProject-PPNL
- SGLang
How to use EnjiXiong/AIAA4051-FinalProject-PPNL with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "EnjiXiong/AIAA4051-FinalProject-PPNL" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "EnjiXiong/AIAA4051-FinalProject-PPNL", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "EnjiXiong/AIAA4051-FinalProject-PPNL" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "EnjiXiong/AIAA4051-FinalProject-PPNL", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use EnjiXiong/AIAA4051-FinalProject-PPNL with Docker Model Runner:
docker model run hf.co/EnjiXiong/AIAA4051-FinalProject-PPNL
AIAA 4051 Final Project β PPNL Grid Path Planning (full project archive)
This Hugging Face repository is the full project tree for an AIAA 4051 final project on prompting and fine-tuning seq2seq language models for grid path planning, evaluated on the PPNL benchmark (Aghzal et al., ICLR 2024 Workshop).
The lighter-weight code repository lives on GitHub: π https://github.com/EnjiXiong/AIAA4051-FinalProject-PPNL
This HF mirror exists to host artifacts that are too large for GitHub β primarily 13 fine-tuned T5/BART checkpoints (~13 GB) and the PPNL upstream reference code.
What's in here
P3/
βββ grid-path-planning/ β the project itself (code + data + results)
β βββ *.py, *.sh β all source β same as the GitHub repo
β βββ data/ β PPNL benchmark JSONs + custom OOD set
β βββ evaluate/ β upstream PPNL executor scripts (kept for traceability)
β βββ models/<run>/best/ β β the fine-tuned checkpoints (HF-only)
β βββ results/ β per-config metric tables and per-sample predictions
β βββ visualizations/ β case-study plots used in the report
β βββ README.md β project documentation, results table, file walkthrough
β βββ CLAUDE.md β orientation for Claude Code agents
β βββ requirements.txt
βββ llms-as-path-planners/ β upstream PPNL reference code (Aghzal et al., for traceability)
Loading a checkpoint
pip install huggingface_hub
hf download EnjiXiong/AIAA4051-FinalProject-PPNL \
--include "grid-path-planning/models/sft_multiscale_40ep/**/best/**" \
--local-dir .
Then from the GitHub repo's working directory:
python tree_search_eval.py \
--model_dir grid-path-planning/models/sft_multiscale_40ep/t5-base_vanilla_ep40_lr0.0003/best \
--input_format vanilla --beam_width 4
This reproduces the headline result β 100% success across all five canonical PPNL test sets and 94.3% on the custom 1500-sample novel-grid-size set (4Γ4β10Γ10) β by combining a multi-scale-trained T5-base with executor-guarded beam search at inference. See the GitHub README for the full results table and file-by-file documentation.
Available checkpoints
All under grid-path-planning/models/<run>/.../best/:
| Run | Description |
|---|---|
t5-small_vanilla_ep20_lr0.0003 |
T5-small SFT, 6Γ6 only |
t5-base_vanilla_ep15_lr0.0003 |
T5-base SFT, 6Γ6 only |
t5-base_structured_ep15_lr0.0003 |
T5-base SFT, structured input format |
t5-base_cot_ep15_lr0.0003 |
T5-base SFT, CoT (coordinate-tracking) target |
bart-base_vanilla_ep15_lr0.0003, bart-base_vanilla_ep30_lr5e-05 |
BART-base SFT |
sft2k_vanilla |
T5-base SFT warm-start on 2k-sample subset |
sft2k |
T5-base SFT warm-start, structured |
sft_multiscale_warmstart |
Multi-scale (5Γ5β7Γ7) warm-start, 5 ep |
sft_multiscale |
Multi-scale, 15 ep |
sft_multiscale_40ep β |
Best SFT: multi-scale 5Γ5β7Γ7, 40 ep β used with tree search for headline number |
rl/grpo_t5-base_* |
GRPO RL on top of vanilla and structured SFTs |
final/ (last-epoch) snapshots are not mirrored β only the best-validation
checkpoint of each run is uploaded. The repo README.md and CLAUDE.md inside
grid-path-planning/ document everything.
Citation
Underlying benchmark:
@inproceedings{aghzal2024can,
title={Can Large Language Models be Good Path Planners? A Benchmark and
Investigation on Spatial-temporal Reasoning},
author={Aghzal, Mohamed and Plaku, Erion and Yao, Ziyu},
booktitle={ICLR 2024 Workshop on LLM Agents},
year={2024}
}
Upstream PPNL code: https://github.com/MohamedAghzal/llms-as-path-planners