Update README: change base model to Llama-3.1-70B and improve formatting 7f5a09a verified pentagoniac commited on Sep 5, 2025
Update README: set max_model_len to 8192 for optimal performance c99e12f verified pentagoniac commited on Sep 2, 2025
Update README with 120k context and 2000 token examples c964c9e verified pentagoniac commited on Sep 2, 2025
Update README: correct model name, 128k context, working vLLM example 8de19cf verified pentagoniac commited on Sep 2, 2025