Text Generation
Transformers
Safetensors
stripedhyena
long context
deep signal processing
hybrid
biology
genomics
custom_code
Instructions to use togethercomputer/evo-1-131k-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use togethercomputer/evo-1-131k-base with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="togethercomputer/evo-1-131k-base", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("togethercomputer/evo-1-131k-base", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use togethercomputer/evo-1-131k-base with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "togethercomputer/evo-1-131k-base" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "togethercomputer/evo-1-131k-base", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/togethercomputer/evo-1-131k-base
- SGLang
How to use togethercomputer/evo-1-131k-base with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "togethercomputer/evo-1-131k-base" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "togethercomputer/evo-1-131k-base", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "togethercomputer/evo-1-131k-base" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "togethercomputer/evo-1-131k-base", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use togethercomputer/evo-1-131k-base with Docker Model Runner:
docker model run hf.co/togethercomputer/evo-1-131k-base
Commit History
Update config.json d1ee1bd verified
Update model.py e7b43af verified
Update model.py b07b34f verified
Update model.py e84e201 verified
Update config.json 0dba1f5 verified
Update config.json ad9eb94 verified
Update config.json 2a542d3 verified
Update model.py 3b191f9 verified
Update model.py d726368 verified
Update model.py ea9486a verified
Update config.json 281dab8 verified
Update config.json a8a1f74 verified
Update config.json a62b486 verified
Update model.py cd7d86b verified
Update model.py 5bbaeb8 verified
Update model.py 3668f68 verified
Update model.py 870e350 verified
Update config.json bb6b053 verified
Update modeling_hyena.py b87e4df verified
Update config.json 1d23434 verified
Update modeling_hyena.py 28ff2e4 verified
Update config.json c63a55a verified
Update model.py 2ec9f03 verified
Update modeling_hyena.py d849f5b verified
Update model.py 25b693b verified
Support gradient checkpointing e87428b verified
Update README.md 567369e verified
Update README.md 1ec26ca verified
fix: fa key in config update 9562f3f verified
Update README.md 8eb9480 verified
Update README.md 36216cf verified
Update batch encode plus first argument to match HF convention. 28fe393 verified
Update model.py 60eb4c7 verified
Update README.md 4a59285 verified
add pt ckpt c9e2eda
Zymrael commited on
Update README.md 3fa0ba0 verified
Fix auto tokenizer import reference format in auto map as list for slow and fast. a9a132f
Pragaash commited on