How to use TRI-ML/DCLM-1B with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("TRI-ML/DCLM-1B", dtype="auto")
Hello, approximately how many billion tokens were trained before MMLU exceeded 30 points?
Β· Sign up or log in to comment