PyTorch
Transformers
English
confidence-cartography
interpretability
causal-lm
confidence-calibration
mandela-effect
false-belief-detection
teacher-forcing
rho-eval
alignment
rho-guided-sft
contrastive-loss
calibration-repair
behavioral-audit
steering-vectors
mechanistic-interpretability
fidelity-bench
pythia
llama
mistral
qwen
gpt2
Eval Results (legacy)
Instructions to use bsanch52/confidence-cartography with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use bsanch52/confidence-cartography with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("bsanch52/confidence-cartography", dtype="auto") - Notebooks
- Google Colab
- Kaggle
Ctrl+K