COMPASS research group at ELLIS Institute Tübingen

non-profit
http://s-abdelnabi.github.io
compass-group-tue


Recent Activity

kaethy  new activity about 2 hours ago
compass-group-tue/sdf_evaluation_traits:Improve dataset card: add metadata and links to paper/GitHub/project page
haritzpuerto  authored a paper about 5 hours ago
Models That Know How Evaluations Are Designed Score Safer
haritzpuerto  submitted a paper about 7 hours ago
Models That Know How Evaluations Are Designed Score Safer

Papers

Models That Know How Evaluations Are Designed Score Safer


Team members: Sahar Abdelnabi, Haritz Puerto, Katharina Deckenbach

compass-group-tue's collections (1)

🕵️🛡️ Evaluation Meta Knowledge
2026 arXiv preprint. Models fine-tuned on documents describing typical evaluation traits show safer behavior by having increased refusal rates and low
  • Models That Know How Evaluations Are Designed Score Safer

    Paper • 2605.28591 • Published 2 days ago • 4
  • compass-group-tue/sdf_evaluation_traits

    Updated about 2 hours ago • 31 • 1