Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

COMPASS research group at ELLIS Institute Tübingen

non-profit
http://s-abdelnabi.github.io
compass-group-tue
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

kaethy  new activity about 1 hour ago
compass-group-tue/sdf_evaluation_traits:Improve dataset card: add metadata and links to paper/GitHub/project page
haritzpuerto  authored a paper about 4 hours ago
Models That Know How Evaluations Are Designed Score Safer
haritzpuerto  submitted a paper about 7 hours ago
Models That Know How Evaluations Are Designed Score Safer
View all activity

Papers

Models That Know How Evaluations Are Designed Score Safer

View all Papers

Sahar Abdelnabi's profile pictureHaritz Puerto's profile pictureKatharina Deckenbach's profile picture

Collections 1

🕵️🛡️ Evaluation Meta Knowledge
2026 arXiv preprint. Models fine-tuned on documents describing typical evaluation traits show safer behavior by having increased refusal rates and low
  • Models That Know How Evaluations Are Designed Score Safer

    Paper • 2605.28591 • Published 2 days ago • 4
  • compass-group-tue/sdf_evaluation_traits

    Updated about 1 hour ago • 31 • 1
🕵️🛡️ Evaluation Meta Knowledge
2026 arXiv preprint. Models fine-tuned on documents describing typical evaluation traits show safer behavior by having increased refusal rates and low
  • Models That Know How Evaluations Are Designed Score Safer

    Paper • 2605.28591 • Published 2 days ago • 4
  • compass-group-tue/sdf_evaluation_traits

    Updated about 1 hour ago • 31 • 1

models 0

None public yet

datasets 1

compass-group-tue/sdf_evaluation_traits

Updated about 1 hour ago • 31 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs