---
title: MerlinResearch
emoji: 🛡️
colorFrom: purple
colorTo: purple
sdk: static
pinned: true
license: apache-2.0
short_description: AI safety, reasoning, and alignment research lab.
---

# MerlinSafety

**MerlinSafety** is an independent AI safety and reasoning research organization focused on building practical, auditable, and robust open models.

## Mission

We develop and evaluate models that are:
- Strong in constrained instruction-following
- Safer in real-world agentic workflows
- Better aligned under uncertainty and adversarial prompts
- Transparent in behavior, limits, and deployment risks

## What We Build

- Safety-oriented reasoning models
- Alignment-focused post-training pipelines
- Evaluation suites for robustness, controllability, and failure analysis
- Open artifacts for reproducible research

## Current Focus Areas

- Safety reasoning for small/efficient LLMs
- Misalignment reduction via structured post-training
- Hallucination risk reduction in high-stakes contexts
- Robust instruction adherence with explicit constraints

## Research Principles

1. **Measure behavior, not marketing claims.**
2. **Prioritize reproducibility and clear documentation.**
3. **Publish limitations, not only strengths.**
4. **Design for safe deployment from day one.**

## Models

Our flagship releases are published under this organization with:
- Full model cards
- Clear training/deployment notes
- Practical usage guidance
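
As a quick illustration of the intended usage pattern, the sketch below loads a release with the `transformers` library. The model id `MerlinSafety/example-model` is a hypothetical placeholder; see the individual model cards for the actual repository names and recommended generation settings.

```python
# Minimal usage sketch. The model id below is a hypothetical placeholder,
# not a published checkpoint; substitute a real id from this organization.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "MerlinSafety/example-model"  # hypothetical placeholder

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Run a short generation to confirm the model loads and responds.
inputs = tokenizer(
    "List the key deployment risks noted in this model's card.",
    return_tensors="pt",
)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```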

## Collaboration

We welcome collaboration on:
- AI safety evaluation
- Alignment methods
- Reasoning benchmarks
- Responsible open model deployment

For partnerships or research collaboration, contact us via Hugging Face discussions or the channels linked in our repositories.

---

**MerlinSafety**
Safe reasoning. Measurable alignment. Real-world robustness.