Spaces: williyam/agentic-rag-gym

agentic-rag-gym / tests
52 kB
  • 4 contributors
History: 9 commits
williyam
fix: tighten reward/grade bounds to [0.01, 0.99] in models, orchestrator, and tests
9955cc3 about 1 month ago
  • __init__.py
    41 Bytes
    feat: complete Agentic RAG Gym implementation about 1 month ago
  • test_config.py
    3.13 kB
    feat: complete Agentic RAG Gym implementation about 1 month ago
  • test_domain.py
    3.33 kB
    feat: complete Agentic RAG Gym implementation about 1 month ago
  • test_grader_exploits.py
    3.79 kB
    test: add exploit tests, integration test, and CI workflow about 1 month ago
  • test_graders.py
    5.8 kB
    fix(server): CORS, sanitization, concurrency, parameter override, mount at /ui about 1 month ago
  • test_integration.py
    6.11 kB
    fix: tighten reward/grade bounds to [0.01, 0.99] in models, orchestrator, and tests about 1 month ago
  • test_models.py
    5.41 kB
    fix: remove dead code (EXECUTOR, MySQL, bleach), fix PlannerAgent receiver about 1 month ago
  • test_orchestrator.py
    9.57 kB
    fix: tighten reward/grade bounds to [0.01, 0.99] in models, orchestrator, and tests about 1 month ago
  • test_rewards.py
    7.05 kB
    feat: complete Agentic RAG Gym implementation about 1 month ago
  • test_server.py
    7.77 kB
    fix: clamp scores to [0.01, 0.99] in API boundary, UI, inference, and tests about 1 month ago