akseljoonas committed on
Commit
d1d8c1f
·
1 Parent(s): 64bf289

fix: reduce tool selection ambiguity in system prompt

Replace rigid tool sequences with general principles. Block
redundant MCP paper_search (hf_papers is a superset).

agent/core/tools.py CHANGED
@@ -62,7 +62,7 @@ warnings.filterwarnings(
  "ignore", category=DeprecationWarning, module="aiohttp.connector"
  )
 
- NOT_ALLOWED_TOOL_NAMES = ["hf_jobs", "hf_doc_search", "hf_doc_fetch", "hf_whoami"]
+ NOT_ALLOWED_TOOL_NAMES = ["hf_jobs", "hf_doc_search", "hf_doc_fetch", "hf_whoami", "paper_search"]
 
 
  def convert_mcp_content_to_string(content: list) -> str:
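
The diff above only extends the blocklist; how the list is consumed is not shown in this commit. As a minimal sketch, assuming the agent filters discovered MCP tool names against `NOT_ALLOWED_TOOL_NAMES` before registering them, the effect could look like the following. The helper `filter_allowed_tools` and the `discovered` list are hypothetical and are not taken from `agent/core/tools.py`.

```python
# Blocklist as it stands after this commit.
NOT_ALLOWED_TOOL_NAMES = ["hf_jobs", "hf_doc_search", "hf_doc_fetch", "hf_whoami", "paper_search"]


def filter_allowed_tools(tool_names):
    """Hypothetical helper: drop every tool whose name is on the blocklist."""
    blocked = set(NOT_ALLOWED_TOOL_NAMES)
    return [name for name in tool_names if name not in blocked]


# Hypothetical list of tool names reported by an MCP server.
discovered = ["hf_papers", "paper_search", "hf_inspect_dataset", "hf_whoami"]
allowed = filter_allowed_tools(discovered)
print(allowed)  # ['hf_papers', 'hf_inspect_dataset']
```

With `paper_search` blocked, only `hf_papers` remains for paper lookups, which is the ambiguity the commit message says it wants to remove.
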
agent/prompts/system_prompt_v3.yaml CHANGED
@@ -16,8 +16,8 @@ system_prompt: |
 
  Skip research only for trivial non-code operations.
 
- For open-ended research tasks (improving model performance, finding the best approach for a task, exploring a field, implementing a paper's method):
- hf_papers(trending/search) hf_papers(read_paper) hf_papers(find_all_resources) hf_inspect_dataset
+ When the user asks what is best, optimal, or most effective start with research papers.
+ Inspect specific resources before using them. Validate dataset format before any training job.
 
  # Mistakes you WILL make without research
 
@@ -52,6 +52,7 @@ system_prompt: |
  SFT: "messages", "text", or "prompt"/"completion"
  DPO: "prompt", "chosen", "rejected"
  GRPO: "prompt"
+ All datasets for training should be in conversational ChatML format for HF library compatibility.
 
  # When submitting a training job
 
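
The prompt tells the agent to validate dataset format before any training job and lists the expected columns per method. A minimal sketch of such a check, assuming validation means confirming that the dataset's column names cover one of the listed column sets, might look like this. `REQUIRED_COLUMNS` and `dataset_format_ok` are hypothetical names, and the check covers column presence only, not whether the contents follow ChatML.

```python
# Accepted column sets per training method, copied from the prompt text.
REQUIRED_COLUMNS = {
    "sft": [{"messages"}, {"text"}, {"prompt", "completion"}],
    "dpo": [{"prompt", "chosen", "rejected"}],
    "grpo": [{"prompt"}],
}


def dataset_format_ok(method, column_names):
    """Hypothetical check: True if the columns cover any accepted set for the method."""
    columns = set(column_names)
    accepted = REQUIRED_COLUMNS[method.lower()]
    return any(required <= columns for required in accepted)


print(dataset_format_ok("SFT", ["messages", "source"]))  # True
print(dataset_format_ok("DPO", ["prompt", "chosen"]))    # False, "rejected" is missing
print(dataset_format_ok("GRPO", ["prompt", "answer"]))   # True
```

Extra columns are tolerated here; a stricter variant could reject them or inspect the first few rows as well.
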