# Agent system for GAIA benchmark
## Architecture
```python
gaia_system = create_gaia_system()
gaia_system.visualize_system()
```
```
CodeAgent | Qwen/Qwen2.5-Coder-32B-Instruct
├── ✅ Authorized imports: []
├── 🛠️ Tools:
│   ┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
│   ┃ Name         ┃ Description                                   ┃ Arguments                                    ┃
│   ┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│   │ final_answer │ Provides a final answer to the given problem. │ answer (`any`): The final answer to the      │
│   │              │                                               │ problem                                      │
│   └──────────────┴───────────────────────────────────────────────┴──────────────────────────────────────────────┘
└── 🤖 Managed agents:
    ├── search_agent | CodeAgent | Qwen/Qwen2.5-Coder-32B-Instruct
    │   ├── ✅ Authorized imports: []
    │   ├── 📝 Description: Retrieves factual information and background data from various sources including
    │   │   Wikipedia, web search, and academic papers
    │   └── 🛠️ Tools:
    │       ┏━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
    │       ┃ Name                  ┃ Description                          ┃ Arguments                            ┃
    │       ┡━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
    │       │ wikipedia_search      │ Search Wikipedia for information on  │ query (`string`): The search term or │
    │       │                       │ a specific topic.                    │ topic                                │
    │       │                       │                                      │ language (`string`): Wikipedia       │
    │       │                       │                                      │ language code (default: "en")        │
    │       │                       │                                      │ sentences (`integer`): Number of     │
    │       │                       │                                      │ sentences to return from summary     │
    │       │                       │                                      │ (default: 3)                         │
    │       │ web_search_duckduckgo │ Search the web using DuckDuckGo      │ query (`string`): Search query       │
    │       │                       │ search engine.                       │ string                               │
    │       │                       │                                      │ max_results (`integer`): Maximum     │
    │       │                       │                                      │ number of results to return          │
    │       │                       │                                      │ (default: 5)                         │
    │       │ fetch_webpage_content │ Fetch and extract text content from  │ url (`string`): The URL to fetch     │
    │       │                       │ a webpage.                           │ max_length (`integer`): Maximum      │
    │       │                       │                                      │ length of content to return          │
    │       │                       │                                      │ (default: 3000)                      │
    │       │ arxiv_search          │ Search arXiv papers.                 │ query (`string`): Search query or    │
    │       │                       │                                      │ paper ID (e.g., "1605.08386")        │
    │       │ wikipedia_search_tool │ Search Wikipedia using LangChain's   │ query (`string`): Search query       │
    │       │                       │ WikipediaQueryRun.                   │                                      │
    │       │ duckduckgo_search     │ Search using DuckDuckGo.             │ query (`string`): Search query       │
    │       │ final_answer          │ Provides a final answer to the given │ answer (`any`): The final answer to  │
    │       │                       │ problem.                             │ the problem                          │
    │       └───────────────────────┴──────────────────────────────────────┴──────────────────────────────────────┘
    ├── document_agent | CodeAgent | Qwen/Qwen2.5-Coder-32B-Instruct
    │   ├── ✅ Authorized imports: []
    │   ├── 📝 Description: Loads and processes structured and unstructured documents including CSV, Excel, text
    │   │   files, and audio transcriptions
    │   └── 🛠️ Tools:
    │       ┏━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
    │       ┃ Name                  ┃ Description                          ┃ Arguments                            ┃
    │       ┡━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
    │       │ load_csv_file         │ Load and analyze a CSV file.         │ filepath (`string`): Path to the CSV │
    │       │                       │                                      │ file                                 │
    │       │                       │                                      │ max_rows (`integer`): Maximum number │
    │       │                       │                                      │ of rows to display (default: 100)    │
    │       │                       │                                      │ max_columns (`integer`): Maximum     │
    │       │                       │                                      │ number of columns to display         │
    │       │                       │                                      │ (default: 20)                        │
    │       │                       │                                      │ get_all_rows (`boolean`): If True,   │
    │       │                       │                                      │ return all rows regardless of        │
    │       │                       │                                      │ max_rows (default: False)            │
    │       │ load_excel_file       │ Load and analyze an Excel file.      │ filepath (`string`): Path to the     │
    │       │                       │                                      │ Excel file                           │
    │       │                       │                                      │ sheet_name (`string`): Specific      │
    │       │                       │                                      │ sheet to load (default: None for     │
    │       │                       │                                      │ first sheet)                         │
    │       │                       │                                      │ max_rows (`integer`): Maximum number │
    │       │                       │                                      │ of rows to display (default: 100)    │
    │       │                       │                                      │ max_columns (`integer`): Maximum     │
    │       │                       │                                      │ number of columns to display         │
    │       │                       │                                      │ (default: 20)                        │
    │       │                       │                                      │ get_all_rows (`boolean`): If True,   │
    │       │                       │                                      │ return all rows regardless of        │
    │       │                       │                                      │ max_rows (default: False)            │
    │       │ read_text_file        │ Read content from a text file.       │ filepath (`string`): Path to the     │
    │       │                       │                                      │ text file                            │
    │       │                       │                                      │ max_length (`integer`): Maximum      │
    │       │                       │                                      │ length of content to return          │
    │       │                       │                                      │ (default: 2000)                      │
    │       │                       │                                      │ encoding (`string`): File encoding   │
    │       │                       │                                      │ (default: "utf-8")                   │
    │       │ transcribe_audio_file │ Transcribe audio file to text using  │ filepath (`string`): Path to the     │
    │       │                       │ Whisper.                             │ audio file                           │
    │       │                       │                                      │ model_size (`string`): Whisper model │
    │       │                       │                                      │ size ("tiny", "base", "small",       │
    │       │                       │                                      │ "medium", "large")                   │
    │       │ final_answer          │ Provides a final answer to the given │ answer (`any`): The final answer to  │
    │       │                       │ problem.                             │ the problem                          │
    │       └───────────────────────┴──────────────────────────────────────┴──────────────────────────────────────┘
    ├── vision_agent | CodeAgent | Qwen/Qwen2.5-Coder-32B-Instruct
    │   ├── ✅ Authorized imports: []
    │   ├── 📝 Description: Extracts text and meaning from images using OCR, captioning, and visual question
    │   │   answering
    │   └── 🛠️ Tools:
    │       ┏━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
    │       ┃ Name                  ┃ Description                          ┃ Arguments                            ┃
    │       ┡━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
    │       │ ocr_tool              │ Extract text from images using OCR.  │ image_path (`string`): Path to image │
    │       │                       │                                      │ file                                 │
    │       │ image_captioning_tool │ Generate basic image information     │ image_path (`string`): Path to image │
    │       │                       │ (placeholder for actual captioning). │ file                                 │
    │       │ visual_qa_tool        │ Answer questions about images        │ image_path (`string`): Path to image │
    │       │                       │ (placeholder for actual VQA).        │ file                                 │
    │       │                       │                                      │ question (`string`): Question about  │
    │       │                       │                                      │ the image                            │
    │       │ final_answer          │ Provides a final answer to the given │ answer (`any`): The final answer to  │
    │       │                       │ problem.                             │ the problem                          │
    │       └───────────────────────┴──────────────────────────────────────┴──────────────────────────────────────┘
    ├── reasoning_agent | CodeAgent | Qwen/Qwen2.5-Coder-32B-Instruct
    │   ├── ✅ Authorized imports: []
    │   ├── 📝 Description: Performs symbolic reasoning, logical pattern recognition, and analytical tasks
    │   └── 🛠️ Tools:
    │       ┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
    │       ┃ Name                        ┃ Description                       ┃ Arguments                         ┃
    │       ┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
    │       │ analyze_chess_position      │ Analyze a chess position given in │ fen_notation (`string`): Chess    │
    │       │                             │ FEN notation.                     │ position in FEN (Forsyth-Edwards  │
    │       │                             │                                   │ Notation)                         │
    │       │ analyze_table_commutativity │ Analyze a mathematical operation  │ table_data (`string`): String     │
    │       │                             │ table for commutativity.          │ representation of the operation   │
    │       │                             │                                   │ table                             │
    │       │ count_items_in_list         │ Count items in a delimited list.  │ items_text (`string`): Text       │
    │       │                             │                                   │ containing delimited items        │
    │       │                             │                                   │ separator (`string`): Delimiter   │
    │       │                             │                                   │ to split on (default: ",")        │
    │       │ final_answer                │ Provides a final answer to the    │ answer (`any`): The final answer  │
    │       │                             │ given problem.                    │ to the problem                    │
    │       └─────────────────────────────┴───────────────────────────────────┴───────────────────────────────────┘
    ├── language_agent | CodeAgent | Qwen/Qwen2.5-Coder-32B-Instruct
    │   ├── ✅ Authorized imports: []
    │   ├── 📝 Description: Handles low-level text transformations and string manipulations
    │   └── 🛠️ Tools:
    │       ┏━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
    │       ┃ Name                    ┃ Description                         ┃ Arguments                           ┃
    │       ┡━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
    │       │ reverse_string          │ Reverse a string character by       │ text (`string`): The string to      │
    │       │                         │ character.                          │ reverse                             │
    │       │ reverse_words_in_string │ Reverse the order of words in a     │ text (`string`): The string with    │
    │       │                         │ string.                             │ words to reverse                    │
    │       │ final_answer            │ Provides a final answer to the      │ answer (`any`): The final answer to │
    │       │                         │ given problem.                      │ the problem                         │
    │       └─────────────────────────┴─────────────────────────────────────┴─────────────────────────────────────┘
    └── coding_agent | CodeAgent | Qwen/Qwen2.5-Coder-32B-Instruct
        ├── ✅ Authorized imports: ['pandas', 'numpy', 'matplotlib', 'json', 're', 'datetime', 'math',
        │   'statistics', 'itertools']
        ├── 📝 Description: Executes Python code and performs computational logic through code interpretation
        └── 🛠️ Tools:
            ┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
            ┃ Name         ┃ Description                               ┃ Arguments                                ┃
            ┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
            │ final_answer │ Provides a final answer to the given      │ answer (`any`): The final answer to the  │
            │              │ problem.                                  │ problem                                  │
            └──────────────┴───────────────────────────────────────────┴──────────────────────────────────────────┘
```
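
The printout lists tool names, descriptions, and arguments, but not the tool bodies. As a minimal sketch, assuming plain-Python behavior, the `language_agent` helpers and two of the `reasoning_agent` helpers might look like the functions below. The bodies are illustrative and are not taken from this repository. `find_non_commutative_pairs` is a hypothetical stand-in for `analyze_table_commutativity`, and the Markdown-style table format it parses is an assumption. Registering the functions as agent tools is omitted.

```python
def reverse_string(text: str) -> str:
    """Reverse a string character by character."""
    return text[::-1]


def reverse_words_in_string(text: str) -> str:
    """Reverse the order of whitespace-separated words."""
    return " ".join(reversed(text.split()))


def count_items_in_list(items_text: str, separator: str = ",") -> int:
    """Count the non-empty items in a delimited list."""
    return sum(1 for item in items_text.split(separator) if item.strip())


def find_non_commutative_pairs(table_data: str) -> list[tuple[str, str]]:
    """Return every pair (x, y) for which x*y differs from y*x.

    Assumed input format (not confirmed by the source): a Markdown-style
    table whose header row is "|*|a|b|...", optionally followed by a
    "|---|---|" separator row, then one row per left operand.
    """
    rows = []
    for line in table_data.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        # Skip Markdown separator rows such as |---|---|
        if all(set(cell) <= set("-: ") for cell in cells):
            continue
        rows.append(cells)

    elements = rows[0][1:]
    # Map (left operand, right operand) -> result of the operation
    op = {
        (row[0], col): value
        for row in rows[1:]
        for col, value in zip(elements, row[1:])
    }
    return [
        (x, y)
        for i, x in enumerate(elements)
        for y in elements[i + 1:]
        if op[(x, y)] != op[(y, x)]
    ]
```

For example, `reverse_words_in_string("one two three")` returns `"three two one"`, and `count_items_in_list("a, b, c")` returns `3`. An empty list from `find_non_commutative_pairs` means the operation table is commutative.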