Overview
MultiAgent is a powerful multi-agent system designed to coordinate multiple agents working together to accomplish complex tasks. The creation and configuration process of MultiAgent is streamlined and efficient, utilizing a Planner-Executor-Reviewer
approach for task execution. Organizations can customize their AI agent teams based on different business scenarios, significantly enhancing the efficiency and flexibility of AI applications within the enterprise. The platform comes with various pre-configured AI agents, including AI Coder IDE, Browser Use, and Computer Use. These agents are ready to use out of the box without complex configuration.
The operational mechanism of MultiAgent is as follows:
flowchart LR subgraph P["Planner"] direction TB P1["Task Clarification Identify and clarify user instruction intent"] P2["Planner Generate high-quality Task Todo List"] P3["Action Dynamically generate Action List for Tasks"] end subgraph R["Runner"] direction TB T(["Task-1"]) T2(["Task-2"]) T3(["Task..."]) direction LR A3["Action1"] A4["Action2"] A5["Action3"] A6["Action4"] end subgraph Rev["Reviewer"] direction TB RV["Enterprise Custom Review Standards"] D1{"Human Takeover Needed?"} D2{"Full Task Update Required?"} D3{"Pending Tasks Need Update?"} D4{"Current Task Need Re-execution?"} End(["Continue Task Execution"]) end H2["Human User Takeover"] H1["User Issues Task"] P1 --> P2 P2 --> P3 P -- Serial --> T & T2 & T3 T -- Parallel --> A3 & A4 A6 -- Task Execution Complete --> RV RV -- Not Pass --> D4 D4 -- N --> D3 D4 -. Y .-> T D3 -. Y .-> P D3 -- N --> D2 D2 -- N --> D1 D2 -. Y .-> P D1 -- Y --> H2 D1 -- N --> End A4 -- Serial --> A5 A5 -- Serial --> A6 RV -- Pass --> NT["Start Next Task"] H1 -- Submit Task Instruction --> P H1:::human H2:::human P1:::planner P2:::planner P3:::planner T:::runner T2:::runner T3:::runner A3:::runner A3:::Peach A4:::runner A4:::Peach A5:::runner A5:::Peach A6:::runner A6:::Peach RV:::reviewer D1:::decision D2:::decision D3:::decision D4:::decision End:::reviewer NT:::reviewer classDef human fill:#e3f2fd,stroke:#90caf9,stroke-width:2px classDef planner fill:#e8f5e9,stroke:#81c784,stroke-width:2px classDef runner fill:#fffde7,stroke:#ffe082,stroke-width:2px classDef reviewer fill:#ede7f6,stroke:#b39ddb,stroke-width:2px classDef decision fill:#fce4ec,stroke:#ec407a,stroke-width:2px,stroke-dasharray:4 2 classDef Peach stroke-width:1px, stroke-dasharray:none, stroke:#FBB35A, fill:#FFEFDB, color:#8F632D
Key Features
- Custom AI Agent Team Building: The Multi-Agent platform enables enterprises to flexibly assemble AI Agent teams based on specific business needs, greatly enhancing the flexibility of AI applications within organizations.
- Rich Pre-configured AI Agents: The platform comes with various built-in, ready-to-use AI Agents, such as AI Coder IDE, Browser Use, and Computer Use.
- Multi-scenario Role Support: Provides AI Agent roles covering multiple domains including development, product, testing, algorithms, data, and marketing, meeting the needs of different enterprise departments.
- Wide Application Scope: Multi-Agent can be applied to various business scenarios such as in-depth research, data insight analysis, bid document generation, and financial account detail generation.
- Efficient and User-friendly: Simple operation that greatly improves work efficiency. For example, users can assemble an AI data analysis Agents team within minutes.
Creation and Configuration
Creating a MultiAgent system is straightforward, similar to setting up a single Agent. Key aspects include:
- Global Settings: By default, global settings are collapsed. Here you can configure memory, input/output capabilities, and other modules such as knowledge base, database, logs, and insights, consistent with single Agent functionality.
- Node Addition: Users can add various Agents to the MultiAgent canvas by clicking "Add Node". Available Agent types include:
- Pre-configured Agents: Ready-to-use Agents built by GPTBots with predefined capabilities, such as Online Search and Browser Use.
- Template Agents: Agents requiring user configuration to define their roles and capabilities.
- Agent Integration: To build a MultiAgent team, simply select the desired Agents, add them to the canvas, and connect them with lines. This links their capabilities for collaborative task execution.
Agent Introduction
Planner Agent
The "Planner" is the core Agent of MultiAgent, responsible for requirement clarification, task planning, execution task generation, and task completion quality review. It's recommended to choose the most capable model version, though selecting an inference model may significantly increase runtime.
The Planner supports custom task control mechanisms, such as dynamic task adjustment, user takeover options, and setting maximum retry limits for individual tasks to avoid excessive token consumption.
The Planner can also be configured with memory, tools, knowledge base, and database capabilities for enhanced functionality.
Computer Use Agent
Computer Use Agent supports custom identity prompts and configurable maximum task iteration limits to prevent excessive token consumption. It also supports user takeover functionality, allowing human users to take control and remotely operate the Computer when the Agent cannot properly advance tasks.
Computer Use runs in a sandboxed Linux environment and can utilize computer apps, file systems, and system commands to complete tasks.
Browser Use Agent
Runs in a sandboxed browser environment for browser-based task execution. Has low LLM capability requirements and can be used regardless of whether the LLM supports image recognition.
Coder IDE Agent
Completes tasks through code generation, running in a sandboxed CLI environment. Excels at searching, executing APIs, generating web pages, and creating documentation.
Online Search Agent
Online Search possesses autonomous capabilities for retrieving and obtaining URL page content, making it particularly useful in information network detection scenarios.
Execution Process
The MultiAgent task completion process follows these steps:
- Task Input: Users input task descriptions and submit. The Planner determines whether to execute directly or request clarification.
- Task List Generation: Once instructions are clear, the Planner generates a task list for user confirmation before automatic execution.
- Dynamic Execution: During execution, the Planner dynamically generates Action Lists for each task. After each task completion, the Planner evaluates quality and decides whether to adjust or update the task list.
- Task Completion: Upon completion of all tasks, MultiAgent summarizes and provides final results. Users can ask follow-up questions to update task outcomes if needed.
Best Practices
- Capability Description: Ensure accurate and detailed capability descriptions for each Agent to help the Planner achieve efficient task allocation.
- Model Optimization: Choose models for the Planner that balance performance and speed, avoiding delays from excessive reasoning.
- Control Mechanisms: Set appropriate iteration limits and user takeover options to optimize resource consumption and task reliability.