Creating Components¶
The minimal component¶
Components can be used to implement various functionalities like providing messages to the prompt, executing code, or interacting with external services.
Component is a class that inherits from AgentComponent
OR implements one or more protocols. Every protocol inherits AgentComponent
, so your class automatically becomes a component once you inherit any protocol.
class MyComponent(AgentComponent):
pass
This is already a valid component, but it doesn't do anything yet. To add some functionality to it, you need to implement one or more protocols.
Let's create a simple component that adds "Hello World!" message to the agent's prompt. To do this we need to implement MessageProvider
protocol in our component. MessageProvider
is an interface with get_messages
method:
# No longer need to inherit AgentComponent, because MessageProvider already does it
class HelloComponent(MessageProvider):
def get_messages(self) -> Iterator[ChatMessage]:
yield ChatMessage.user("Hello World!")
Now we can add our component to an existing agent or create a new Agent class and add it there:
class MyAgent(Agent):
self.hello_component = HelloComponent()
get_messages
will called by the agent each time it needs to build a new prompt and the yielded messages will be added accordingly.
Passing data to and between components¶
Since components are regular classes you can pass data (including other components) to them via the __init__
method.
For example we can pass a config object and then retrieve an API key from it when needed:
class DataComponent(MessageProvider):
def __init__(self, config: Config):
self.config = config
def get_messages(self) -> Iterator[ChatMessage]:
if self.config.openai_credentials.api_key:
yield ChatMessage.system("API key found!")
else:
yield ChatMessage.system("API key not found!")
Note
Component-specific configuration handling isn't implemented yet.
Configuring components¶
Components can be configured using a pydantic model.
To make component configurable, it must inherit from ConfigurableComponent[BM]
where BM
is the configuration class inheriting from pydantic's BaseModel
.
You should pass the configuration instance to the ConfigurableComponent
's __init__
or set its config
property directly.
Using configuration allows you to load confugration from a file, and also serialize and deserialize it easily for any agent.
To learn more about configuration, including storing sensitive information and serialization see Component Configuration.
# Example component configuration
class UserGreeterConfiguration(BaseModel):
user_name: str
class UserGreeterComponent(MessageProvider, ConfigurableComponent[UserGreeterConfiguration]):
def __init__(self):
# Creating configuration instance
# You could also pass it to the component constructor
# e.g. `def __init__(self, config: UserGreeterConfiguration):`
config = UserGreeterConfiguration(user_name="World")
# Passing the configuration instance to the parent class
UserGreeterComponent.__init__(self, config)
# This has the same effect as the line above:
# self.config = UserGreeterConfiguration(user_name="World")
def get_messages(self) -> Iterator[ChatMessage]:
# You can use the configuration like a regular model
yield ChatMessage.system(f"Hello, {self.config.user_name}!")
Providing commands¶
To extend what an agent can do, you need to provide commands using CommandProvider
protocol. For example to allow agent to multiply two numbers, you can create a component like this:
class MultiplicatorComponent(CommandProvider):
def get_commands(self) -> Iterator[Command]:
# Yield the command so the agent can use it
yield self.multiply
@command(
parameters={
"a": JSONSchema(
type=JSONSchema.Type.INTEGER,
description="The first number",
required=True,
),
"b": JSONSchema(
type=JSONSchema.Type.INTEGER,
description="The second number",
required=True,
)})
def multiply(self, a: int, b: int) -> str:
"""
Multiplies two numbers.
Args:
a: First number
b: Second number
Returns:
Result of multiplication
"""
return str(a * b)
To learn more about commands see 🛠️ Commands.
Prompt structure¶
After components provided all necessary data, the agent needs to build the final prompt that will be send to a llm.
Currently, PromptStrategy
(not a protocol) is responsible for building the final prompt.
If you want to change the way the prompt is built, you need to create a new PromptStrategy
class, and then call relevant methods in your agent class.
You can have a look at the default strategy used by the AutoGPT Agent: OneShotAgentPromptStrategy, and how it's used in the Agent (search for self.prompt_strategy
).
Example UserInteractionComponent
¶
Let's create a slightly simplified version of the component that is used by the built-in agent. It gives an ability for the agent to ask user for input in the terminal.
-
Create a class for the component that inherits from
CommandProvider
.class MyUserInteractionComponent(CommandProvider): """Provides commands to interact with the user.""" pass
-
Implement command method that will ask user for input and return it.
def ask_user(self, question: str) -> str: """If you need more details or information regarding the given goals, you can ask the user for input.""" print(f"\nQ: {question}") resp = input("A:") return f"The user's answer: '{resp}'"
-
The command needs to be decorated with
@command
.@command( parameters={ "question": JSONSchema( type=JSONSchema.Type.STRING, description="The question or prompt to the user", required=True, ) }, ) def ask_user(self, question: str) -> str: """If you need more details or information regarding the given goals, you can ask the user for input.""" print(f"\nQ: {question}") resp = input("A:") return f"The user's answer: '{resp}'"
-
We need to implement
CommandProvider
'sget_commands
method to yield the command.def get_commands(self) -> Iterator[Command]: yield self.ask_user
-
Since agent isn't always running in the terminal or interactive mode, we need to disable this component by setting
self._enabled=False
when it's not possible to ask for user input.def __init__(self, interactive_mode: bool): self.config = config self._enabled = interactive_mode
The final component should look like this:
# 1.
class MyUserInteractionComponent(CommandProvider):
"""Provides commands to interact with the user."""
# We pass config to check if we're in noninteractive mode
def __init__(self, interactive_mode: bool):
self.config = config
# 5.
self._enabled = interactive_mode
# 4.
def get_commands(self) -> Iterator[Command]:
# Yielding the command so the agent can use it
# This won't be yielded if the component is disabled
yield self.ask_user
# 3.
@command(
# We need to provide a schema for ALL the command parameters
parameters={
"question": JSONSchema(
type=JSONSchema.Type.STRING,
description="The question or prompt to the user",
required=True,
)
},
)
# 2.
# Command name will be its method name and description will be its docstring
def ask_user(self, question: str) -> str:
"""If you need more details or information regarding the given goals,
you can ask the user for input."""
print(f"\nQ: {question}")
resp = input("A:")
return f"The user's answer: '{resp}'"
Now if we want to use our user interaction instead of the default one we need to somehow remove the default one (if our agent inherits from Agent
the default one is inherited) and add our own. We can simply override the user_interaction
in __init__
method:
class MyAgent(Agent):
def __init__(
self,
settings: AgentSettings,
llm_provider: MultiProvider,
file_storage: FileStorage,
app_config: Config,
):
# Call the parent constructor to bring in the default components
super().__init__(settings, llm_provider, file_storage, app_config)
# Disable the default user interaction component by overriding it
self.user_interaction = MyUserInteractionComponent()
Alternatively we can disable the default component by setting it to None
:
class MyAgent(Agent):
def __init__(
self,
settings: AgentSettings,
llm_provider: MultiProvider,
file_storage: FileStorage,
app_config: Config,
):
# Call the parent constructor to bring in the default components
super().__init__(settings, llm_provider, file_storage, app_config)
# Disable the default user interaction component
self.user_interaction = None
# Add our own component
self.my_user_interaction = MyUserInteractionComponent(app_config)
Learn more¶
The best place to see more examples is to look at the built-in components in the classic/original_autogpt/components and classic/original_autogpt/commands directories.
Guide on how to extend the built-in agent and build your own: 🤖 Agents
Order of some components matters, see 🧩 Components to learn more about components and how they can be customized.
To see built-in protocols with accompanying examples visit ⚙️ Protocols.