tobitege bc31fb15fe (fix) CodeActAgent: fix issues with vision support in prompts (#3665) 1 год назад
..
micro fd0fad7362 improve github.md with more API commands with less code duplication (#3651) 1 год назад
README.md e72dc96d13 [Fix] Stop API key from leaking in evaluation outputs. (#3603) 1 год назад
__init__.py 6487175a31 refactored all relative paths to absolute paths (#3495) 1 год назад
action_parser.py 01ae22ef57 Rename OpenDevin to OpenHands (#3472) 1 год назад
codeact_agent.py bc31fb15fe (fix) CodeActAgent: fix issues with vision support in prompts (#3665) 1 год назад
system_prompt.j2 7068a73ae7 (enh) Improve CodeActAgent's file editing reliability (#3610) 1 год назад
user_prompt.j2 01ae22ef57 Rename OpenDevin to OpenHands (#3472) 1 год назад

README.md

CodeAct Agent Framework

This folder implements the CodeAct idea (paper, tweet) that consolidates LLM agents’ actions into a unified code action space for both simplicity and performance (see paper for more details).

The conceptual idea is illustrated below. At each turn, the agent can:

  1. Converse: Communicate with humans in natural language to ask for clarification, confirmation, etc.
  2. CodeAct: Choose to perform the task by executing code
    • Execute any valid Linux bash command
    • Execute any valid Python code with an interactive Python interpreter. This is simulated through bash command, see plugin system below for more details.

image

Plugin System

To make the CodeAct agent more powerful with only access to bash action space, CodeAct agent leverages OpenHands's plugin system:

Demo

https://github.com/All-Hands-AI/OpenHands/assets/38853559/f592a192-e86c-4f48-ad31-d69282d5f6ac

Example of CodeActAgent with gpt-4-turbo-2024-04-09 performing a data science task (linear regression)

Work-in-progress & Next step

[] Support web-browsing [] Complete the workflow for CodeAct agent to submit Github PRs