Xingyao Wang 5b3db1bd33 feat: make add_in_context_learning_example configurable in fn call converter (#5018)		1 year ago
..
agenthub	fac5237c69 Fix user commands in terminal with function calling (#4955)	1 year ago
controller	c784151765 fix file descriptor leaks (#4988)	1 year ago
core	07f0d1ccb3 feat(llm): convert function call request for non-funcall OSS model (#4711)	1 year ago
events	f55ddbed0e fix docker leak (#4970)	1 year ago
linter	ec3152b6e1 linter: only lint on updated lines in the new file (#4409)	1 year ago
llm	5b3db1bd33 feat: make add_in_context_learning_example configurable in fn call converter (#5018)	1 year ago
memory	eeb2342509 Refactor history/event stream (#3808)	1 year ago
resolver	a753babb7a Integrate OpenHands resolver into main repository (#4964)	1 year ago
runtime	f3b35663e9 fix zip downloads (#5009)	1 year ago
security	436ecb80a3 Logger fixes for openhands-resolver (#4710)	1 year ago
server	9cd248d475 feat(frontend): Display runtime ID in the browser console if available (#4978)	1 year ago
storage	641a15356f Better AWS S3 storage support (#4195)	1 year ago
utils	07f0d1ccb3 feat(llm): convert function call request for non-funcall OSS model (#4711)	1 year ago
README.md	dc0a1f3940 Fix wrong doc url (#3531)	1 year ago
__init__.py	4405b109e3 Fix issue #4809: [Bug]: Model does not support image upload when usin… (#4810)	1 year ago
py.typed	6ce77e157b Fix pypi build (#3548)	1 year ago

OpenHands Architecture

This directory contains the core components of OpenHands.

This diagram provides an overview of the roles of each component and how they communicate and collaborate.

Classes

The key classes in OpenHands are:

LLM: brokers all interactions with large language models. Works with any underlying completion model, thanks to LiteLLM.
Agent: responsible for looking at the current State, and producing an Action that moves one step closer toward the end-goal.
AgentController: initializes the Agent, manages State, and drive the main loop that pushes the Agent forward, step by step
State: represents the current state of the Agent's task. Includes things like the current step, a history of recent events, the Agent's long-term plan, etc
EventStream: a central hub for Events, where any component can publish Events, or listen for Events published by other components
- Event: an Action or Observeration
  - Action: represents a request to e.g. edit a file, run a command, or send a message
  - Observation: represents information collected from the environment, e.g. file contents or command output
Runtime: responsible for performing Actions, and sending back Observations
- Sandbox: the part of the runtime responsible for running commands, e.g. inside of Docker
Server: brokers OpenHands sessions over HTTP, e.g. to drive the frontend
- Session: holds a single EventStream, a single AgentController, and a single Runtime. Generally represents a single task (but potentially including several user prompts)
- SessionManager: keeps a list of active sessions, and ensures requests are routed to the correct Session

Control Flow

Here's the basic loop (in pseudocode) that drives agents.

while True:
  prompt = agent.generate_prompt(state)
  response = llm.completion(prompt)
  action = agent.parse_response(response)
  observation = runtime.run(action)
  state = state.update(action, observation)

In reality, most of this is achieved through message passing, via the EventStream. The EventStream serves as the backbone for all communication in OpenHands.

flowchart LR
  Agent--Actions-->AgentController
  AgentController--State-->Agent
  AgentController--Actions-->EventStream
  EventStream--Observations-->AgentController
  Runtime--Observations-->EventStream
  EventStream--Actions-->Runtime
  Frontend--Actions-->EventStream

Runtime

Please refer to the documentation to learn more about Runtime.

README.md

OpenHands Architecture

Classes

Control Flow

Runtime