What is Kodosumi?
Kodosumi is the runtime environment to manage and execute agentic services at scale. The system is based on Ray - a distributed computing framework - and a combination of litestar and fastapi to deliver agentic services to users or other agents. Similar to Ray, Kodosumi follows a Python first agenda.
Kodosumi is one component of a larger eco system with masumi and sokosumi.
Kodosumi consists of three main building blocks. First, a Ray cluster to execute agentic services at scale. Kodosumi builds on top of Ray and actively manages the lifecycle and events of service executions from starting to finished or error. No matter you name your code an application, flow, service or script: The third building block is your application which runs on top of Kodosumi.
The following architecture shows the relation between the three building blocks:
- Your service on top of 2) Kodosumi which operates 3) a distributed compute cluster with Ray secure and at scale.
You build and deploy your Flow by providing an endpoint (HTTP route) and an entrypoint (Python callable) to Kodosumi (left bottom box in the diagram). Kodosumi delivers features for access control, flow control and manages flow execution with Ray head node and worker nodes. Kodosumi spooler gathers flow execution results and outputs into the event stream.
Deep-dive into endpoints and how these translate into entrypoints of flows which operationalize the business logic of agentic services or agents in the broader sense.
If you need further background information read why Kodosumi