Skip to Content
GuideConceptsCloudPilot AI Components Details

CloudPilot AI Components Details

This documentation provides a technical overview of CloudPilot AI’s architectural components, detailing their core functions and operational behavior within Kubernetes environments. It serves as a reference for engineers who want to deep dive into CloudPilot AI.

Component Details

CloudPilot AI consists of six core components that work together to optimize cluster costs while ensuring workload stability. Each component has specific responsibilities and resource requirements designed for efficient cluster management.

ComponentKindResource RequirementsFunctions
cloudpilot-agentDeployment1 Replica
250m CPU / 500MiB Memory
Pushes nodes/resource requests information for optimization planning and handles cluster registration with the CloudPilot AI platform.
cloudpilot-optimizerDeployment2 Replicas
1000m CPU / 1GiB Memory (each)
Performs node operations based on configuration, including node draining, deletion, and creation to optimize cluster resource utilization and costs.
cloudpilot-controllerDeployment1 Replica
300m CPU / 500MiB Memory
Synchronizes configuration from server side to local clusters, applies custom resources, labels workloads for webhook patching, evicts pods to meet configuration requirements, and drains nodes with high interruption rates.
cloudpilot-webhookDeployment2 Replicas
100m CPU / 100MiB Memory (each)
Mutates pod creation requests by patching pod affinity and anti-affinity rules to enhance workload stability and distribution.
cloudpilot-workload-log-collectorDeployment1 Replica
100m CPU / 100MiB Memory
Collects CloudPilot AI-specific component logs for troubleshooting and operational monitoring.
cloudpilot-spot-handlerDaemonSet50m CPU / 100MiB Memory (per node)Retrieves spot instance interruption notifications and immediately notifies the cloudpilot-optimizer to launch replacement instances.

Support

These components work together to provide a comprehensive cluster optimization solution. The agent handles communication with the CloudPilot AI platform, while the optimizer and controller manage cluster resources based on received configurations. The webhook ensures optimal pod placement, and the spot handler provides proactive management of spot instance interruptions.

For further questions, feel free to reach out to us through our Slack channel.

Last updated on