01 Zakres zadań
- Operate, monitor, and maintain a Kafka-based messaging platform in a production environment
- Ensure platform availability, stability, and performance in line with operational SLAs
- Monitor system health using logs, metrics, and alerting tools
- Perform routine operational checks and maintenance activities
- Handle incidents and service requests via ticketing systems and support channels
- Troubleshoot issues across Kafka components (brokers, producers, consumers, integrations)
- Analyze logs, metrics, and system behavior to identify root causes of incidents
- Execute operational procedures based on runbooks and standard operating procedures (SOPs)
- Perform configuration changes (topics, access controls, settings) following established processes
- Maintain and continuously improve operational documentation and runbooks
- Act as a primary support contact for internal users of the Kafka platform
- Provide technical support via collaboration tools (e.g., Slack, Teams)
- Assist users with troubleshooting and best practices
- Translate user-reported issues into actionable insights for technical teams
