Certified Remote
PUBLISHED
Jan 16, 2026
Join Kentik as a Staff Site Reliability Engineer specializing in cloud infrastructure, where you'll ensure the reliability, scalability, and performance of our network observability platform. Collaborate with cross-functional teams to automate operations, optimize systems, and drive innovation in a dynamic cloud environment.
Kentik is seeking an experienced Staff Site Reliability Engineer with a focus on cloud technologies to join our innovative team. In this pivotal role, you will be responsible for the design, deployment, and maintenance of highly reliable cloud-based systems that power our real-time network analytics and observability platform. You will work closely with software engineers, product managers, and other stakeholders to identify and resolve infrastructure issues, implement proactive monitoring solutions, and automate routine operations to minimize downtime and enhance efficiency.
Key responsibilities include architecting scalable cloud infrastructures, developing and maintaining CI/CD pipelines, performing capacity planning, and leading incident response efforts. You will leverage your expertise in cloud-native services to optimize costs, improve system resilience, and support the rapid growth of our customer base. At Kentik, we value engineers who are passionate about reliability engineering practices, embrace automation, and thrive in a collaborative, remote-first environment. If you have a proven track record in SRE and a desire to contribute to cutting-edge cloud solutions, we encourage you to apply and help shape the future of network intelligence.
The employer recommends obtaining this certification to validate your skills and enhance your application.
Note: You can still apply for this position without the certification, but having it will make your profile stand out and may be required to move forward in the hiring process.