Lead the Site Reliability Engineering division and oversee the teams responsible for SRE, DevOps, and related areas
Design and implement procedures to ensure that our systems are scalable, secure, and cost-effective
Drive improvements in automation and CI/CD processes to improve efficiency and reduce downtime
Develop and maintain monitoring and alerting systems to quickly detect and respond to system issues
Lead and advise high-complexity projects from scoping to production
Assist in determining the future technical direction, with a focus on improving reliability and performance
Provide technical direction and guidance to members of the team
Collaborate with cross-functional stakeholders (engineering, business, product) in both tactical and strategic capacities
Apply pragmatic reasoning to navigate complex challenges and competing interests
Build a DevOps culture to provide high quality, continuous operations, and ongoing support ensuring critical service level metrics, customer requirements and financial objectives
Manage vendors and software assets
Design, release and improve elements and software releases for business needs and satisfaction.
Namizədə tələblər
Education: Bachelor's degree in Computer Science or related technical field
Work experience: Minimum 7 years; At least 3 years in a managerial position
License / Certificate: Linux/DevOops tools/Cloud technologies related certificates desired
Foreign Language: English (upper-intermediate), Russian desired
Computer Skills: Proficient in Linux OS, virtualization, cloud technologies, large scale systems deployments, and DevOps tools
Market Knowledge: Passionate about staying on top of SRE trends, experimenting with and learning new technologies
Other: Experience managing a team of SREs or DevOps Engineers, leading production troubleshooting of distributed systems, code, storage, networking, and operating systems. Previous participation in a 24x7, on-call rotation for large-scale deployment is preferred.