




Summary: We are seeking a Lead DevOps Engineer to enhance the reliability and scalability of our Ads Organization data infrastructure, delivering upgrades, optimizing CI/CD, and troubleshooting production challenges. Highlights: 1. Improve reliability and scalability of Ads Organization data infrastructure 2. Oversee and optimize data processing operations using Airflow, Spark, and Flink 3. Create and sustain cloud infrastructure with AWS, Kubernetes, and Terraform We are searching for a Lead DevOps Engineer to improve the reliability and scalability of our Ads Organization data infrastructure. You will troubleshoot production challenges, deliver upgrades and maintenance, and optimize CI/CD workflows while coordinating with stakeholders. Apply to help us ship stable systems with strong observability **Responsibilities** * Oversee and optimize data processing operations using Airflow/MWAA, Spark, and Flink * Create and sustain cloud infrastructure with AWS, Kubernetes, and Terraform * Engage stakeholders to collect requirements and communicate infrastructure change progress * Plan and execute upgrades, conduct maintenance, and troubleshoot data platforms while using Datadog for monitoring and performance insights * Refine CI/CD delivery by enhancing Spinnaker and Jenkins pipelines for consistent releases **Requirements** * Proven track record of 5\+ years in DevOps engineering roles * Demonstrated experience of 1\+ year in leadership or team management responsibilities * Deep expertise in Amazon Web Services (AWS) for deploying, managing, and operating cloud infrastructure * Hands\-on experience with Apache Airflow for orchestrating and scheduling data workflows * Advanced knowledge of Kubernetes to manage and scale containerized applications * Strong proficiency in Terraform for infrastructure automation and configuration * English level B2 (Upper\-Intermediate) or higher with strong communication skills for collaboration and reporting **Nice to have** * Experience with Apache Flink for real\-time data stream processing * Familiarity with Apache NiFi for automating and controlling data flows * Knowledge of Databricks for advanced analytics and machine learning efforts * Experience with Datadog for monitoring infrastructure and driving issue resolution


