SRE
MXN $100,000-110,000/month
Indeed
Full-time
Onsite
No experience limit
No degree limit
Isabel La Católica 5, Centro Histórico de la Cdad. de México, Centro, Cuauhtémoc, 06000 Ciudad de México, CDMX, Mexico
Description

Summary: Seeking a skilled Site Reliability Engineer (SRE) to support, optimize, and enhance large-scale, mission-critical applications, ensuring system reliability, scalability, and performance.

Highlights:

1. Support and optimize large-scale, mission-critical applications
2. Collaborate with diverse teams for system reliability
3. Focus on reliability, automation, and continuous improvement

Job Summary

We are seeking a skilled **Site Reliability Engineer (SRE)** to support, optimize, and enhance large-scale, mission-critical applications. The ideal candidate has strong technical expertise, excellent troubleshooting skills, and experience maintaining complex distributed systems in production environments.

In this role, you will collaborate with development, DevOps, infrastructure, and networking teams to ensure **system reliability, scalability, and performance**.

Key Responsibilities

* Provide production support for complex **Java-based applications**, ensuring stability, performance, and resiliency
* Manage and support applications running in **AWS, PCF, Kubernetes, and containerized environments**
* Maintain and optimize systems built on **Kafka, PostgreSQL, and other distributed components**
* Build, configure, and maintain **monitoring dashboards and alerts** using tools such as Splunk
* Perform **root cause analysis** using logs, stack traces, thread dumps, heap dumps, and system diagnostics
* Apply **ITIL/ITSM practices**, including incident management and change control processes
* Contribute to improvements in **resiliency, high availability, automation, and system performance**
* Collaborate with **development, DevOps, networking, and infrastructure teams** to ensure end-to-end system reliability

Technical Skills

Candidates should have experience with **multiple SRE-related technologies** and intermediate proficiency in at least two of the following areas:

* Production support of complex **Java applications**
* Cloud platforms and PaaS environments such as **AWS, PCF, and Kubernetes**
* **Kafka administration and troubleshooting**
* Monitoring and observability tools such as **Splunk**, including dashboard creation and log analysis
* **ITIL/ITSM frameworks** and operational processes
* **SDLC processes and CI/CD / DevOps tooling**
* Distributed computing environments such as **UNIX, Windows, or Mainframe systems**
* **Networking fundamentals** (Layers 1–3)
* **System diagnostics and performance analysis**, including:
  * Thread dumps
  * Heap dumps
  * TCP dumps
  * CPU and memory diagnostics
* Experience with **load balancers and web application firewalls (WAFs)**
* Knowledge of **high availability, resiliency, and business continuity practices**
* Understanding of **caching and CDN concepts**
* Experience with **configuration management and Infrastructure as Code**

Soft Skills

* Self-driven with the ability to operate **independently and proactively**
* Strong **critical thinking, analytical, and troubleshooting skills**
* Highly **detail-oriented and structured** in approach
* Excellent **communication and collaboration skills**

Why Join Us

You will be part of a collaborative engineering environment focused on **reliability, automation, and continuous improvement**, supporting critical systems that power large-scale applications.

Job type: Permanent (indefinite-term contract)

Salary: $100,000.00 - $110,000.00 per month

Benefits:

* Life insurance
* Grocery vouchers

Work location: Hybrid remote in Ciudad de México

Source: Indeed
Juan García
Indeed · HR

Company

Indeed
© 2025 Servanan International Pte. Ltd.