




Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large\-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning. ✅ **Site Reliability Engineering (SRE)** or similar roles with a focus on full stack ownership of critical services and technology areas. ✅ **Design and delivery of mission\-critical systems** , with a strong focus on security, resiliency, scalability, and performance. ✅ **Deep understanding of end\-to\-end configuration** , technical dependencies, and production service characteristics. ✅ **Experience acting as a technical authority** for end\-to\-end performance, operability, and scalability. ✅ **Collaboration with development teams** to define and implement improvements in cloud service architectures. ✅ **Ability to articulate and guide** on the technical characteristics of services and technology stacks. ✅ **Strong experience with Linux\-based systems** , including administration, networking, performance tuning, and troubleshooting. ✅ **Knowledge of automation and orchestration tools** (DevOps, CI/CD, Terraform, Ansible, Kubernetes, etc.). ✅ **Experience managing complex escalations** and defining mitigations for distributed systems. ✅ **Understanding of the impact of architectural decisions** on distributed systems and cloud services. ✅ **Strong professional curiosity** and motivation to develop a deep understanding of advanced services and technologies.


