Lead Site Reliability Engineer

Número de la requisición: 2290455
Categoría de la vacante: Technology
Localização da vaga: Hyderabad, Telangana

Man standing and writing on a white board while presenting to coworkers in a meeting room.

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.

The ideal candidate will have a solid background in observability tools (Datadog, Splunk, Kibana), cloud-native infrastructure (AWS, EKS), Python scripting, and GitHub Actions workflows. Experience in healthcare data interoperability and analytics is essential.

Primary Responsibilities:

  • Design, implement, and maintain scalable, reliable, and secure infrastructure on AWS and EKS
  • Develop and manage observability and monitoring solutions using Datadog, Splunk, and Kibana
  • Collaborate with development teams to ensure high availability and performance of microservices-based applications
  • Automate infrastructure provisioning, deployment, and monitoring using Infrastructure as Code (IaC) and CI/CD pipelines
  • Build and maintain GitHub Actions workflows for continuous integration and deployment
  • Troubleshoot production issues and lead root cause analysis to improve system reliability
  • Ensure compliance with healthcare data standards and regulations (e.g., HIPAA, HL7, FHIR)
  • Work closely with data engineering and analytics teams to support healthcare data pipelines and analytics platforms
  • Mentor junior engineers and contribute to SRE best practices and culture
  • Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so

Required Qualifications:

  • Bachelor’s degree in Engineering (B.Tech) or equivalent in Computer Science, Information Technology, or a related field
  • 10+ years of experience in Site Reliability Engineering, DevOps, or related roles
  • Hands-on experience with AWS services, EKS, and container orchestration
  • Experience with healthcare technology solutions, health data interoperability standards (FHIR, HL7), and healthcare analytics
  • Experience with GitHub Actions or similar CI/CD tools
  • Solid expertise in Datadog, Splunk, Kibana, and other observability tools
  • Deep understanding of microservices architecture and distributed systems
  • Proficiency in Python for scripting and automation
  • Solid scripting and automation skills (e.g., Bash, Terraform, Ansible)
  • Proven excellent problem-solving, communication, and collaboration skills

Preferred Qualifications:

  • Certifications in AWS, Kubernetes, or healthcare IT (e.g., AWS Certified DevOps Engineer, Certified Kubernetes Administrator)
  • Experience with security and compliance in healthcare environments

At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone – of every race, gender, sexuality, age, location and income – deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes – an enterprise priority reflected in our mission.

Información adicional sobre la vacante

Número de la requisición 2290455

Segmento de negocio Optum

Disponibilidad para viajar No

País IN

Estado de horas extras Exempt

Vacante de teletrabajo No