Lead Site Reliability Engineer

Requisition Number: 2290455
Job Category: Technology
Primary Location: Hyderabad, Telangana, IN

Optum is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.

The ideal candidate will have a solid background in observability tools (Datadog, Splunk, Kibana), cloud-native infrastructure (AWS, EKS), Python scripting, and GitHub Actions workflows. Experience in healthcare data interoperability and analytics is essential.

Primary Responsibilities:

  • Design, implement, and maintain scalable, reliable, and secure infrastructure on AWS and EKS
  • Develop and manage observability and monitoring solutions using Datadog, Splunk, and Kibana
  • Collaborate with development teams to ensure high availability and performance of microservices-based applications
  • Automate infrastructure provisioning, deployment, and monitoring using Infrastructure as Code (IaC) and CI/CD pipelines
  • Build and maintain GitHub Actions workflows for continuous integration and deployment
  • Troubleshoot production issues and lead root cause analysis to improve system reliability
  • Ensure compliance with healthcare data standards and regulations (e.g., HIPAA, HL7, FHIR)
  • Work closely with data engineering and analytics teams to support healthcare data pipelines and analytics platforms
  • Mentor junior engineers and contribute to SRE best practices and culture
  • Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies regarding flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so

Required Qualifications:

  • Bachelor’s degree in Engineering (B.Tech) or equivalent in Computer Science, Information Technology, or a related field
  • 10+ years of experience in Site Reliability Engineering, DevOps, or related roles
  • Hands-on experience with AWS services, EKS, and container orchestration
  • Experience with healthcare technology solutions, health data interoperability standards (FHIR, HL7), and healthcare analytics
  • Experience with GitHub Actions or similar CI/CD tools
  • Solid expertise in Datadog, Splunk, Kibana, and other observability tools
  • Deep understanding of microservices architecture and distributed systems
  • Proficiency in Python for scripting and automation
  • Solid scripting and automation skills (e.g., Bash, Terraform, Ansible)
  • Proven problem-solving, communication, and collaboration skills

Preferred Qualifications:

  • Certifications in AWS, Kubernetes, or healthcare IT (e.g., AWS Certified DevOps Engineer, Certified Kubernetes Administrator)
  • Experience with security and compliance in healthcare environments

At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone – of every race, gender, sexuality, age, location and income – deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes – an enterprise priority reflected in our mission.

Additional Job Detail Information

Requisition Number: 2290455
Business Segment: Optum
Employee Status: Regular
Travel: No
Country: IN
Overtime Status: Exempt
Schedule: Full-time
Shift: Day Job
Telecommuter Position: No

Careers at Optum

If you want to use your abilities to help us challenge the status quo and achieve our ambitious mission, this is the right place for you. We are creating and delivering quality health care solutions that deeply impact the health care system. And this means opportunities for people like you to grow and innovate with us.