Lead Site Reliability Engineer

Requisition Number: 2277395
Job Category: Technology
Primary Location: Hyderabad, Telangana, IN

Man standing and writing on a white board while presenting to coworkers in a meeting room.

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by diversity and inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health equity on a global scale. Join us to start Caring. Connecting. Growing together.

Primary Responsibilities:

  • Maintain the performance, security, and reliability of our database systems
  • Managing Azure Postgres and MongoDB databases, ensuring high availability, and implementing best practices for database administration
  • Install, configure, and maintain Azure Postgres and MongoDB database systems, Monitor database performance and implement optimization strategies, Ensure database security and implement access controls
  • Perform regular backups and recovery procedures. Optimize database performance through indexing, query tuning, and resource management
  • Implementing and maintain database security measures, including encryption and role-based access control
  • Ensure compliance with organizational and industry standards for data protection. Achieve near 100% database uptime and availability
  • Manage Azure Cloud Infrastructure and building resilient and self-scaling systems
  • Implementing solutions to continuously improve operational reliability of the cloud infrastructure
  • Availability, performance, monitoring and Infra Provisioning for the Platform which comprises of Cloud infrastructure and On Prem technologies
  • Closely partner with Engineering and Technical Support teams to drive resolution of critical issues
  • Publish and implement operational standards for all Cloud infrastructure and services
  • Working towards reducing Operations toil by automating repeatable tasks
  • Mentor and develop other members in the SRE subject area
  • Application deployments using CI/CD tools, code repository, code scanning, artifact repo, compliance scanning, packaging, deployment, and configuration management
  • Build Operations Dashboards leveraging tools like Dynatrace, Splunk or Grafana
  • Handling incident, change and problem management
  • Help with provisioning of Infrastructure using Terraform
  • Enhancing Platform Observability Dashboards
  • Closely partnering with Development Teams and help address Platform related roadblocks
  • Conduct post-mortem after a production issues
  • React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems
  • Work with Docker, Kubernetes, Azure cloud, Prometheus, Grafana, Java, Python and many other modern SaaS technologies
  • Participate in projects involving people of many different disciplines: Engineering, Cloud, Networking, CI/CD, Project management, Monitoring, alerting etc
  • Stay informed of new technologies and Innovate
  • Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so

Required Qualifications:

  • Bachelor’s or advanced Degree in a related technical field
  • 3+ years of IT Experience
  • 3+ years of experience with Azure Postgres
  • 3+ years of experience with MongoDB
  • 3+ years of DevOps Experience
  • 2+ years of experience in Kafka Support
  • 2+ years of experience in Monitoring tools and technologies (Splunk, Dynatrace, New Relic)
  • 2+ years of experience in Monitoring tools and technologies (Splunk, Dynatrace, new relic)

Preferred Qualifications:

  • Infrastructure Engineering Experience
  • Cloud Experience (Azure/AWS/GCP)
  • Automation experience
  • Hands-on scripting with one or more: YAML, JSON, PowerShell, BASH or Python.
  • Good Knowledge on SRE principles

At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes – an enterprise priority reflected in our mission.

Additional Job Detail Information

Requisition Number 2277395

Business Segment Optum

Employee Status Regular

Travel No

Additional Locations
Noida, Uttar Pradesh, IN
Gurgaon, Haryana, IN

Overtime Status Exempt

Schedule Full-time

Shift Day Job

Telecommuter Position No

Our Hiring Process

We want you to know what our hiring process looks like. Watch the video and find out what to expect along the way.

What It’s Like

Watch the video and hear how our employees describe what it’s like to work here in Customer Service.

Careers at Optum

If you want to use your abilities to help us challenge the status quo and achieve on our ambitious mission, this is the right place for you. We are creating and delivering quality health care solutions that deeply impact the health care system. And this means opportunities for people like you to grow and innovate with us.

Closing the GAP

Our team members help close the gap in health care. Take a closer look and see how Lisa helps members navigate a complex health care system.