Software Engineer – Production Support
Optum is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.
The Operations Engineer (SRE) is responsible for ensuring the reliability, scalability, and performance of enterprise systems and applications. This role blends software engineering with IT operations to create automated solutions for operational challenges, improve system health, and reduce manual toil.
Primary Responsibilities:
- Application Stability
  - Ensure system reliability through proactive monitoring and alerting using tools like Dynatrace, Splunk, Prometheus / Grafana
- Incident & Performance Management
  - Lead incident response and post-incident reviews to identify root causes and preventive actions
- Monitoring & Health Checks
  - Set up alerting for critical metrics, both infrastructure and functional
  - Be proficient with monitoring tools such as Dynatrace, Datadog, and Splunk
  - Monitor application health and manage Management by Objectives (MBO) processes
- Communication & Collaboration
  - Coordinate with developers and stakeholders during deployments
  - Ensure clear communication before and after production changes
- Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regard to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so
Required Qualifications:
- Bachelor’s degree in Computer Science, Engineering, or a related field
- 3+ years of experience in infrastructure or operations engineering
- Experience with cloud platforms (AWS, Azure, GCP) and with containers and container orchestration (Docker, Kubernetes)
- Proficiency in any one programming language: Java, Python, or JavaScript (UI technologies)
- Familiarity with monitoring tools (Dynatrace, Splunk, Prometheus, Grafana) and incident management frameworks
- Ability to handle production support, including weekend rotations
- Solid grasp of microservices architecture, including API gateways and SQL/NoSQL databases
Preferred Qualifications:
- Experience with CI/CD tools and Infrastructure as Code (Terraform, Ansible)
- Exposure to AI/ML use cases in operations (AIOps)
- Solid understanding of ITIL and ITSM processes
- Excellent communication and problem-solving skills
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone – of every race, gender, sexuality, age, location and income – deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes – an enterprise priority reflected in our mission.
Additional information about the position
Requisition number 2320534
Business segment Optum
Availability to travel No
Country IN
Overtime status Exempt
Telecommute position No