09/07 Srikanth Annam
HR at Epam

Views:490 Applications:48 Rec. Actions:Recruiter Actions:0

Epam - Site Reliability/DevOps Engineer - Java/Python (5-9 yrs)

Hyderabad Job Code: 465528

Job Location: Bangalore

Years of Experience: 5- 9 years

Role :

- As a DevOps Engineer, you will be responsible for driving the technical vision to improve the build and deployment processes, incident management, and operational excellence of the Marketplace Platform team.


- Your primary responsibility as a DevOps Engineer is to work with product and engineering teams to streamline the build, deployment, on-call, and escalation management processes.


- You have excellent analytical, debugging, problem-solving and root-cause analysis skills.


- You have excellent written and verbal communication skills.


- You must raise the bar for operational excellence and design as well as implement components that are conducive for hands-free operations.

Primary Responsibilities :

- Lead on monitoring, troubleshooting and root cause analysis for production issues

- Able to analyze large log files to find a pattern leading to a potential failure

- Provide inputs to enhance KPI Dashboards for production environment

- Responsible for escalation management of production issues, excellent communication skills

- Lead on sustainable incident response and blameless postmortems.

- A bias for quickly analyzing and resolving production issues arising from a high volume production environment

- Learn new tools on demand if necessary

- Find quick workarounds and if necessary Apply patches and resolve issues

- Objective : 99.9999 [four 9's ] uptime of production environment - lead this to Six 9's

- Lead on service deployment, operations and automation

- Service capacity management and auto scaling

- Lead the cross-functional collaboration efforts to improve the overall team velocity

- Provide continuous improvement feedback to development and DevOps teams to strengthen the next version of service deployments

Secondary responsibilities :

- Design and drive the implementation of fully automated CI/CD pipelines

- Lead the monitoring, debugging, and enhancing pipelines for optimal operation and performance

- Help QA teams to enhance test scripts to avoid future regressions

Leadership Responsibilities :

- Lead the definition and measurement of KPIs for operational excellence

- Provide quick, reliable, and easy to interpret script results and dashboard reports to raise awareness of operational overhead

- Lead the cross-functional collaboration efforts to improve the overall team velocity

- Experience with leading on-call responsibilities and addressing production issues

- Experience with providing technical leadership to an offshore team

Minimum Qualifications :

- BS in Computer Science or equivalent with 8+ years of experience

- Familiar with Linux, Shell commands, Bash scripting

- Excellent scripting skills : Be able to write own scripts and also analyze other written scripts to perform quick Root Cause Analysis

- Experience in one or more of the following: C, C++, Java, Python, Go

- Hands-on experience with providing technical leadership to DevOps teams

- Good understanding of distributed applications - One of these would help : Microservices, J2EE SOA services, Restful APIs

- A big plus if prior development background in designing, developing and deploying distributed apps

- Deep understanding of Linux system commands to fine tune capacity and resources - Process, Network and Storage

Preferred Qualifications :

- Proven ability to lead in a fast-paced environment

- Experience with providing technical leadership to an offshore team

- Experience with leading escalation management for production outages

- Experience with leading on-call responsibilities and addressing real time production issues

- Experience on designing, analyzing and troubleshooting large-scale distributed systems

Women-friendly workplace:

Maternity and Paternity Benefits

Add a note
Something suspicious? Report this job posting.