Site Reliability Engineer
- Job Category: Engineering/Technical
- Education: Bachelor's Degree
- FT/PT: Full-Time
- Company: CGI
Position ID: J0621-0273
Site Reliability Engineer with strong technical skills to engineer solutions for operational problems
Your future duties and responsibilities:
As a Site Reliability Engineer (SRE), the candidate will be responsible for formulating and implementing solutions that improve the overall stability and resiliency of critical business applications. The candidate will also lead the critical incident response function, driving the triage and resolution of complex, high-impact production issues to restore services quickly and minimize impact on business functions.
High Level Responsibilities
• Response Function
o Lead Critical Incident Response function.
o On-Call Support including off hours
o Monitor production environments
o Conduct Post-Mortem analysis
• Problem Solving Function
o Repetitive Incident Resolution: remediate recurring issues through sustainable, preventative solutions
o Knowledge Management
o Build and Refine Application monitoring rules
o Improve Run-The-Shop Operations and Processes
o Build or modify solutions to improve system resiliency, performance and efficiency.
o Reduce manual work through automation.
• Build a strong team by mentoring and leveling up junior developers
Required qualifications to be successful in this role:
Experienced SRE with a strong background in coding and automation
8+ years of experience in Run-The-Shop operations
Should be able to quickly identify the root cause and resolve critical issues by looking across multiple layers
Extensive experience supporting a global, large-scale, high-volume application in a complex, mission-critical production environment
Proficient knowledge of application and infrastructure architecture
Extensive experience with Linux systems, including analyzing and interpreting application and system log files
Working knowledge of web applications (Tomcat, Java) and databases (Oracle, SQL)
Advanced knowledge of Application Performance Monitoring (APM) tools, including analysis of application and system log files
Proficient knowledge of infrastructure components such as load balancers (F5), web security mechanisms, and REST APIs
Proficient knowledge of SRE principles and automation design principles
Proficient knowledge of object-oriented programming concepts. Working knowledge of Java is preferred.
5+ years of experience in programming/scripting languages such as Python, PowerShell, or shell scripting
5+ years of experience in the banking and financial domain preferred
Strong, courageous communicator capable of communicating effectively, both verbally and in writing, with technical staff, business stakeholders, and C-level executives
Capable of working in high-pressure situations
Java or any software language: 5+ years
Python/Shell Scripting/PowerShell: 5+ years
Linux systems: 8+ years
Dynatrace or any other APM tool: 8+ years
Web applications: 8+ years
DESIRED QUALIFICATIONS/NON-ESSENTIAL SKILLS REQUIRED
Minimum Education Required: Bachelor's Degree
500 West Summit Hill Drive
Knoxville TN 37902