Knoxville TN | Knoxville Info | iKnowKnoxville
Great Smoky Mountains
View RSS Feed

Site Reliability Engineer

« Back

Job Description:

Position ID: J0621-0273

Position Description:
Site Reliability Engineer with strong technical skills to engineer solutions for operational problems

Your future duties and responsibilities:
As a Site Reliability Engineer (SRE), the candidate will be responsible for formulating and implementing solutions that will improve overall stability and resiliency of critical business applications. The candidate will also be responsible for leading critical incident response function: - lead and drive the triaging and resolution of complex, high impact production issue to quickly restore the services to minimize any impacts to business functions.

High Level Responsibilities

• Response Function
o Lead Critical Incident Response function.
o On-Call Support including off hours
o Monitor production environments
o Conduct Post-Mortem analysis
• Problem Solving Function
o Repetitive Incident Resolution. Remediate the issue via sustainable and preventative solutions
o Knowledge Management
o Build and Refine Application monitoring rules
o Improvise Run-The-Shop Operations and Processes
o Build or modify solutions to improve system resiliency, performance and efficiency.
o Reduce manual work through automation.
• Building a strong team by mentoring and leveling-up junior developers

Required qualifications to be successful in this role:
Experienced SRE with strong background on coding and automation

8+ years experience on Run-The-Shop operations

Should be able to quickly identify the root cause and resolve critical issues by looking across multiple layers

Extensive Support Experience in a global large-scale, high volume application in a mission-critical and complex production environment

Proficient knowledge in application and infrastructure architecture discipline.

Extensive experience in Linux Systems, analyzing and interpreting the application/system log files

Working knowledge of WEB applications (Tomcat, Java) and database (Oracle, SQL,)

Advanced knowledge in Application Performance Monitoring tools, analyzing application and system log files

Proficient knowledge of infrastructure components for example Load Balancers (F5), Web Security Mechanism, REST APIs

Proficient knowledge of SRE principles and automation design principles

Proficient Knowledge of Object Oriented Programming concepts. Working knowledge of JAVA is Preferred.

5+ years of experience in programming/scripting languages for example Python/PowerShell/Shell Scripting

5+ years of Banking and Financial domain knowledge preferred

Strong, courageous communicator capable of effectively communicating, verbally and written to technical, business and “C” level executives.

Capable of working in high pressure situations
Java or any software language 5+ years
Python/Shell Scripting/Power Shell 5+ years
Linux System 8+ years
Dynatrace or any other APM tools 8+ years
Web Applications 8+ years


Communication skill
Team player
Problem solving

Minimum Education Required: Bachelors Degree

Shell Script


500 West Summit Hill Drive
Knoxville TN 37902

Contact Information:

Job Link: Apply Here!

Share by Email