3+ years of operational experience in Information Technology & Information Security
University Degree in Computer Science, Computer Engineering or another relevant field
Good interpersonal communication and presentation skills
Ability to be a team player
Ability to work effectively in multiple cultures and at a range of levels
Ability to continually build skills through a mix of self-motivated and course-based learning
Ability to work independently and proactively, see the big picture, and work through solutions as needed
|
Strong Linux administration skills
Strong troubleshooting skills are a plus
Knowledge of CI/CD principles
Strong TCP/IP knowledge
Knowledge of source control tools
Knowledge of cloud platforms
Experience in automation development
Knowledge of container technology and orchestration
Experience with monitoring tools
Working knowledge of using REST APIs
Intermediate to advanced knowledge of at least one high-level scripting language
Thorough understanding of major system components used in system administration
|
Identify sources of instability in large-scale distributed systems and drive operational excellence
Analyze complex systems from a reliability and resilience perspective
Improve reliability and drive down the burden of toil with tooling and automation
Implement and continually improve application and system monitoring
Resolve complex technical issues as necessary
Use modern tools to streamline configuration management
Diagnose complex system performance problems using dumps, traces, or other diagnostic aids
Third-party integrations
Incident response
Participate in on-call rotations
Task automation
|
You’ll be responsible for monitoring the general uptime and availability of all applications owned by SNAPP
The SRE role is embedded within cross-functional teams and works alongside the DevOps and Security teams
You’ll have the opportunity to design systems and solutions that best support the needs of your team, and to use best practices to defend against cyber-attacks within a large-scale business
|
Site Reliability Engineer |