SRE - Site Reliability Engineer
Posted on: March 20, 2023
Locations: NJ2-101-06-07 NC1-003-06-90 TX2-984-05-10 Job
- Responsible for reliability and support of all the products and
services supporting the Internal Cloud Platform.
- Maintain services once they are live by measuring and
monitoring availability, latency, and overall system health.
- Troubleshoot issues across the entire stack: hardware,
software, application, and network
- Perform deep dives into both systemic and latent reliability
issues; perform blameless RCA, partner with engineering and
operation teams across the organization to roll out fixes.
- Drive standardization efforts across multiple disciplines and
services in conjunction with embedded SREs throughout the
- Identify and drive opportunities to improve automation for the
- Provide on-call coverage as per rotation
- Be a key stakeholder in the design of cloud services so that
they are resilient from day 0 and identify/fix resiliency problems
by collaborating with product teams Required Qualifications
- BS /MS degree in Computer Science or related technical field
involving systems or equivalent practical experience.
- Minimum 6+ years of hands-on experience maintaining
- Excellent understanding of Linux /Windows operating systems
- Experience with VMware and other virtualization tools.
- Experience with automation in one or more of the programming:
Python, Java, Ansible and shell scripting and source control
- Experience with Sql/NoSql databases like Mysql, mongodb and
CI/CD tools git /Jenkins
- Experience with Ansible Tower, Redhat Satellite Foreman,
capsule architecture knowledge is a plus.
- Experience with Hashicorp Vault /Terraform /Consul /Nomad is a
- Systematic problem-solving approach, sense of ownership and
- Excellent interpersonal, organizational and communication
(written, verbal, and presentation) skills are a must. Proven
ability to work independently with minimal supervision and as part
of a team with direct responsibilities. Top 3 Must Have Skillsets
Required : Ex: Java; CCAR; OTC; Agile; SQL Linux /Python /Ansible /
Containers / CI-CD / automation Level of Experience Needed: 5+
Keywords: Experis, Charlotte , SRE - Site Reliability Engineer, Professions , Charlotte, North Carolina
Didn't find what you're looking for? Search again!
Loading more jobs...