CharlotteRecruiter Since 2001
the smart solution for Charlotte jobs

Site Reliability Engineer (SRE) 100% remote

Company: MissionStaff
Location: Charlotte
Posted on: January 16, 2022

Job Description:

Are you looking for a 100% remote SRE position? Look no further. Through powerful analytics, this company transforms data into intelligence quickly and efficiently. Through leading-edge, proprietary technology and a massive data repository, their data and analytical solutions harness the power of data fusion, uncovering the relevance of disparate data points and converting them into comprehensive and insightful views of people, businesses, assets, and their interrelationships. They are currently looking for an SRE to join their growing team.
The Senior Site Reliability Engineer is responsible for ensuring availability, minimizing latency, and maximizing the performance, capacity, and scalability of software services across multiple AWS accounts. This person will join a growing technical team, leveraging automation platforms and their subject matter expertise to ensure that systems are highly available, security-compliant, and performant.
What You Will Do:

  • Develop strategies for continuous monitoring and analysis to reduce both downtime and required manual intervention
  • Build nontrivial internal tooling to support and enable engineering workflows
  • Design and write automation that investigates how our infrastructure handles failure and scaling
  • Monitor the breadth of our full platform stack (hosts, applications, and performance)
  • Embrace and encourage the adoption of the DevOps culture and philosophies
  • Write and maintain detailed documentation, including architectural diagrams
  • Guide and mentor peers and colleagues on best-practice approaches to full stack monitoring, log analysis, and infrastructure/application performance management
What You Bring:

  • 3-5 years of experience with customer-facing production environment(s) using containerization and orchestration tools
  • 3-5 years of experience building observability systems using products like Elasticsearch, Logstash, Prometheus, Kibana, and AWS CloudWatch
  • 3-5 years of combined experience in SRE/DevOps or Software Development roles in a full stack engineering environment
  • Strong communication skills, confidently representing your expertise to peers and stakeholders across the organization
  • Must have experience enriching alerts for faster root-cause detection and incident resolution
  • Experience with Infrastructure as Code solutions, particularly Terraform/Terragrunt and/or AWS SAM/CloudFormation
  • Strong scripting experience in Bash and/or Python
  • Experience with configuration management software such as Ansible, Chef, Puppet, or Salt
  • Experience leveraging enterprise cloud monitoring frameworks such as Datadog, Blue Matador, New Relic, etc.
  • Industry certifications (AWS Solutions Architect Professional or DevOps Engineer Professional) are a big plus
Culture:

  • Unlimited PTO: typically comes to about 2-3 weeks, varying by manager
  • You'll be part of a culture you can be proud of. Friendly and inclusive: it's what makes them unique. They support and help you from the moment you join.
  • They will work with you to make the right development choices for your career. The skills you gain will help you get the most out of your time with them and make you more marketable in the future.
The Offer:

  • Competitive Salary: Up to $170K DOE

