Site Reliability Engineer - Onsite
Company: A-Line Staffing Solutions
Location: Charlotte
Posted on: May 28, 2023
Job Description:
Title: Site Reliability Engineer
Location: Charlotte, NC (Onsite)
Note: This position is a contract on W2, and is NOT open to C2C.

The IT Operations team is looking for an outstanding Site
Reliability Engineer (SRE). The SRE will serve as a highly
specialized technical lead focusing on operational stability by
driving IT operations readiness through continuous improvement of
our products. This role will involve working closely with
development teams and business partners to implement enhanced
monitoring and alerting capabilities for our distributed platforms.
Additionally, the SRE will aid in the development of automation to
reduce MTTR and manual tasks. We are looking for a high-energy team
player with an innovative mindset who is interested in joining a group of
IT professionals dedicated to enhancing IT operations. A passion for
technology and problem solving is a must.

The Work Itself
- Collaborate with Agile squads/developers, sustain teams, and
  business partners, and contribute significantly to developing
  specifications that resolve problems and address enhancement
  needs, focusing on logging, monitoring, and metrics for
  operational readiness
- Use technical knowledge, creativity, and company practices to
  drive down occurrences of incidents through development of
  proactive monitoring and alerting
- Provide continuous feedback to development teams on system
  stability, defect analysis, and system enhancements
- Develop runbooks and patterns to sustain applications in a
  production environment
- Participate in technical discussions and drive transition to
  sustain activities with the development and production operations
  teams
- Work with IT business and development partners to gather input to
  develop new capabilities in displaying, monitoring, and alerting
  on key performance indicators (KPIs) by tracking business
  transactions (BT) in real time
- Partner with application owners to develop creative and effective
  solutions to mitigate risk and successfully remediate any audit
  issues
- Participate in RCA and SWAT investigations for the IT Production
  Engineering team
- Plan for validation and verification of changes deployed by
  infrastructure teams, development teams, and the sustain team
- Participate where needed in day-to-day execution of real-time,
  advanced-level technical support and troubleshooting
- Provide guidance in resolving performance-related issues and
  designing solutions for any technical issues faced by the
  application
- Review technical documentation

Mandatory Skills
- Applied DevOps experience
- Experience with AWS Lambda and ECS
  Fargate
- Experience with Splunk, Datadog, AppDynamics, or other similar
  monitoring tools, creating dashboards, alerts, and reports
- Ability to correlate environment conditions and metrics to
  application events
- Experience debugging problems in a distributed system
- Experience with source control management, specifically Git
- Coding experience in Python
- Experience in enterprise development and production
  troubleshooting and issue resolution
- Knowledge and understanding of enterprise-scale platforms and
  architectures
- Strong analytical, problem-solving, and leadership skills
- Experience coordinating between upstream applications to resolve
  incidents
- Ability to grasp innovative technologies and adapt to rapid shifts
  in priorities

Desired Skills
- Applied AWS/Cloud experience preferred
- Developers with IaC and AWS experience are welcome to apply
- Experience with Salesforce, Genesys, or telephony is a plus

If you think this position is a good fit for you, please reach out
to me. Feel free to e-mail me, or apply to this posting!

Andrew Torchine
Atorchine@alinestaffing.com
PDN-991b2d8e-5d12-49b9-8e4c-98b4ce98c282