GreenSky Administrative Services LLC

Site Reliability Engineer

Job Location(s): US-Atlanta
Posted Date: 11/12/2021 10:10 AM
Job ID: 2021-4728
# of Openings: 2
Category: Information Technology

Overview

As a GreenSky Site Reliability Engineer, you will be responsible for the design, deployment, and operation of our 100 percent cloud-native platform, which powers all GreenSky applications. The qualified candidate will have experience delivering B2B or B2C applications and systems that scale reliably across multiple cloud environments while supporting continuous integration and deployment pipelines. The SRE will lead the definition and deployment of GreenSky’s AWS- and Azure-based system and application architecture, supporting the goals of the product, development, and production operations teams with a focus on continuous deployment, operational efficiency, and application performance scaling.

Duties & Responsibilities

  • Take a significant role in owning and supporting the GreenSky cloud environments that underpin the GreenSky IT ecosystem, including driving standardization, cost management, and functionality for users and applications
  • Support GreenSky development teams throughout the development lifecycle
  • Build long-term service reliability into all IT environments, increasing the odds that when a problem gets fixed, it stays fixed
  • Automate repeatable workflows to improve infrastructure reliability, identify capacity volatility, prevent problems from occurring, and accelerate remediation through faster response and resolution.
  • Automate response to all non-exceptional service conditions.
  • Scale systems sustainably through mechanisms such as automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Minimize manual systems work in order to focus on efforts that bring long-term value to the company’s operations.
  • Evaluate potential failures and their effects on the system.
  • Develop and deploy operational test cases to catch issues in production and lower environments.
  • Ensure that infrastructure is stable enough to support services before they go live, through activities such as system design consulting, developing software platforms and frameworks, and capacity planning.

Required Skills/Qualifications

  • Minimum of seven (7) years of experience in systems engineering for revenue generating platforms
  • Proficiency in one or more of the following: Python, Perl, Bash, C, C++, Java, Go, Ruby
  • Experience automating system management with one or more of the following: AWS CloudFormation, AWS CDK, Terraform, Azure ARM or Bicep, Chef, Ansible, or Vagrant
  • Experience deploying applications in IaaS and PaaS cloud environments
  • Excellent Linux system administration and overall operating systems skills
  • Experience with application servers, web servers, networks, firewalls, switches, and load balancers
  • Knowledge of and familiarity with the configuration and usage of Apache, LDAP, DNS, and other open-source systems infrastructure software
  • Experience using Jenkins, Git, Confluence, Node.js, and Jira

 
