Sr. Manager/Architect – DevOps
As Sr. Manager/Architect of the DevOps Team you will: Lead and mentor the team and develop strategies for Tillster’s AWS cloud infrastructure, automation and tooling to support the software development life cycle, and robust monitoring and alerting of our unique restaurant-ordering platform-as-service. Your creativity and skills will directly support a platform that processes millions of transactions for well-known, international brands in the quick service and casual dining industry such as Burger King, Pizza Hut, Baskin Robbins and many others as we scale globally.You’ll be depended upon to provide innovative approaches to Tier-2 operational support, automation, configuration management and continuous deployment. This role demands a motivated and self-directed leader who embraces challenges and solves problems with detailed execution and follow-through. You are an effective communicator and team collaborator who pushes fellow team members to think about problems and solutions in new ways.
What you’ll do:
- Manage the DevOps / Site Reliability Engineering (SRE) team
- Define strategy, procedures and protocols for day-to-day operations together with the Technology Management team
- Mentor team members to grow their skill-sets and help them define career paths and goals
- Build, monitor and scale our AWS infrastructure to support our platform-as-service
- Tackle complex infrastructure challenges with scalable automation and CI/CD solutions
- Maintain multiple production-class Kubernetes environments
- Build custom tools and instrumentation to ensure service availability
- Implement custom monitoring across the entire application stack, including 3rd party APIs and Point of Sale systems
- Work with partners and vendors on technical improvements and fixes
- Constantly research new technology and techniques that will benefit the team, the company, and its clients
Who we’re looking for:
- You have professional experience in AWS technologies and architectures and understand the unique performance and operational characteristics of applications that run there
- You think of infrastructure as code and prefer automation over manual implementation
- You have professional experience running containerized applications using Docker and Kubernetes
- You’re comfortable troubleshooting JVM-based distributed systems & web services in a high-traffic, high-visibility environment
- You can creatively solve complex customer or internal business requirements
- You believe in the approach of automating the way through challenges that span multiple systems and environments
- You are experienced in DevOps programming in at least one language such as Groovy, Python, Javascript/Node.js, Go, Bash and know how to review pull-requests on a daily basis
- You understand how technical debt is undesirable and know how to identify and remediate it
- You understand performance tuning concepts and how they apply to high-traffic web and mobile application environments
- You have experience running teams using agile methodologies such as Scrum or Kanban
- By habit, you track and document your work and time with tools like JIRA and Confluence, Slack and email
- Bachelor's degree from a four-year college or university, or three to four years related experience and/or training; or equivalent combination of education and experience.
- 5+ years of managing a variety of software projects and/or IT infrastructure projects
Technologies you are familiar with:
The more hands-on experience you have with the following technologies the better, and where you don’t have it all, you’ll be able to discuss it intelligently and put new learnings into action:
- AWS – Tillster is 100% in AWS. We make heavy use of VPCs, EC2, RDS, EC, ES, S3, CloudFront, Route53 and many others
- CloudFormation and Terraform for Infrastructure as Code in common practice
- Linux – command-line and system administration skills are a requirement
- Jenkins – used for CI/CD, builds, and deployment
- Kubernetes – production implementation and management in AWS of containerized applications
- Monitoring – familiarity with industry-standard monitoring tools such as CloudWatch, New Relic, AppDynamics, Datadog, Dynatrace, Sensu, etc
- Logging aggregation systems such as ELK, Splunk, Graylog, etc
- Tomcat – Production experience managing Java-based applications
- HTTP servers and load balancers such as Nginx, Apache, HAProxy, etc