• Login Name       Password       Remember me       LOGIN LOGIN    FORGOT PASSWORD
  • REGISTER

 LOGIN WITH
 
 
 
 
 EMPLOYERS
 
 
 
 
 
 NEWSLETTER
 
 
 
 
 FOLLOW US

Intermediate Site Reliability Engineer (SRE II)

Job Ref
285373
Job Type
Permanent
Employer Type
Company
Date Added 19 Sep 2023
Expiry Date 17 Oct 2023
* There have been no applications to this job.
* This job has been viewed 3885 times.
Employer:
SprintHive

Location:
Western Cape

Salary:
Market related

Benefits:


Role details:
SprintHive is offering the right candidate a great opportunity to join a fast growing South African fintech that enables seamless and innovative end-to-end customer onboarding
services that drive conversion rates, prevent fraud, reduce risk and costs. SprintHive provides automated and easy to implement solutions that fully onboard a new customer
in under two minutes.
You’ll work in a small, senior team that operates on trust and high collaboration. The team works remotely most of the time and occasionally comes into the office when more direct collaboration is required. You should be motivated to achieve operational excellence using automation tooling (e.g. Terraform) and enjoy keeping your technical skills current to allow you to contribute to architectural discussions. Naturally, you’ll be exposed to many aspects
of our business from day one. We’ll ensure that you have the tools and support to do great work, but you’ll also have the freedom to try new things and learn.
- - - - - - - - - - - - - -
Your responsibilities will include:

? Improving and maintaining our infrastructure using Terraform, which includes
making effective use of public clouds (primarily Google Cloud and AWS) while
considering:
1. Security
2. Latency & Availability
3. Maintainability
4. Scalability
5. Dependencies (including human)
? Ensuring our infrastructure is automated and reproducible across environments
? Leveraging Kubernetes in an effective manner to host our applications
? Owning infrastructure projects from start to finish and driving them to completion
within agreed timeframes
? Documenting infrastructure design and how tooling should be used
? Regularly considering the long-term vision for our infrastructure and our
alignment to it
? Making well-considered tradeoffs between short-term infrastructure requirements
and long-term objectives
? Identifying potential improvements that could enable us to deliver faster without
compromising operational objectives
? Managing our identity platform and enabling enterprise user and system
authentication and authorization using OAuth2
? Writing, testing and executing change control plans for production changes with
an eye for detail to spot potential issues
? Having a good working understanding of how our systems operate and be able to
debug production issues
? Being part of our on-call rotation. When on-call, you will work on repaying technical debt and deal with operational incidents as and when they occur. This will require you to have or acquire a good general knowledge of production
operations for technical support.
? Being part of our security incident response team
? Writing operational tooling to automate otherwise manual processes (e.g. Golang, Bash)
? Performing high quality, ego-free code reviews for your colleagues as well as submitting your code for review by others and accepting their feedback generously
? Taking ownership of our operational metrics and drive visibility, testing and improvement initiatives
? Working effectively with the development team to plan and deploy required infrastructure changes or new capabilities ahead of time and unblocking the development team when unforeseen infrastructure blockers arise
? Accepting feedback willingly and sharing your knowledge freely


Why join SprintHive?

? Flexible working hours and leave (no clock watching)
? Strong values that are practised
? Customer focused
? Remote work for most days of the week
? Opportunity to learn and grow being surrounded by a strong technical team
? Afforded a high level of autonomy
? High quality hardware and software (e.g. MacBook Pro, 27'' or 34'' Dell monitor, JetBrains IDE Subscription)


Our Infrastructure & Software Stack:

? Kubernetes running on Google Kubernetes Engine (GKE)
? Prometheus, Grafana, Elastic, Kibana
? CI/CD with Jenkins
? Kong API Gateway
? LogDNA
? Falco
? MongoDB Atlas
? Microservice Architecture with Event Sourcing and CQRS
? Containers running Kotlin, Python, Javascript (and a bit of Golang)


APPLY  
 



 
HOME|
INFO|