Vacant Position: Site Reliability Engineer
Location: Nairobi
Reports to: Development Lead
Our client, a well-established and highly reputable B2B services provider is seeking to recruit a Site Reliability Engineer to join their firm.
Key Responsibilities:
- Working across several business areas providing development, maintenance, and support
- Work on projects that directly impact key business metrics
- Engage in and improve the whole lifecycle of services—from inception and design, deployment, operation, and refinement.
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Scale systems sustainably through mechanisms like automation; evolve systems by pushing for changes that improve reliability and velocity.
- Lead sustainable incident response, blameless postmortems, and production improvements that result in direct business opportunities for Organization.
- Manage individual project priorities, deadlines, and deliverables.
- Guide other team members on managing end-to-end availability and performance of mission-critical services, building automation to prevent problem recurrence, and building automated responses for non-exceptional service conditions.
- Able to work in shifts
- Meet frequently in standups and grooming sessions with your engineers and product team
- A working understanding of software engineering principles
- Help identify probable causes and provide immediate solutions during an incident
- Contribute to engineering efforts from planning and organization to execution and delivery to solve complex, real-world engineering problems.
Minimum Requirements:
- Bachelor’s degree in Computer Science, a related technical field involving software/systems engineering, or equivalent practical experience.
- Experience programming in at least one of the following languages: C, C++, Java, Python, or Go.
- Experience with algorithms and data structures.
- 3-5 years of experience in computing, distributed systems, storage, or networking.
Skills and Competencies
- Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
- Ability to debug, optimize code, and automate routine tasks.
- Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
- Experience with algorithms and data structures and/or Unix/Linux systems internals (e.g., filesystems, system calls) and administration.
How to Apply
Apply through Flexi Personnel ATS send your CV to recruit@flexi-personnel.com by 6th May 2022 indicating Site Reliability Engineer as the email subject.
NB: Flexi Personnel does not charge candidates for job placement.