Job Overview
-
Date PostedMay 4, 2026
-
Country
-
Expiration date--
Job Description
**Join Moniepoint: Pioneering Financial Inclusion in Africa**
Moniepoint Incorporated is rapidly transforming the African business landscape as a leading global business payments and banking platform, proudly recognized as QED Investors’ inaugural investment on the continent. We empower over 600,000 businesses, from burgeoning startups to established enterprises, by equipping them with the essential tools to achieve their growth ambitions and foster greater economic participation. We are seeking a dynamic and experienced Team Lead, Site Reliability Engineering to join our growing team in Nigeria and play a pivotal role in ensuring the robust and scalable operation of our critical infrastructure.
Job Overview
As the Team Lead for Site Reliability Engineering (SRE) at Moniepoint, you will be instrumental in shaping and executing our strategy for highly available, performant, and resilient systems. You will lead a talented team of SREs, fostering a culture of continuous improvement, innovation, and proactive problem-solving. This role demands a deep understanding of distributed systems, cloud-native architectures, and the unique challenges of operating at scale within a fast-paced fintech environment. You will champion SRE best practices, drive automation initiatives, and ensure the reliability of the platforms that power countless businesses across Nigeria and beyond.
Key Responsibilities
* Lead, mentor, and grow a high-performing Site Reliability Engineering team, fostering a collaborative and technically excellent environment.
* Design, implement, and maintain robust, scalable, and highly available infrastructure and systems to support Moniepoint’s rapidly growing product suite.
* Develop and enforce Service Level Objectives (SLOs) and Service Level Indicators (SLIs), proactively monitoring system health and performance.
* Drive the adoption of automation for infrastructure provisioning, deployment, monitoring, and incident response to reduce manual toil and improve efficiency.
* Champion a blameless post-mortem culture, conducting thorough incident analysis and implementing preventative measures.
* Collaborate closely with development teams to embed SRE principles into the software development lifecycle, ensuring reliability by design.
* Define and implement disaster recovery and business continuity plans to ensure minimal downtime and data loss.
* Evaluate and integrate new technologies and tools to enhance system reliability, security, and operational efficiency.
Requirements
* Proven experience in a leadership role within Site Reliability Engineering or a related infrastructure operations discipline.
* Deep understanding of cloud computing platforms (e.g., AWS, GCP, Azure) and containerization technologies (e.g., Docker, Kubernetes).
* Extensive experience with infrastructure as code (IaC) tools (e.g., Terraform, Ansible) and CI/CD pipelines.
* Proficiency in at least one scripting or programming language (e.g., Python, Go, Java) for automation and tooling.
* Strong grasp of monitoring, alerting, and logging frameworks (e.g., Prometheus, Grafana, ELK stack).
* Demonstrated ability to diagnose and resolve complex technical issues in high-pressure environments.
* Excellent communication, interpersonal, and problem-solving skills.
What We Offer
* The opportunity to be at the forefront of financial technology innovation in Africa, impacting the growth of thousands of businesses.
* A dynamic and collaborative work environment with a team of passionate and talented professionals.
* Continuous learning and development opportunities, with support for professional growth and skill enhancement.
* A competitive compensation and benefits package, recognizing your expertise and contribution.
* The chance to work on challenging problems at scale, shaping the future of payments and banking in a rapidly evolving market.