Job Overview
-
Date PostedJune 2, 2026
-
Country
-
Expiration date--
Job Description
CapaCloud is seeking a talented and experienced AI Infrastructure / ML Engineer to join our dynamic, remote team. In this role, you will play a pivotal part in designing, building, and maintaining the robust infrastructure that powers our cutting-edge AI and machine learning solutions. If you are passionate about enabling scalable and efficient ML operations and possess a strong academic background coupled with practical experience, we encourage you to apply.
Job Overview
This remote position is crucial for ensuring the reliability, scalability, and performance of CapaCloud’s AI/ML platforms. You will work collaboratively with our engineering and data science teams to streamline the ML lifecycle and foster innovation.
Key Responsibilities
- Design, implement, and manage scalable AI/ML infrastructure.
- Develop and maintain MLOps pipelines for model training, deployment, and monitoring.
- Optimize compute resources and storage for AI workloads.
- Ensure the security and compliance of ML systems.
- Automate deployment, scaling, and management of ML models.
- Troubleshoot and resolve infrastructure-related issues.
Requirements
- Master’s Degree in Computer Science, Engineering, or a related field.
- Proven experience with AI/ML infrastructure.
- Demonstrated expertise in MLOps principles and tools.
- Strong understanding of cloud platforms (e.g., AWS, Azure, GCP).
- Proficiency in containerization technologies (e.g., Docker, Kubernetes).
- Excellent problem-solving and analytical skills.
What We Offer
- The opportunity to work on groundbreaking AI technologies.
- A fully remote work environment with flexible hours.
- A collaborative and innovative company culture.
- Continuous learning and professional development opportunities.
- Competitive compensation package.