D
Platform SRE and Reliability Engineer
Deeplight AI
Abu Dhabi, UAEAED 7,000-18,000/moToday
UAEIT & TechnologyFull Time
Skills Required
PythonSqlMongodbAwsAzureDockerKubernetesGitErpSafety
Job Description
DeepLight AI is a specialist AI and data consultancy with extensive experience implementing intelligent enterprise systems across multiple industries, with particular depth in financial services and banking. Our team combines deep expertise in data science, statistical modeling, AI/ML technologies, workflow automation, and systems integration with a practical understanding of complex business operations.The Platform SRE and Reliability Engineer is responsible for ensuring the absolute quality, resilience, and performance of the Bank's next-generation AI and digital platforms. This role focuses on the high-stakes intersection of Site Reliability Engineering (SRE) and AI Quality Assurance, designing automated frameworks to validate everything from Conversational AI agents and RAG pipelines to core banking microservices. By implementing robust continuous testing pipelines and reliability governance, you will guarantee that the Bank's AI-driven experiences remain secure, scalable, and deterministically accurate under real-world conditions.ResponsibilitiesBuilding reusable automation frameworks to test the accuracy, stability, latency, and safety of Conversational AI platforms (voice and chat) and LLM-based agentsValidating multi-agent orchestration, human-in-the-loop escalation logic, and the integrity of RAG pipelines and vector search resultsTesting AI/ML platform components for scaling behavior, failover resilience, high availability, and disaster recoveryIntegrating automated test pipelines into CI/CD workflows for MLOps, focusing on drift detection, retraining validation, and model registry integrityVerifying AI/ML pipelines on Azure AI Foundry and AWS SageMaker, ensuring data integrity across storage services (S3/Blobs) and serverless functionsConducting load testing for AI services and enforcing engineering guardrails for fairness, explainability, and regulatory complianceActing as a bridge between engineering and business, translating complex technical reliability requirements into actionable quality narrativesAs an AI consultancy, our greatest asset is the expertise of our people.While technical mastery is the foundation of what we do, the ability to bridge the gap between complex data science and actionable business value is what defines your success with DeepLight.We're looking for individuals who are not only world-class in their fields of specialism, but also compelling communicators and persuasive advocates for their own skills.You will be the face of our firm, tasked with building trust, articulating the \"why\" behind your technical decisions, and effectively \"selling\" your vision to high-level stakeholders.If you thrive on the challenge of presenting cutting-edge solutions as much as you do on building them, you will fit right in.RequirementsA Bachelor's degree in Computer Science, AI, Software Engineering, or a related quantitative field. A Master's degree in AI/ML is highly preferred5+ years in QA, Application Testing, or Reliability Engineering, ideally for a large-scale brand or digital-only bankProven track record in deploying AI/ML QA solutions at an enterprise scale within the financial services sectorExperience testing distributed architectures, microservices, and large-scale data platforms (Vector DBs, Data Lakes)Expertise in Python-based automation frameworks and tools such as Selenium, Playwright, PyTest, JMeter, and LocustA deep understanding of LLM evaluation frameworks, prompt stability testing, and hallucination avoidance validationHands‑on experience testing and validating services across both Azure and AWS cloud environmentsStrong SQL/NoSQL validation skills (Postgres, MongoDB) and experience testing REST, GraphQL, and FastAPI integrationsBe proficient in testing within Docker and Kubernetes (EKS/AKS) environmentsIt would be beneficial if you also had:An ability to evaluate and adopt emerging QA tools for AI frameworks like LangChain, CrewAI, and BedrockAn understanding of cutting‑edge quality trends, including multimodal QA and RLHF (Reinforcement Learning from Human Feedback) output evaluationA proactive approach to identifying edge cases in AI agents that could impact banking compliance or customer experienceA strong ability to coordinate with different functional teams to implement models and monitor outcomesBenefitsBenefits & Growth Opportunities:Competitive salaryVisa sponsorship for the successful individualComprehensive health insurance for the successful individualProfessional development and certification supportOpportunity to work on cutting‑edge AI projectsCareer advancement opportunities in a rapidly growing AI companyThis position offers a unique opportunity to shape the future of AI implementation while working with a talented team of professionals at the forefront of technological innovation. The successful candidate will play a crucial role in driving our company's success in delivering transformative AI solutions to our clients.At DeepLight AI, we recognise
Similar Opportunities
T
Digital Security Architecture with Banking /Fintech Domain
TAT IT Technolgies
Abu Dhabi, UAEAED 8,000-22,000/moToday
UAEIT & Technology
W
Web Developer
Work in USA
Abu Dhabi, UAEAED 6,000-18,000/moToday
UAEIT & Technology
D
Lead Data Engineer – Health Tech AI, Hybrid Cloud
Discovered MENA
Abu Dhabi, UAEAED 7,000-18,000/moToday
UAEIT & Technology
B
Senior Zoho Developer
Blue Ocean Academy
Dubai, UAEAED 7,000-20,000/moToday
UAEIT & Technology
A
Enterprise Tech Sales Specialist: Water Infra
Autodesk Middle East
UAEAED 6,000-16,000/moToday
UAEIT & Technology
V
ESMM Developer: Angular/Azure SQL | Agile, Full-time
VAM Systems
Abu Dhabi, UAEAED 7,000-20,000/moToday
UAEIT & Technology