Your trusted digital partner for tailored IT solutions with expertise in web & app dev, e-comm, digital marketing, branding, & web security. We enable your overall success on every online platform & a complete digital transformation. At Ekfrazo, we acknowledge the minutest of details that go into every work that we do. To accomplish this, we take all the responsibilities of delivering a perfected work on time which includes Software & Product Development, Web & Mobile Apps, Branding, Digital Marketing, IT Consulting, Recruitment & Staffing. Our areas of expertise include delivering the best possible work based on the ideas and the requirements that you have with a blend of our creativity and skills, be it in any domain.
The Role
You will be responsible for :
Key Activities & Responsibilities
- Develop a scalable AI infrastructure strategy aligned with enterprise-wide digital transformation goals, ensuring a high-performance and secure foundation for AI workloads
- Architect AI infrastructure solutions across cloud and on-prem environments, optimizing flexibility, performance, and security to meet evolving business needs
- Implement and oversee infrastructure for AI model training, inferencing, and execution frameworks that enhance processing efficiency and overall model performance
- Design and implement AI infrastructure solutions that scale seamlessly to accommodate enterprise-wide AI adoption, support business growth, ensure high availability, and incorporate future advancements in AI and cloud computing
- Establish AI infrastructure security SOPs, access control mechanisms, and compliance frameworks to mitigate risks and ensure data protection
- Design and implement automated MLOps pipelines that streamline AI model deployment, monitoring, retraining, and governance for efficient AI operations
- Deploy Infrastructure as Code (IaC) solutions using Terraform, Ansible, or equivalent tools to automate AI infrastructure provisioning and scaling
- Optimize high-performance computing environments, ensuring efficient utilization of GPU, CPU, and storage resources for AI and ML workloads
- Manage AI workload orchestration using Kubernetes and containerization technologies to enhance scalability and performance across distributed AI environments
- Enhance AI data storage and processing capabilities by optimizing pipelines, retrieval mechanisms, and data engineering strategies for real-time analytics
- Work closely with IT and Software COE teams to integrate AI infrastructure seamlessly with enterprise systems and applications
- Develop AI infrastructure cost optimization strategies, balancing performance, scalability, and budget constraints to maximize ROI
- Deploy real-time AI infrastructure monitoring tools to continuously track system health, identify performance bottlenecks, detect potential anomalies, and implement proactive optimization measures to enhance system reliability and efficiency
- Implement AI governance and model version control policies, ensuring regulatory compliance, model integrity, security, ethical AI practices, and proactive risk mitigation
- Stay ahead of AI infrastructure innovations, emerging cloud technologies, and industry best practices to enhance enterprise AI capabilities
Ideal Profile
Skills and Experience
Education :
Bachelor’s degree in Computer Science, IT Infrastructure Engineering, or a related fieldCertifications in cloud computing (AWS, Azure, GCP), MLOps, or DevOps (preferred)Experience :
8+ years of experience in IT Infrastructure, Cloud Computing, or AI Systems ArchitectureHands on experience in AI infrastructure design, cloud-based AI solutions, or MLOpsExpertise in managing AI workloads across cloud and hybrid environments.Proven track record in scaling AI infrastructure for large enterprisesStrong experience in Kubernetes, containerization, and orchestration toolsExperience in optimizing AI workloads for performance and cost efficiencySkills :
Expertise in AI Infrastructure & Cloud ArchitectureStrong Understanding of AI Model Deployment & MLOpsAdvanced Proficiency in Kubernetes & AI Workload OrchestrationHands-on Experience with Cloud Platforms (AWS, Azure, GCP)Proficiency in Infrastructure as Code (Terraform, Ansible)AI Security & Compliance KnowledgeAI Infrastructure Cost Optimization StrategiesPerformance Tuning for AI Systems & WorkloadsWhat's on Offer?
Work alongside & learn from best in class talentJoin a well known brand within TelecommunicationsExcellent career development opportunities