AI Infrastructure Architecture

Ekfrazo Technologies Private Limited
Johannesburg Metropolitan Area, South Africa
28 days ago
Job description

Ekfrazo is your trusted digital partner for tailored IT solutions, with expertise in web and app development, e-commerce, digital marketing, branding, and web security. We support your success on every online platform and through a complete digital transformation. At Ekfrazo, we pay attention to the smallest details of every piece of work we do. We take full responsibility for delivering polished work on time across Software & Product Development, Web & Mobile Apps, Branding, Digital Marketing, IT Consulting, and Recruitment & Staffing. In any domain, we combine your ideas and requirements with our creativity and skills to deliver the best possible work.

The Role

You will be responsible for:

Key Activities & Responsibilities

  • Develop a scalable AI infrastructure strategy aligned with enterprise-wide digital transformation goals, ensuring a high-performance and secure foundation for AI workloads
  • Architect AI infrastructure solutions across cloud and on-prem environments, optimizing flexibility, performance, and security to meet evolving business needs
  • Implement and oversee infrastructure for AI model training, inferencing, and execution frameworks that enhance processing efficiency and overall model performance
  • Design and implement AI infrastructure solutions that scale seamlessly to accommodate enterprise-wide AI adoption, support business growth, ensure high availability, and incorporate future advancements in AI and cloud computing
  • Establish AI infrastructure security SOPs, access control mechanisms, and compliance frameworks to mitigate risks and ensure data protection
  • Design and implement automated MLOps pipelines that streamline AI model deployment, monitoring, retraining, and governance for efficient AI operations
  • Deploy Infrastructure as Code (IaC) solutions using Terraform, Ansible, or equivalent tools to automate AI infrastructure provisioning and scaling
  • Optimize high-performance computing environments, ensuring efficient utilization of GPU, CPU, and storage resources for AI and ML workloads
  • Manage AI workload orchestration using Kubernetes and containerization technologies to enhance scalability and performance across distributed AI environments
  • Enhance AI data storage and processing capabilities by optimizing pipelines, retrieval mechanisms, and data engineering strategies for real-time analytics
  • Work closely with IT and Software COE teams to integrate AI infrastructure seamlessly with enterprise systems and applications
  • Develop AI infrastructure cost optimization strategies, balancing performance, scalability, and budget constraints to maximize ROI
  • Deploy real-time AI infrastructure monitoring tools to continuously track system health, identify performance bottlenecks, detect potential anomalies, and implement proactive optimization measures to enhance system reliability and efficiency
  • Implement AI governance and model version control policies, ensuring regulatory compliance, model integrity, security, ethical AI practices, and proactive risk mitigation
  • Stay ahead of AI infrastructure innovations, emerging cloud technologies, and industry best practices to enhance enterprise AI capabilities

Ideal Profile

Skills and Experience

Education:

  • Bachelor’s degree in Computer Science, IT Infrastructure Engineering, or a related field
  • Certifications in cloud computing (AWS, Azure, GCP), MLOps, or DevOps (preferred)

Experience:

  • 8+ years of experience in IT Infrastructure, Cloud Computing, or AI Systems Architecture
  • Hands-on experience in AI infrastructure design, cloud-based AI solutions, or MLOps
  • Expertise in managing AI workloads across cloud and hybrid environments
  • Proven track record in scaling AI infrastructure for large enterprises
  • Strong experience in Kubernetes, containerization, and orchestration tools
  • Experience in optimizing AI workloads for performance and cost efficiency

Skills:

  • Expertise in AI Infrastructure & Cloud Architecture
  • Strong Understanding of AI Model Deployment & MLOps
  • Advanced Proficiency in Kubernetes & AI Workload Orchestration
  • Hands-on Experience with Cloud Platforms (AWS, Azure, GCP)
  • Proficiency in Infrastructure as Code (Terraform, Ansible)
  • AI Security & Compliance Knowledge
  • AI Infrastructure Cost Optimization Strategies
  • Performance Tuning for AI Systems & Workloads

What's on Offer?

  • Work alongside and learn from best-in-class talent
  • Join a well-known brand in the telecommunications sector
  • Excellent career development opportunities