AI Infrastructure Engineer
Source Technology - Alameda, CA
Job Description
We are seeking a highly skilled AI Infrastructure Engineer to join our team on a contract basis. The ideal candidate will have experience in designing, deploying, and managing scalable infrastructure for AI and machine learning (ML) applications. This role will focus on optimizing workflows, ensuring system reliability, and enabling seamless integration of AI solutions into production environments.

Key Responsibilities
- Design and implement infrastructure to support large-scale AI and ML workflows.
- Develop and optimize pipelines for data processing, model training, and deployment.
- Manage and monitor cloud or on-premises infrastructure for AI solutions.
- Collaborate with AI/ML engineers to ensure smooth deployment and scaling of models.
- Establish best practices for AI infrastructure, including containerization, orchestration, and CI/CD for ML workflows.
- Ensure system security and data compliance throughout the AI/ML lifecycle.
- Troubleshoot infrastructure issues and maintain system reliability.

Required Skills & Experience
- 5+ years of experience in infrastructure engineering with a focus on AI/ML environments.
- Proficiency in cloud platforms (e.g., AWS, Azure, GCP) and on-premises systems.
- Expertise in containerization tools like Docker and orchestration platforms like Kubernetes.
- Hands-on experience with ML workflow tools such as MLflow, Kubeflow, or Airflow.
- Strong scripting skills in Python, Bash, or similar languages.
- Familiarity with distributed systems and frameworks (e.g., Apache Spark, Ray).
- Experience with monitoring and logging tools like Prometheus, Grafana, or ELK Stack.
- Understanding of data governance and security practices in AI pipelines.

Preferred Skills
- Experience with GPUs and hardware optimization for AI workloads.
- Knowledge of data versioning tools like DVC or Delta Lake.
- Familiarity with MLOps practices and tools.
- Prior experience in deploying large language models or other advanced AI systems.

Why Join Us?
- Work on cutting-edge AI/ML infrastructure projects.
- Collaborate with a team of innovative professionals.
- Opportunity to shape the foundation of scalable AI systems.
Created: 2025-01-26