ML/Ops Engineer with strong Azure cloud experience Remote Position Duration: 12+ months Role Overv

Remote Full-time
This a Full Remote job, the offer is available from: Mali, Georgia (USA) ML/Ops Engineer with strong Azure cloud experience Remote Position Duration: 12+ months Client is seeking a highly skilled ML/Ops Engineer with strong Azure cloud experience to design, build and operate our enterprise Machine Learning Operations (MLOps) platform. In this role you will work closely with data scientists, ML engineers, DevOps teams and business stakeholders to ensure that ML models are delivered into production in a scalable, secure and cost‐efficient way. You will help define best practices and drive automation, monitoring and governance for the ML lifecycle. Key Responsibilities • Architect, build and maintain the ML/Ops infrastructure on Microsoft Azure: including services such as Azure Machine Learning, Azure Databricks, Azure Kubernetes Service (AKS), Azure Functions, Azure Storage, etc. • Design and implement end-to-end CI/CD pipelines (data ingestion → feature engineering → model training → deployment) using Azure DevOps (or equivalent) to streamline model delivery. • Develop Infrastructure as Code (IaC) using Terraform, ARM templates or Bicep to provision and manage cloud resources for ML workloads. • Containerize ML models using Docker, orchestrate via Kubernetes/AKS and deploy as microservices or batch jobs in production. • Monitor production models: track model performance, drift, latency, throughput, costs; use services like Azure Monitor, Azure Log Analytics for observability. • Work with data scientists and engineers to enable feature stores, experiment tracking (e.g., MLflow) and model versioning. • Optimize compute and storage usage: cost-manage cloud resources, right-size clusters, auto-scale jobs. • Ensure platform security, compliance and governance: implement access controls, encryption (at rest/in transit), logging, auditability and data privacy standards. • Collaborate with cross-functional teams: enable reuse of ML platform capabilities across business units, provide documentation, training and support. • Continuously evaluate and adopt new Azure/ML technologies and best practices to improve platform performance, scalability and maintainability. Required Qualifications • Bachelor’s or Master’s degree in Computer Science, Engineering, Data Science, or related field (or equivalent experience). • 5+ years (or adjust to seniority) of experience in a production ML engineering, MLOps, or cloud engineering role. • Proven experience with Azure cloud platform: Azure Machine Learning, Azure Databricks, AKS, Azure Storage/Blob, Azure Functions, Azure DevOps pipelines. • Strong programming skills in Python and familiarity with ML frameworks (TensorFlow, PyTorch, scikit-learn). • Solid experience with containerization (Docker) and orchestration (Kubernetes/AKS). • Hands-on in CI/CD tooling, version control (Git), infrastructure as code (Terraform/ARM/Bicep). • Experience deploying and operating ML models in production environments, including model monitoring, drift detection, logging and instrumentation. • Good data engineering skills: working with SQL/noSQL, ETL/ELT pipelines, large data volumes, feature engineering. • Excellent problem-solving skills, ability to troubleshoot production issues in distributed/cloud systems. • Strong communication and collaboration skills: able to partner with data scientists, software/DevOps engineers and business stakeholders. Preferred Qualifications • Experience with LLMs (large-language models), computer vision or multimodal ML use cases. • Familiarity with other public clouds (AWS/GCP) or hybrid/multi-cloud environments. • Experience with streaming/event-driven architectures (Kafka, Azure Event Grid, Service Bus). • Prior exposure to healthcare, supply chain or pharmaceutical domain. • Certifications such as Microsoft Certified: Azure Solutions Architect, Azure DevOps Engineer Expert or Azure Data Scientist Associate. • Experience with ML observability tools (e.g., Evidently, Prometheus/Grafana) or service mesh for microservices (Istio). Metasys Technologies is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identify, national origin, veteran or disability status. This offer from "Metasys Technologies" has been enriched by Jobgether.com and got a 85% flex score. Apply tot his job
Apply Now

Similar Opportunities

Lead Cloud Platform Engineer

Remote

Operational Technology (OT) Restoration Security Consultant

Remote

Cloud Cyber Security Consultant – Work Remotely

Remote

Associate Cloud Operations Technician

Remote

AWS Cloud Architect/ Engineer

Remote

[Remote] M365 Cloud Security Engineer- Remote (Anywhere in the U.S.)

Remote

SAP Security Lead Architect

Remote

Cybersecurity Analyst - Assessment & Authorization (Hybrid)

Remote

Cloud Security Analyst (Remote)

Remote

Domain Consultant 2 - Cortex/Cloud

Remote

**Experienced Data Entry Assistant / Entry Level – Remote Opportunity at blithequark**

Remote

**Experienced Part-Time Remote Amazon Chat Specialist – Delivering Exceptional Customer Service with blithequark**

Remote

Experienced Data Entry Professional – Remote Full-Time/Part-Time Opportunity for Detail-Oriented Individuals at arenaflex

Remote

**Experienced Part-Time Remote Data Entry Clerk – Unlock Your Potential with Blithequark**

Remote

Remote Inside Sales Associate - Loewen Windows Expert Wanted for High-Growth Company

Remote

Spanish (US) Freelance Editor/QAer

Remote

**Experienced Full Stack Data Engineer – Data Management and Engineering at arenaflex**

Remote

Experienced Customer Service Representative – Remote Full-Time Opportunity for Delivering Exceptional Service and Support

Remote

Entry-Level arenaflex Data Entry Specialist - Launch Your Career with Comprehensive Training and Flexible Scheduling (Part-Time Opportunity)

Remote

MLOps Engineer UT68: Corporate 1 Farm Springs Road, Farmington, CT, 06034 USA

Remote
← Back to Home