CloudWerx

Sr. AI/ML Engineer

Posted Yesterday
Remote
Hiring Remotely in Canada
Senior level
Lead design and deployment of production-grade AI solutions, build multi-agent systems and cloud-native ML infrastructure, apply rigorous statistical evaluation, implement generative and predictive models, champion MLOps and data pipeline best practices, and collaborate with clients to translate business needs into technical solutions.
The summary above was generated by AI

Role Summary

As a Senior AI/ML Engineer, you will lead the design and deployment of high-impact AI solutions, bridging traditional predictive modeling with next-generation Agentic AI. You are as comfortable tuning a Gradient Boosted Tree for structured data as you are architecting a multi-agent system for complex reasoning tasks. You will collaborate directly with clients to understand their needs, translate business challenges into technical solutions, and provide expert guidance on AI/ML best practices. The role requires overseeing dataset quality for model training and leveraging GCP for efficient model deployment and scaling. You will use the Google AI ecosystem (Vertex AI, Google ADK) and orchestration frameworks like LangChain/LangGraph to move AI solutions from experimental demos to reliable, production-grade systems grounded in statistical rigor.

Roles and Responsibilities

  • Architect and implement sophisticated multi-agent systems and autonomous workflows leveraging the Google AI SDK, LangGraph, and LangChain to solve complex, non-linear business processes.
  • Lead the design and construction of cloud-native solutions, using Terraform, Kubernetes, and Docker to ensure that AI models are deployed on scalable, reliable infrastructure.
  • Apply rigorous statistical evaluation frameworks to model performance, moving beyond standard metrics to include uncertainty estimation, calibration, and robust hypothesis testing during model optimization (LoRA, QLoRA).
  • Lead the development of custom predictive models and deep learning solutions, utilizing frameworks like PyTorch and Scikit-Learn to select suitable architectures—whether decision trees, neural nets, or ensembles—based on performance and client criteria.
  • Design and implement state-of-the-art generative models for NLP and multimodal tasks, leveraging tools like OpenCV for image preprocessing and Stable Diffusion concepts where applicable.
  • Champion MLOps best practices within the team, building validated data pipelines and CI/CD/CT workflows using Kubeflow and Vertex AI Pipelines to ensure model quality and integrity.
  • Collaborate directly with clients to understand their unique needs, translating business challenges into technical solutions and providing expert guidance on dataset management best practices.
  • Personally tackle the most difficult engineering challenges, identifying technical risks such as overfitting or latency issues, and optimizing hyperparameters to ensure precision and interpretability.

Required Experience and Qualifications

  • 7+ years of technical experience, including at least 3 years focused on ML/AI and 1 year in a consulting capacity.
  • Experience building and evaluating agentic loops, including tool-use (function calling), self-reflection, and multi-step reasoning architectures.
  • Deep proficiency in the modern Python AI stack, including extensive experience with core libraries (NumPy, Pandas, PyTorch) and specialized LLMOps/Agentic tools for monitoring and evaluation (e.g., LangSmith, Braintrust, AgentOps, or HoneyHive).
  • Proven track record of building AI/ML solutions for users, including experience with common GenAI solutions such as Vertex AI, the OpenAI API, and vector database technologies.
  • Strong foundation in probabilistic modeling, Bayesian statistics, and experimental design (A/B testing for AI) to ensure model reliability and groundedness.
  • Excellent verbal and written communication skills, with the ability to confidently articulate complex AI concepts to business, technical, and non-technical stakeholders.

Education/Certifications

  • Google Cloud: Professional Machine Learning Engineer
  • Industry: Databricks Certified Machine Learning Professional OR DeepLearning.AI Generative AI Specialization.
  • Education in: Computer Science, Mathematics, Machine Learning/Data Science


What We Offer

  • Competitive Compensation – Market-aligned salary reflecting your expertise and impact.
  • Remote Work Flexibility – Work in a remote-first organization while still collaborating at multiple offices globally.
  • Comprehensive Health Benefits – Medical, dental, vision, and wellness coverage for you and your family.
  • Flexible Paid Time Off – Take the time you need with a results-focused approach.
  • Professional Development – Work with industry leaders and learn from the most talented engineers in the industry.
  • Google Cloud Training & Certifications – Access to leading cloud education resources.
  • High-Impact Client Work – Enterprise engagements shaping the future of Cloud, Data Platforms, and Agents.
  • Collaborative Culture – Professional, transparent, and team-oriented environment.

Our Diversity and Inclusion Commitment

At CloudWerx, we are dedicated to creating a workplace that values and celebrates diversity. We believe that a diverse and inclusive environment fosters innovation, collaboration, and mutual respect. We are committed to providing equal employment opportunities for all individuals, regardless of background, and actively promote diversity across all levels of our organization. We welcome people from all walks of life and are committed to building a team that embraces and mirrors a wide range of perspectives and identities. Join us in our journey toward a more inclusive and equitable workplace.


Background Check Requirement 

All candidates for employment will be subject to pre-employment background screening for this position. All offers are contingent upon the successful completion of the background check. For additional information on the background check requirements and process, please reach out to us directly.


Our Story 

CloudWerx is an engineering-focused cloud consulting firm born in Silicon Valley, in the heart of hyper-scale and innovative technology. We help businesses that want to architect, migrate, optimize, secure, or cut costs in their cloud environments. Our team has unique experience working in some of the most complex cloud environments at scale and can help businesses accelerate with confidence.

Top Skills

Python, NumPy, Pandas, PyTorch, Scikit-Learn, LoRA, QLoRA, OpenCV, Stable Diffusion, Vertex AI, Google AI SDK, Google ADK, Google Cloud Platform, GCP, LangChain, LangGraph, LangSmith, Braintrust, AgentOps, HoneyHive, OpenAI API, Vector Databases, Terraform, Kubernetes, Docker, Kubeflow, Vertex AI Pipelines
