Cantina Labs

AI Research Engineer, Computer Vision

Posted An Hour Ago

Be an Early Applicant

Remote

Hiring Remotely in Canada

Mid level

Remote

Hiring Remotely in Canada

Mid level

As an AI Research Engineer, you'll implement image and video generation models, manage training pipelines, and optimize performance while collaborating with researchers.

The summary above was generated by AI

About Cantina:

Cantina Labs is a social AI company, developing a suite of advanced real-time models that push the boundaries of expression, personality, and realism. We bring characters to life, transforming how people tell stories, connect, and create. We build and power ecosystems. Cantina, our flagship social AI platform, is just the beginning.

If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!

About the Role:

We are looking for a talented AI Research Engineer to join our computer vision research team. In this role, you will work closely with our research team, implementing, training, and evaluating state-of-the-art image and video generation models. You will own the engineering execution that turns research ideas into working systems: building robust data pipelines, running and stabilizing large-scale training, implementing models from papers, optimizing for speed/efficiency, and running rigorous evaluations, all to accelerate our core image and video generation models.

This is a high-impact implementation and execution role. This role is ideal for engineers who enjoy building reliable ML systems and scaling research ideas into production-quality training pipelines. The ideal candidate is someone who gets deep satisfaction from making complex systems work, translating research ideas into reliable, scalable code, debugging training instabilities, and delivering measurable improvements in training stability, model quality, and inference efficiency. This is an excellent opportunity to work closely with experienced researchers, gain deep hands-on exposure to cutting-edge model training techniques, latest research methods in diffusion/transformer-based generation, large-scale experimentation, and efficiency innovations, all while contributing directly to production-grade models.

What You’ll Do:

Build and maintain end-to-end data pipelines for large-scale image and video datasets: collection, filtering, augmentation, conditioning alignment, and efficient storage/sampling.
Implement model architectures (diffusion, autoregressive, flow-based, diffusion transformers, etc.) and maintain high-throughput PyTorch training loops for large-scale image and video diffusion models.
Run and manage large-scale training experiments on multi-GPU and multi-node setups (DDP, FSDP, DeepSpeed). Debug training instabilities, loss spikes, and convergence issues.
Apply quantization, pruning, and knowledge distillation techniques to compress models without sacrificing quality.
Collaborate with researchers and translate state-of-the-art research papers into working implementations in our internal codebase (e.g., new attention mechanisms, sampling schedules, or conditioning methods).
Build and maintain evaluation pipelines of image quality, video consistency, and perceptual metrics.
Set up and maintain human annotation and evaluation pipelines using services like AWS GroundTruth.
Profile and optimize training speed, GPU memory utilization, and iteration time. Implement inference optimizations to reduce latency and compute cost.
Work with acceleration toolchains such as torch.compile, Triton, TensorRT, or ONNX where appropriate

What You’ll Bring:

2–5 years of hands-on experience building and training ML systems, with strong ownership of results
Fluency in PyTorch: comfortable reading, writing, and debugging both training and inference code.
Experience training or fine-tuning generative models (diffusion models, transformers, VAEs, or similar) from scratch or near-scratch
Solid understanding of distributed training workflows and practical debugging of large training runs
Demonstrated ability to read and implement AI research papers in computer vision. Familiarity with cutting-edge computer vision models and research literature in the image and video domain.
Experience building data pipelines for large-scale image or video datasets
Strong debugging skills: comfortable diagnosing both engineering bugs and training failures
Strong engineering mindset: writing clean, reliable, debuggable code; profiling tools; handling numerical issues at scale.

Compensation:

The anticipated annual base salary range for this role is between $170,000-$210,000. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.

Benefits:

Competitive salary and generous company equity
Medical, dental, and vision insurance – 99.99% of premiums covered by Cantina
42 days of paid time off, including:
- 15 PTO days
- 10 sick days
- 15 company holidays
- 2 floating holidays
Generous parental leave & fertility support
401(k) retirement savings plan
Lifestyle spending account – $500/month to use however you’d like
Complimentary lunch and snacks for in-office employees
One Medical membership, and more!

Top Skills

AWS

Onnx

PyTorch

Tensorrt

Triton

Similar Jobs

Wells Fargo

Regional Coach California Division

15 Minutes Ago

Remote or Hybrid

Panorama, BC, CAN

Senior level

Fintech • Financial Services

Coach district and branch leaders to improve branch performance through scalable workshops, targeted coaching, and execution of the behavior framework, management cycle, affluent and small-business priorities. Support rollout pilots, analyze opportunities, and advise Regional and District Managers to drive customer experience and business growth.

Coinbase

Senior Security Engineer

An Hour Ago

Easy Apply

Remote

Canada

Easy Apply

Senior level

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3

The Senior Security Engineer will automate IAM processes, prototype AI solutions for access management, and ensure security standards in architecture. Responsibilities include developing tools in Go and guiding change towards the automated security posture.

Top Skills: Ci/CdGitGoNode.jsPython

Superhuman

Product Manager

2 Hours Ago

Easy Apply

Remote or Hybrid

Canada

Easy Apply

Senior level

Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI

Drive product strategy and craft product experiences, collaborate with design and engineering, own projects from ideation to launch, and embody customer voice.

Top Skills: AICollaboration ToolsProductivity SoftwareUser Experience Design

What you need to know about the Ottawa Tech Scene

The capital city of Canada and the nation's fourth-largest urban area, Ottawa has proven a rapidly growing global tech hub. With over 1,800 tech companies, many of which are leaders in their sectors, the city's tech talent now makes up more than 13 percent of its total workforce. This growth is driven not only by the big players like UL Solutions and Dropbox, but also by a thriving startup ecosystem, as new businesses emerge to follow in the footsteps of those that came before them.