Level AI Jobs

Research Intern – Reinforcement Learning (RL) - Onsite

Level AI

Research Intern – Reinforcement Learning (RL) - Onsite

Reposted 11 Hours Ago

In-Office or Remote

Hiring Remotely in CA

Internship

In-Office or Remote

Hiring Remotely in CA

Internship

As a Research Intern, you'll build reinforcement learning environments and agents, define reward models using real-world data, and collaborate on deploying learning systems.

The summary above was generated by AI

🚀 Build the next generation of Agentic AI with us

Our platform combines conversation intelligence, multimodal understanding, and agentic AI systems to power both human agents and autonomous AI agents across the entire customer experience lifecycle.

A core part of this vision is our investment in custom Small Language Models (SLMs)—purpose-built for CX workflows—paired with reinforcement learning systems that continuously improve decision-making in real-world environments.

We’re looking for a Research Intern (Reinforcement Learning) to join us in shaping this future.

What you’ll do

Design and build reinforcement learning environments that model real-world customer interaction workflows.
Design RL agents that learn from these environments using real-world interaction data, rewards, and feedback loops
Define reward models and feedback loops using real-world signals (outcomes and human feedback)
Enable learning from production data by structuring interaction traces into training-ready datasets for offline and online learning
Experiment with multi-agent systems and simulation frameworks for complex coordination and decision-making
Collaborate with engineering and product teams to deploy, evaluate, and iterate on learning systems in production at scale.

What we’re looking for

Currently pursuing (or recently completed) a degree in Computer Science, AI, Machine Learning, or related field
Strong understanding of reinforcement learning fundamentals
Familiarity with RL environments and training libraries such as Verl and Tinker
Strong foundation in probability, math, and optimization
Passion for building real-world AI systems

Nice to have

Experience with RLHF, LLM/SLM fine-tuning, or model alignment
Exposure to agent-based systems or multi-agent RL
Prior research, projects, or publications in RL or applied ML
Experience working with large-scale or production datasets

Why Level AI

Work on production-grade Agentic AI systems used by leading enterprises
Build alongside a team with deep expertise from Amazon, Google, and Meta
Be part of a fast-growing Series C AI company.
Direct exposure to 0→1 AI innovation in CX and decisioning systems

Similar Jobs

Webflow

Staff Engineer

11 Hours Ago

Easy Apply

Remote

Easy Apply

Senior level

Artificial Intelligence • Enterprise Web • Software • Design • Generative AI

As a Senior Staff Engineer at Webflow, you'll architect scalable AI products, partner with leadership for technical strategy, and mentor engineers to elevate architectural standards.

Top Skills: AWSGCPGoKubernetesNode.jsPulumiTerraformTypescript

Forward Financing

Senior Data Scientist

11 Hours Ago

Remote

Alberta, AB, CAN

Senior level

Fintech • Financial Services

Build, deploy, and monitor advanced statistical and machine learning models (credit risk, pricing, collections, fraud). Partner with cross-functional teams to integrate models into production, produce production-grade code, and communicate results to technical and non-technical stakeholders.

Top Skills: ArizeAWSDatabricksGitMetaflowPythonSagemakerSnowflakeSQLTaktileTecton

Cencora

Seasonal Associate/ Student/Coop

11 Hours Ago

Remote

Yukon, YT, CAN

Entry level

Healthtech • Logistics • Pharmaceutical

Assist in various responsibilities based on the department's needs while developing interpersonal and project management skills. Must be enrolled in a post-secondary program, with a flexible working schedule between 8 and 40 hours per week.

Top Skills: ExcelMicrosoft OutlookPowerPoint

What you need to know about the Ottawa Tech Scene

The capital city of Canada and the nation's fourth-largest urban area, Ottawa has proven a rapidly growing global tech hub. With over 1,800 tech companies, many of which are leaders in their sectors, the city's tech talent now makes up more than 13 percent of its total workforce. This growth is driven not only by the big players like UL Solutions and Dropbox, but also by a thriving startup ecosystem, as new businesses emerge to follow in the footsteps of those that came before them.