Level AI

Research Intern – Reinforcement Learning (RL) - Onsite

Posted 21 Days Ago
In-Office or Remote
Hiring Remotely in CA
Internship
As a Research Intern, you'll build reinforcement learning environments and agents, define reward models using real-world data, and collaborate on deploying learning systems.
The summary above was generated by AI

🚀 Build the next generation of Agentic AI with us

Our platform combines conversation intelligence, multimodal understanding, and agentic AI systems to power both human agents and autonomous AI agents across the entire customer experience lifecycle.

A core part of this vision is our investment in custom Small Language Models (SLMs)—purpose-built for CX workflows—paired with reinforcement learning systems that continuously improve decision-making in real-world environments.

We’re looking for a Research Intern (Reinforcement Learning) to join us in shaping this future.

What you’ll do
 
  • Design and build reinforcement learning environments that model real-world customer interaction workflows

  • Design RL agents that learn from these environments using real-world interaction data, rewards, and feedback loops

  • Define reward models and feedback loops using real-world signals (outcomes and human feedback)

  • Enable learning from production data by structuring interaction traces into training-ready datasets for offline and online learning

  • Experiment with multi-agent systems and simulation frameworks for complex coordination and decision-making

  • Collaborate with engineering and product teams to deploy, evaluate, and iterate on learning systems in production at scale

 

What we’re looking for
  • Currently pursuing (or recently completed) a degree in Computer Science, AI, Machine Learning, or a related field

  • Strong understanding of reinforcement learning fundamentals

  • Familiarity with RL environments and training libraries such as Verl and Tinker

  • Strong foundation in probability, math, and optimization

  • Passion for building real-world AI systems

Nice to have
  • Experience with RLHF, LLM/SLM fine-tuning, or model alignment

  • Exposure to agent-based systems or multi-agent RL

  • Prior research, projects, or publications in RL or applied ML

  • Experience working with large-scale or production datasets

 

Why Level AI
  • Work on production-grade Agentic AI systems used by leading enterprises

  • Build alongside a team with deep expertise from Amazon, Google, and Meta

  • Be part of a fast-growing Series C AI company

  • Direct exposure to 0→1 AI innovation in CX and decisioning systems

Top Skills

Agent-Based Systems
Conversation Intelligence
LLM/SLM Fine-Tuning
Multimodal Understanding
Optimization
Probability
Reinforcement Learning
RL Environments
RL Training Libraries
RLHF
Small Language Models
