Open to ML Research Roles · 2026

Prabhjyot
Singh

ML Researcher & Engineer at the University of Waterloo.
Ethical AI · Reinforcement Learning · Autonomous Systems

About Me

I'm a Master's student at the University of Waterloo researching ethical AI and reinforcement learning under the UWECEML Lab. My work focuses on building evaluation frameworks that surface misaligned agent behavior and training methods that respect human values.

Across 6 co-op work terms, I've applied ML and software engineering in industry — from building PyTorch computer vision models at Kindred AI to designing LLM-powered agentic systems at BrainRidge Consulting. I enjoy sitting at the intersection of research and engineering.

Outside of research, I'm drawn to the philosophical questions behind alignment: what does it mean for an AI to behave "ethically," and how do we measure that rigorously?

AI / ML
PyTorch TorchRL JAX TensorFlow Scikit-Learn Hugging Face OpenAI APIs Reinforcement Learning
Languages
Python C / C++ TypeScript JavaScript R
Infrastructure
Docker AWS Terraform Redis NestJS Angular GitHub Actions CI/CD
Robotics & Embedded
FANUC / ABB KAREL RTOS Webots Vivado

Work History

6 co-op terms and industry experience across ML, robotics, and software engineering.

UW

Graduate Research Assistant

UWECEML Lab, University of Waterloo · Waterloo, Canada
Oct 2025 – Present Research
  • Researching ethical AI within reinforcement learning environments on the Moral AI Systems team — defining experiment protocols and evaluation criteria.
  • Building Craftax RL experiments and JAX training/evaluation pipelines for agent-behavior analysis and benchmarking.
  • Developing a Compute Canada compatible framework for scalable, reproducible ethical-AI experiments (configs, seeding, logging, batch runs).
BR

Software Engineer

BrainRidge Consulting · Toronto, Canada
May 2025 – Sep 2025 Industry
  • Designed and developed LLM-powered agents using Claude Sonnet 4, implementing advanced prompt engineering and validation loops for reliable structured outputs.
  • Built and deployed scalable NestJS microservices enabling secure GitHub and Jira REST API integration for automated issue and repository management.
  • Architected a role-based authentication system with Auth0, Redis, and JWT for consistent RBAC across distributed services.
LE

Robotics Software Developer

Lincoln Electric Automation · Waterloo, Canada
Sep 2024 – Dec 2024 Industry
  • Developed and optimized embedded firmware for FANUC and ABB robotic systems in TypeScript, C++, and KAREL.
  • Improved trajectory planning and motion control algorithms, reducing erratic robotic movement by 30%.
  • Led a codebase refactoring initiative, reducing file count by 20% and improving overall architecture clarity.
XTL

System Analyst

XTL Transport Inc · Toronto, Canada
Jan 2024 – Apr 2024 Industry
  • Deployed an ITSM solution reducing ticket turnaround time by 45% and standardizing support workflows.
  • Rolled out networked digital signage across five warehouses, linking plug-and-play devices into a centralized dashboard.
KAI

Robotics Test Engineer

Kindred AI · Toronto, Canada
Jan 2023 – Apr 2023 Industry
  • Designed and implemented a PyTorch supervised learning model to adjust image brightness/contrast, reducing segmentation error from 30% to 10%.
  • Expanded automated end-to-end test coverage to 95% using Python and Cucumber in a virtual simulation environment.
  • Uncovered a 30% error rate in low-light scenarios, driving firmware calibration improvements for enhanced sensor reliability.
VTS

QA Engineer

View The Space Inc · Remote
May 2022 – Aug 2022 Industry
  • Implemented automated regression testing with Cypress, reducing manual QA effort by 15%.
  • Collaborated with product and engineering teams to improve documentation and accelerate feature releases by 20%.
PL

QA Engineer

Plooto Inc · Toronto, Canada
Sep 2021 – Dec 2021 Industry
  • Built automated test suites in JavaScript with Ghost Inspector to validate OCR workflows.
  • Optimized HubSpot web pages (HTML/CSS/JS), boosting Lighthouse performance scores by 30 points.
FO

Fullstack Engineer

FleetOperate · Remote
Jan 2021 – Apr 2021 Industry
  • Developed AWS-integrated REST APIs and built responsive frontends with Angular, TypeScript, and CSS.
  • Configured Ubuntu and AWS Linux servers for scalable, production-ready deployments.

Selected Work

Projects spanning ML systems, edge computing, human-robot interaction, and reinforcement learning.

Aether-Edge

Decentralized edge-native building management system reducing Age of Information from 19.4s (centralized) to near-zero.

NAO Robot Teacher Gender Study

Webots simulation investigating gender bias in human-robot interaction within educational environments using a humanoid NAO robot.

Aegis Lights

Self-adaptive urban traffic control system achieving a 45–49% reduction in average trip time across all traffic scenarios.

Canary

IoT personal air quality monitor — custom PCB, embedded BLE firmware, 3D-printed enclosure, and Android companion app. Built as a 4th year capstone.

MTG Archetype Predictor

ML classifier that predicts Magic: The Gathering commander deck archetypes by integrating Scryfall and EDHRec community data.

Ethical AI & Reinforcement Learning

Building AI systems that are both capable and aligned with human values. Currently at the UWECEML Lab, University of Waterloo.

Ethical AI & Value Alignment

Investigating how reinforcement learning agents can learn to respect human values and moral constraints — through reward modeling, constrained optimization, and alignment verification.

Agent Behavior Analysis

Designing rigorous evaluation protocols that measure ethical behavior in RL environments. Building reproducible benchmarks on Craftax to surface and quantify misaligned agent behaviors.

Autonomous & Adaptive Systems

Exploring how intelligent systems can dynamically adapt to complex environments while maintaining behavioral guarantees — from self-adaptive control loops to edge-native sensing.

Research in Progress

Methods for Training and Evaluating Ethical Reinforcement Learning Behaviour

UWECEML Lab · University of Waterloo · Moral AI Systems Team · Oct 2025 – Present
  • Researching ethical AI within reinforcement learning environments — defining experiment protocols and evaluation criteria for agent moral behavior.
  • Building Craftax RL experiments and JAX training/evaluation pipelines for large-scale agent-behavior analysis and benchmarking.
  • Developing a Compute Canada compatible framework for scalable, reproducible ethical-AI experiments with full configuration management, seeding, and batch run support.
Reinforcement Learning JAX Craftax Ethical AI Value Alignment Compute Canada Python

Academic Background

University of Waterloo, one of Canada's top engineering programs.

Master of Engineering
Electrical & Computer Engineering — AI Specialization
University of Waterloo
Sep 2025 – Present
GPA 84.67
Coursework
Algorithm Design & Analysis Intro to Artificial Intelligence Intelligent Sensors & Networks Self-Adaptive Software Systems Distributed & Network-Centric Computing Social Robotics
Bachelor of Applied Science
Computer Engineering — AI Option, Honours, Co-op
University of Waterloo
Sep 2020 – Apr 2025
Graduated
6 Co-op Terms
Awards
University of Waterloo President's Scholarship — 2021
Selected Coursework
Reinforcement Learning Introduction to Machine Learning Autonomous Vehicles Engineering Design Project (97) Compilers (87) Computer Architecture (84) Embedded Software Linear Algebra Probability & Statistics

Get In Touch

I'm currently seeking ML research internships and full-time roles for 2026. If you're working on interesting problems in AI alignment, RL, or autonomous systems — I'd love to connect.