Machine Learning Researcher

Biography

Sanket Kamthe is a third-year PhD student at Imperial College London. He is focusing on reinforcement learning for robotics and control for his PhD. He is particularly interested in Safe Model-based Reinforcement Learning, where the agent learns to perform tasks while being aware of risks and uncertainties. He primarily works with Gaussian process models for uncertainty quantification.

Interests

Reinforcement Learning
Gaussian Processes
Probabilistic Modelling
Variational Inference

Education

PhD in Computer Science, 2020
Imperial College London
MRes in Advanced Computing, 2016
Imperial College London
MSc in Information and Communication Engineering, 2014
Technische Universität Darmstadt
B.Eng. in Electronics & Telecommunications, 2008
University of Pune

Experience

Senior Associate

J P Morgan Chase

Sep 2020 – Present London, United Kingdom

Hardware software co design for multiplatform deployments of machine learning applications. Privacy preserving machine learning for business applications

Research Intern

J P Morgan Chase

Oct 2019 – Apr 2020 London, United Kingdom

Synthetic Data generation for privacy

Machine Learning Researcher

PROWLER.io

Apr 2018 – Sep 2019 Cambridge, United Kingdom

Data efficient reinforcement learning

Marie Curie Research Fellow

Dept. of Applied Mathematics, University of Twente

Apr 2014 – Sep 2015 Enscehde, Netherlands

Inference in high dimensional state space models.

Junior Research Fellow

Tata Institute of Fundamental Research

Mar 2010 – Aug 2011 Mumbai, India

Simulation, modelling and characterization of silicon based Single Photon Avalanche Detectors (SPAD) for CERN and TIFR, Ooty labs.

Project Engineer

Wipro Technologies

Nov 2008 – Feb 2010 Bangalore, India

Worked as a design engineer for Nortel Networks digital telephone systems. Managed a hardware product design cycle spread over 9 months and 3 continents from concept to field trials.

Featured Publications

sanket kamthe, marc deisenroth

April 2018 Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics

Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control

Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, especially with the advent of deep neural networks. However, the majority of autonomous RL algorithms require a large number of interactions with the environment. A large number of interactions may be impractical in many real-world applications, such as robotics, and many practical systems have to obey limitations in the form of state space or control constraints. To reduce the number of system interactions while simultaneously handling constraints, we propose a model-based RL framework based on probabilistic Model Predictive Control (MPC). In particular, we propose to learn a probabilistic transition model using Gaussian Processes (GPs) to incorporate model uncertainty into long-term predictions, thereby, reducing the impact of model errors. We then use MPC to find a control sequence that minimises the expected long-term cost. We provide theoretical guarantees for first-order optimality in the GP-based transition models with deterministic approximate inference for long-term planning. We demonstrate that our approach does not only achieve state-of-the-art data efficiency, but also is a principled way for RL in constrained environments.

PDF