The advancements in robotics and artificial intelligence have paved the way for sophisticated robotic systems capable of performing complex manipulation and assembly tasks. However, achieving efficient and robust visuomotor policies for robots remains a significant challenge, especially in dynamic and unstructured environments. This proposal aims to explore the potential of diffusion models in learning visuomotor policies, leveraging their ability to handle multimodal action distributions, high-dimensional action spaces, and general-purpose multi-task conditioning by natural language instructions.
Visuomotor policy learning involves mapping visual observations to motor actions, enabling robots to interact with their environment effectively. Traditional approaches often struggle with the complexities of multimodal action distributions and the need for high precision and temporal consistency. The recent introduction of diffusion policies offers a promising solution by employing a conditional denoising diffusion process to generate robot actions. These models have demonstrated superior performance in generating complex behaviors, making them an ideal candidate for robot manipulation and assembly tasks. Moreover, integrating natural language processing (NLP) allows for versatile task conditioning, enabling robots to perform a variety of tasks based on human instructions.
Goals
The primary objectives of this research are:
To successfully undertake the proposed PhD project, a succesful candidate should possess a combination of skills and expertise across several domains. Here are the key skills best suited for this PhD proposal:
Machine Learning and AI:
Robotics:
Computer Vision:
Natural Language Processing (NLP):
Programming Languages:
Proficiency in programming languages such as Python and C++.
Strong software engineering skills, including code versioning (e.g., Git) and documentation.
be fluent in french and/or english
The successful candidate will work in direct collaboration with researchers having an established expertise in computer vision, machine learning and robotics in partnership with high profile international partners (Darmstadt University of Technology in Germany, Nagoya University and Osaka University in Japan). Ecole Centrale de Lyon is part of the top ten engineering schools in France (Grande Ecoles), part of the elite of "Grande Ecoles" offering access to excellent quality graduate and undergraduate students.
Prospective applicants should prepare a comprehensive application package that includes a detailed resume or CV highlighting relevant technical, analytical, and research skills, as well as any prior experience in robotics, machine learning, computer vision, and natural language processing.
Applicants should also submit a statement of purpose that outlines their research interests, motivation for applying, and how their background aligns with the proposed project.
Additionally, candidates are required to provide academic transcripts, two letters of recommendation from faculty or professionals familiar with their work, and any publications or project reports that demonstrate their capabilities.
Applications and letters should be sent via electronic mail to:
Pr Liming Chen (liming.chen@ec-lyon.fr)
(c) GdR IASIS - CNRS - 2024.