My Photo

I'm a Ph.D. Student in Aerospace Engineering at Iowa State University, advised by Prof. Peng Wei. I received my bachelor's degree from Harbin Institute of Technology with major in Applied Mathematics.

My research interests include Deep Learning, Reinforcement Learning, Markov Decision Process, and Statistical Machine Learning, with applications in Aerial Robotics Motion Planning, Intelligent Air Transportation Systems, and Real-time Decision Making Systems.

Selected Publications


Scalable Multi-Agent Computational Guidance with Separation Assurance for Autonomous Urban Air Mobility

, Peng Wei

In the Journal of Guidance, Control, and Dynamics, 2020

...

Electrical vertical take-off and landing (eVTOL) vehicles are becoming promising for ondemand air transportation in Urban Air Mobility (UAM). However, successfully bringing such vehicles and airspace operations to fruition will require introducing orders-of-magnitude more aircraft to a given airspace volume. Although there are existing solutions for communication technology, onboard computing capability, and sensor technology, the computation guidance algorithm to enable safe, efficient, and scalable flight operations for dense self-organizing air traffic still remains an open question. In this paper, a message-based decentralized computational guidance algorithm is proposed and analyzed for multiple cooperative aircraft by formulating this problem using Multiagent Markov Decision Process and solving it by Monte Carlo Tree Search algorithm. A novel coordination strategy is introduced by using the logit level-k model in behavioral game theory. To achieve higher scalability, we introduce the airspace sector concept into the UAM environment by dividing the airspace into sectors, so that each aircraft only needs to coordinate with aircraft in the same sector. At each decision step, all of the aircraft will run the proposed computational guidance algorithm onboard, which can guide all the aircraft to their respective destinations while avoiding potential conflicts among them. For validation and demonstration, a free-flight airspace simulator that incorporates environment uncertainty is built in an OpenAI Gym environment. Numerical experiment results over several case studies including the roundabout test problem show that the proposed computational guidance algorithm has promising performance even with the high-density air traffic case.
PDF Video Code



placeholder placeholPrioritized Sequence Experience Replay placeholder placehol

Marc Brittain*, Josh Bertram*, *, Peng Wei

* marks equal contribution
In the Neural Information Processing Systems (NIPS) Deep Reinforcement Learning Workshop
Vancouver, Canada, 2019

...

Experience replay is widely used in deep reinforcement learning algorithms and allows agents to remember and learn from experiences from the past. In an effort to learn more efficiently, researchers proposed prioritized experience replay (PER) which samples important transitions more frequently. In this paper, we propose Prioritized Sequence Experience Replay (PSER) a framework for prioritizing sequences of experience in an attempt to both learn more efficiently and to obtain better performance. We compare performance of uniform, PER and PSER sampling techniques in DQN on the Atari 2600 benchmark and show DQN with PSER substantially outperforms PER and uniform sampling.
PDF Code



Multi-Agent Autonomous On-Demand Free Flight Operations in Urban Air Mobility

, Lisen Deng, Peng Wei

In the AIAA Aviation 2019 Forum
Dallas, Texas, 2019

...

In urban air mobility (UAM), flying with electrical vertical takeoff and landing (eVTOL) aircraft will bring fundamental changes to city infrastructures and daily commutes. In order to enable safe and efficient autonomous on-demand free flight operations for the eVTOL aircraft in UAM, a centralized computational guidance algorithm is proposed and analyzed for multi cooperative aircraft. The approach proposed in this paper is to formulate this problem as a Markov Decision Process (MDP) and solve it using an online algorithm Monte Carlo Tree Search (MCTS). A coordination mechanism is designed to manage multiple cooperative aircraft. By generating real-time actions for all the cooperative aircraft to follow, the algorithm can guide all the aircraft to their respective destinations while avoiding potential conflicts between them. For the sake of illustration, a free flight airspace simulator is created to test the performance of this algorithm. Results show that this algorithm can help all the aircraft reach their trip destinations while only having 0.2% conflicts during the flights.
PDF Video Code



Autonomous On-Demand Free Flight Operations in Urban Air Mobility using Monte Carlo Tree Search

, Peng Wei

In the International Conference for Research in Air Transportation (ICRAT)
Barcelona, Spain, 2018

...

Vertical takeoff and landing (VTOL) aircraft for personal air transportation or on-demand air taxi will bring fundamental changes to city infrastructures and daily commutes. NASA, Uber, and Airbus have been exploring the exciting concept of Urban Air Mobility (UAM). In order to enable safe and efficient autonomous on-demand free flight operations in this UAM concept, a computational guidance algorithm was designed and analyzed with collision avoidance capability. The approach is to formulate this problem as a Markov Decision Process and solve it using an online algorithm called Monte Carlo Tree Search. For the sake of illustration, a simplified numerical experiment was created to test the performance of this algorithm. Results show that this algorithm can help aircraft quickly reach the trip destination and avoid conflicts with other aircraft.
PDF Video Code

About


Education

Iowa State University
Aug 2016 - Present
Ph.D., Aerospace Engineering
GPA: 3.93/4.00
Advisor: Peng Wei

Harbin Institute of technology
Aug 2012 - July 2016
B.S., Applied Mathmatics
GPA: 89.88/100

Selected Coursework:
Machine Learning (A)
Deep Learning (A)
Convex Optimization (A)
Linear Programming (A)
Linear System (A)
Optimal Control (A)
Analysis of Stochastic Systems (A)
Intelligent Air Transportation Systems (A)
Real Analysis (97/100)
Advanced Algebra (96/100)
Probability Theory (92/100)
Mathematical Statistics (98/100)
Artificial Intelligence (Online by Pieter Abbeel)
Reinforcement Learning (Online by David Silver)

Research Interests

  • Deep Learning
  • Reinforcement Learning
  • Markov Decision Process
  • Statistical Machine Learning

Work Experience

ByteDance, Mountain View, CA
June 2020 - Aug. 2020
Applied Machine Learning Intern

A^3 by Airbus, Sunnyvale, CA
July 2019 - Sept. 2019
UTM Decision Making Systems Intern

Airmap, Santa Monica, CA
May 2019 - July 2019
Software Engineer Intern

A^3 by Airbus, San Jose, CA
July 2017 - Aug. 2017
Research Intern

Programming Languages & Tools


Contact

Email: xuxiyang1993 at gmail dot com