Reinforcement learning for path following of an AirSim quadrotor in a Unity city environment
| Figure 1: training started | Figure 2: 4,000 time steps | Figure 3: 12,000 time steps |
- Download the Unity package containing the customized city environment and the AirSim drone;
- Directly load the pre-trained model by running `python Model_Load.py` in the terminal, and you will see the drone following the city road; a sketch of such a loading script is shown below.
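`Model_Load.py` is not reproduced here. As a rough, hypothetical sketch of what such a loading script usually looks like, assuming the model was trained with stable-baselines3 DQN on a gym-wrapped AirSim environment (as in the AirSim reinforcement_learning example credited below) — the environment id and checkpoint filename are assumptions, not taken from this repo:

```python
# Hypothetical sketch only: the environment id and checkpoint name below
# are assumptions, not taken from this repository.
import gym
from stable_baselines3 import DQN

# Env id registered by the AirSim reinforcement_learning example;
# the id used by this project may differ.
env = gym.make("airgym:airsim-drone-sample-v0")

model = DQN.load("best_model")  # assumed checkpoint filename

obs = env.reset()
done = False
while not done:
    # Greedy action from the trained Q-network
    action, _states = model.predict(obs, deterministic=True)
    obs, reward, done, info = env.step(action)
```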
Code snippet credit: AirSim/PythonClient/Reinforcement_learning/drone_env. The reward function is as follows:
```python
import numpy as np


def _compute_reward(self):
    thresh_dist = 7  # maximum allowed deviation (m) from the road's central line
    beta = 1  # scaling factor used in the elided distance-based reward shaping
    x = -240
    y = 10
    z = 200
    # Waypoints tracing the central line of the city road
    pts = [
        np.array([x, y, z]),
        np.array([-350, y, z]),
        np.array([-350, y, 150]),
        np.array([-350, y, z - 100]),
        np.array([-350, y, z - 200]),
    ]
    ...  # elided: quad_pt (the drone's current position) is built here

    if self.state["collision"]:
        reward = -100
    else:
        # Perpendicular distance from the drone to the nearest road segment
        dist = 10000000
        for i in range(0, len(pts) - 1):
            dist = min(
                dist,
                np.linalg.norm(np.cross((quad_pt - pts[i]), (quad_pt - pts[i + 1])))
                / np.linalg.norm(pts[i] - pts[i + 1]),
            )
        if dist > thresh_dist:
            reward = -10
        ...  # elided: shaped reward for staying within the corridor

    done = 0
    if reward <= -10:  # episode ends on collision or on leaving the corridor
        done = 1
    return reward, done
```
The coordinates in pts trace the central line of the city road. The dist in the reward function is the perpendicular distance from the drone's realtime position to that central line: the magnitude of the cross product equals twice the area of the triangle spanned by the drone and the two segment endpoints, and dividing by the segment length yields the point-to-line distance. The distance computation is illustrated in the following picture:
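As a quick numeric sanity check of this formula, here is a standalone snippet; the drone position used is made up purely for illustration:

```python
import numpy as np

# First two waypoints of the road's central line (from pts above)
a = np.array([-240, 10, 200])
b = np.array([-350, 10, 200])
# Hypothetical drone position, offset 5 m from the line
quad_pt = np.array([-300, 10, 205])

# |(p - a) x (p - b)| is twice the area of the triangle (p, a, b);
# dividing by the base length |a - b| gives the perpendicular distance.
dist = np.linalg.norm(np.cross(quad_pt - a, quad_pt - b)) / np.linalg.norm(a - b)
print(dist)  # 5.0
```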
- A policy trained via cross-modal representations has been demonstrated by Rogerio Bonatti; its imitation learning data was generated for passing through drone racing obstacles. The path following task should also work when trained on that generated imitation learning data.
- Transfer the simulation algorithm to a real-world platform.