Skip to content

Chaitanya-01/P3-mini-drone-race

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sim2Real Quadcopter Perception Stack

  • In this project, a perception stack for DJI Tello EDU quadcopter was developed to enable precise navigation through multiple windows, whose texture and approximate dimensions are known.
  • To achieve this, simulation-to-reality (sim2real) techniques (such as domain randomization) were used to generate synthetic data from blender, creating a robust training set images and labels for the neural network model.
  • YOLOv8 was the chosen architecture model and it was trained to identify and segment the front window from a complex environment of multiple windows.
  • Once the segmentation mask was detected, the corners were extracted and a Perspective-n-Point (PnP) algorithm is applied to calculate the relative pose of the front window which is essential for guiding the quadcopter through the windows safely.
    (Check the full problem statements here project3a and project3b)

Steps to run the code

  • Install Numpy, Scipy, Matplotlib, blender python, pyquaternion, djitellopy, OpenCv, Ultralytics libraries before running the code.
  • Data generation:
    1. If you want to generate your own data:
      • Code for synthetic blender data generation is present in generate_data, open this file in the scripting tab of main.blend file.
      • Before running the script make sure the output file paths are set to appropriate locations in the compositing tab.
      • In the script loaded we are basically varying the camera location and lighting. Modify it according to your needs.
      • To generate images run the script file generate_data.py that is already loaded in blender.
      • Once data is generated use the code in data_augment after modifying appropriately to generate new warped and domain randomized images.
    2. If you want to use the dataset we generated, download it from this link (NOTE: size is nearly 4GB).
  • Training:
    • To train the YOLOv8 model run the window_seg_yolov8.ipynb file in YOLO Model folder and before doing that add the location to your dataset appropriately.
    • You can also use the already trained weights that gave an inference time of 33ms on Orin nano. The weights file last.pt is in YOLO Model folder.
  • Camera Calibration: Use the code in Calibration folder to get the camera intrinsics and distortion coefficients.
  • To fly the drone (no need to do any of the above steps for doing this step):
    • Set up the tello drone and NVIDIA Jetson Orin nano
    • On orin nano clone this repository
    • Go to the Code folder
    • Open the Wrapper.py file.
    • Connect to the network of the tello drone and run in Code folder the following command:
        python3 Wrapper.py
      

Report

For detailed description see the report here.

Plots and Animations

Data generated:

A sample set of images and labels generated from blender:

Data Augmentation pipeline:

The generated images are augmented using the following pipeline to create more diverse data.

Camera Calibration:

After proper camera calibration on the checkerboard.

Results:

The network prediction masks, corner inference, and pose estimated for a sample set of frames.

Deployment on the real drone:

Watch the test run on the real tello drone here (link1 and link2).

Collaborators

Chaitanya Sriram Gaddipati - cgaddipati@wpi.edu

Shiva Surya Lolla - slolla@wpi.edu

Ankit Talele - amtalele@wpi.edu