This project applies some common Data Visualization techniques based on Cactus data found on SEINet Website
- Classification based on flower reproducibility( Location wise) (Classification)
- Loc-wise heat map(Chloropleth) - (Heat Map)
- Living Temperature (here)
- Family Packing of Species (Zoomable packing)
- Word cloud based on Reproducibility and Occurrence (Flowering, Non Flowering)
- SEINet webpage was scraped using python code to get images and description
- We also collected locations (lattitude, longitude) for each species. Location for some species are missing (collected using Google Maps)
- For the Living Temperatures of the cacti, there were almost no data in the SEINet website. We needed to collect data missing using location (if a cactus grows in AZ, we took the temperature range for AZ)
- We used image classification-filter and rotation and flipping to enhance the data
- We calculated the population based on lat and long as the data is not available with state data
- Two distinct classes (1) Flower Reproducibility and (2) without Flower Reproducibility
- Images are classified by a DNN
- Classification Result displayed by clicking on the point
- Location Points based on the Latitude and Longitude, Description and Image on hovering the points
- Search by Auto complete
- Sequential color used to indicate the cacti population in various states of the countries
- Sequential is used because it is easier to identify the strength of the population
- Our state has the highest population
- Surprisingly East coast states like NY have some cacti too
-
We have divided the cacti into three types
- The pot-able cacti i.e which can be grown in the pots
- The creeper cacti, which really grows on groups
- The tall ones which are above 5ft
-
The classification is done manually by studying the species
-
The upper region is the high extreme temp the cacti can survive
-
And the lowers are low extremities
- Each cacti belongs to family and each family has many cacti in it
- To represent the family of the cacti we used zoomable circle packing
- This helps us to know the cacti and their 'brothers' and 'sisters' better
- Two fun and interactive word clouds, one for each class (flower producing and non producing)
- The size of text is based on the population of the family
- Prerequisite: need python installed in your machine.
- Clone or download the above mentioned github project.
- From the root folder of the project, run the following command:
python -m http.server 8000 --bind 127.0.0.1
- You can access it from theweb browser at: localhost:8000
- Youtube: https://youtu.be/L1cv--FboV0
- SEINet Website: We have collected all our cactus related data from this website. In order to collect specific details of each of the cactus, we scraped this website using python other programming techniques.
- Natural Earth - Datasets (for Geo Data): We downloaded the "Admin 1 States and Provinces" dataset in 'Shape' format and converted that into 'GeoJSON' format using QGIS software. Then we pruned that dataset to get the dataset which finally contains only information for the countries in North/Central/South - America.
- D3 was used for different types of interactive visualization in this project.
- Leaflet JS Was used to draw the choropleth map.
This was a class project done for the class CSE 578 Data Visualization - Spring 2020 by Dr. Sharon Hsiao