The goals / steps of this project are the following:
- Load the data set (see below for links to the project data set)
- Explore, summarize and visualize the data set
- Design, train and test a model architecture
- Use the model to make predictions on new images
- Analyze the softmax probabilities of the new images
- Summarize the results with a written report
Here I will consider the rubric points individually and describe how I addressed each point in my implementation.
Here is a link to my project code
Data Set Summary & Exploration
Here are some summary statistics of the traffic signs data set:
- The size of training set is 34799
- The size of the validation set is 4410
- The size of test set is 12630
- The shape of a traffic sign image is 32x32x3
- The number of unique classes/labels in the data set is 43
The images below show the distribution of the classes in the training, validation and test data sets. As we can see, the distribution is far from uniform and some classes are under-represented. However, the distribution appears similar across the training, validation and test sets.
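A minimal sketch of how such class-distribution plots can be produced, assuming the label arrays `y_train`, `y_valid` and `y_test` have already been loaded from the pickled data set (the variable names are illustrative):

```python
import numpy as np
import matplotlib.pyplot as plt

# Label arrays assumed to be loaded already from the pickled data set.
label_sets = {"training": y_train, "validation": y_valid, "test": y_test}

fig, axes = plt.subplots(1, 3, figsize=(15, 4))
for ax, (name, labels) in zip(axes, label_sets.items()):
    classes, counts = np.unique(labels, return_counts=True)
    ax.bar(classes, counts)
    ax.set_title(f"{name} set")
    ax.set_xlabel("class id")
    ax.set_ylabel("number of samples")
plt.tight_layout()
plt.show()
```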
The original images in the data set are color images of size 32x32. Based on results reported in the literature, I decided to convert the images to grayscale as the first step. This helps to reduce the dimensionality of the input space. The images are then normalized with a simple transformation that roughly centers the pixel values around zero:
image = (image-128.0)/128.0
Here is an example of a traffic sign image before and after grayscaling and normalization.
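A minimal sketch of this preprocessing step, assuming OpenCV is used for the grayscale conversion (the exact implementation in the notebook may differ slightly):

```python
import cv2
import numpy as np

def preprocess(images):
    """Convert a batch of 32x32x3 RGB images to normalized 32x32x1 grayscale."""
    out = []
    for image in images:
        gray = cv2.cvtColor(image, cv2.COLOR_RGB2GRAY)       # 32x32 grayscale
        gray = (gray.astype(np.float32) - 128.0) / 128.0      # center to roughly [-1, 1]
        out.append(gray[..., np.newaxis])                     # restore channel axis -> 32x32x1
    return np.array(out)
```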
Data Augmentation
As can be noted, the training set contains only around 35K images. In order to help the model generalize better, I decided to augment the data with samples generated from the training set itself. For this I implemented functions that apply translation, rotation, zooming and perspective transformations to the images (a sketch of these transformations is given below).
Here is an example of an original image and 4 more images generated with the described transformations from the original image.
The augmented dataset hence should be more robust to differences in the pose of the camera, centering and rotation in the images presented to the neural network.
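The sketch below illustrates the kind of random transformations used, built on OpenCV warps; the parameter ranges are illustrative assumptions, not the exact values used in the notebook:

```python
import cv2
import numpy as np

def random_augment(image):
    """Apply a random translation, rotation, zoom and mild perspective warp to one image."""
    h, w = image.shape[:2]

    # Translation: shift by up to +/-2 pixels in each direction (assumed range).
    tx, ty = np.random.uniform(-2, 2, size=2)
    M_trans = np.float32([[1, 0, tx], [0, 1, ty]])
    image = cv2.warpAffine(image, M_trans, (w, h), borderMode=cv2.BORDER_REPLICATE)

    # Rotation + zoom: rotate by up to +/-15 degrees, scale by 0.9-1.1 (assumed ranges).
    angle = np.random.uniform(-15, 15)
    scale = np.random.uniform(0.9, 1.1)
    M_rot = cv2.getRotationMatrix2D((w / 2, h / 2), angle, scale)
    image = cv2.warpAffine(image, M_rot, (w, h), borderMode=cv2.BORDER_REPLICATE)

    # Perspective: jitter the four corners by a couple of pixels.
    src = np.float32([[0, 0], [w, 0], [w, h], [0, h]])
    dst = src + np.random.uniform(-2, 2, size=src.shape).astype(np.float32)
    M_persp = cv2.getPerspectiveTransform(src, dst)
    return cv2.warpPerspective(image, M_persp, (w, h), borderMode=cv2.BORDER_REPLICATE)
```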
My final model consisted of the following layers:
Layer | Description |
---|---|
Input | 32x32x1 Grayscale image |
Convolution 5x5 | 1x1 stride, VALID padding, outputs 28x28x20 |
RELU | |
Max pooling | 2x2 stride, outputs 14x14x20 |
Convolution 5x5 | 1x1 stride, VALID padding, outputs 10x10x36 |
RELU | |
Max pooling | 2x2 stride, outputs 5x5x36 |
Fully connected | outputs 120 |
RELU | |
Fully connected | outputs 84 |
RELU | |
Fully connected | outputs 43 |
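For reference, here is a Keras-style sketch of this architecture; the layer sizes follow the table above, but the actual notebook may build the equivalent graph with lower-level TensorFlow ops:

```python
import tensorflow as tf

def build_model(dropout_rate=0.5):
    """LeNet-style network matching the layer table above (input 32x32x1, 43 classes)."""
    return tf.keras.Sequential([
        tf.keras.Input(shape=(32, 32, 1)),
        tf.keras.layers.Conv2D(20, 5, padding="valid", activation="relu"),  # -> 28x28x20
        tf.keras.layers.MaxPooling2D(pool_size=2, strides=2),               # -> 14x14x20
        tf.keras.layers.Conv2D(36, 5, padding="valid", activation="relu"),  # -> 10x10x36
        tf.keras.layers.MaxPooling2D(pool_size=2, strides=2),               # -> 5x5x36
        tf.keras.layers.Flatten(),                                          # -> 900
        tf.keras.layers.Dense(120, activation="relu"),
        tf.keras.layers.Dropout(dropout_rate),   # dropout on the first FC layer (training only)
        tf.keras.layers.Dense(84, activation="relu"),
        tf.keras.layers.Dropout(dropout_rate),   # dropout on the second FC layer
        tf.keras.layers.Dense(43),               # logits for the 43 classes
    ])
```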
To train the model, I used the Adam optimizer with 20 epochs and a batch size of 128. To help the model generalize better, I applied dropout to the two fully connected layers before the output layer, with a dropout probability of 0.5 during training.
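Continuing the same Keras-style sketch, the training setup would look roughly like this (the variable names for the preprocessed and augmented data are illustrative):

```python
model = build_model(dropout_rate=0.5)
model.compile(
    optimizer=tf.keras.optimizers.Adam(),
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    metrics=["accuracy"],
)
# X_train_aug / y_train_aug: preprocessed, augmented training data (illustrative names).
model.fit(X_train_aug, y_train_aug,
          epochs=20, batch_size=128,
          validation_data=(X_valid, y_valid))
```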
My final model results were:
- training set accuracy of 99.7%
- validation set accuracy of 98.3%
- test set accuracy of 96.8%
The initial architecture I started with was the LeNet architecture, which gave around 94% validation accuracy without any data augmentation. With data augmentation, the validation accuracy improved by about 2%. For further improvement, I added capacity to the model by increasing the number of feature maps in the first and second convolutional layers, which raised the validation set accuracy to about 98%.
The model seems to generalize reasonably well giving ~97% accuracy on the test set.
Here are five German traffic signs that I found on the web that seem reasonably similar to images in the training set.
Here are the results of the prediction:
Image | Prediction |
---|---|
Speed limit (30km/h) | Speed limit (30km/h) |
Speed limit (70km/h) | Speed limit (70km/h) |
Speed limit (80km/h) | Speed limit (80km/h) |
Go straight or right | Go straight or right |
Slippery road | Wild animals crossing |
The model was able to correctly classify 4 of the 5 traffic signs, which gives an accuracy of 80%. This is lower than the 96.8% accuracy achieved on the test set, although with only five images the comparison is not very meaningful.
Next we look at how confident the model was in making its predictions. For the first image, the model is very sure that this is a Speed limit (30km/h) sign (probability of 1.0):
Probability | Prediction |
---|---|
1.0 | Speed limit (30km/h) |
~0 | Speed limit (50km/h) |
~0 | Speed limit (70km/h) |
~0 | Speed limit (20km/h) |
~0 | Yield |
For the remaining four images the model is also very confident, with the most probable class having a probability of ~1.0 in each case; note that for the last image this confident prediction is incorrect.
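The top-five probabilities can be read directly off the softmax output, for example as in the sketch below (assuming the Keras-style model above and a preprocessed batch `new_images` containing the five web images):

```python
import tensorflow as tf

logits = model(new_images, training=False)   # new_images: preprocessed web images, shape (5, 32, 32, 1)
probs = tf.nn.softmax(logits, axis=-1)
top5 = tf.math.top_k(probs, k=5)             # top-5 probabilities and class indices per image
for i in range(len(new_images)):
    print("image", i)
    for p, c in zip(top5.values[i].numpy(), top5.indices[i].numpy()):
        print(f"  class {c}: {p:.3f}")
```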
The output of the first convolutional layer was visualized with the first traffic sign image from the web as the input. As can be seen from the figure below, the layer appears to activate on the edges of the speed-limit digits and the circular outline of the sign.
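A sketch of how such a visualization can be produced with the Keras-style model above (the helper actually used in the notebook may differ):

```python
import matplotlib.pyplot as plt
import tensorflow as tf

# Sub-model that exposes the output of the first convolutional layer (28x28x20 feature maps).
first_conv = tf.keras.Model(inputs=model.inputs, outputs=model.layers[0].output)

feature_maps = first_conv(new_images[:1], training=False).numpy()[0]  # first web image

fig, axes = plt.subplots(4, 5, figsize=(10, 8))
for i, ax in enumerate(axes.flat):
    ax.imshow(feature_maps[:, :, i], cmap="gray")
    ax.set_title(f"map {i}")
    ax.axis("off")
plt.tight_layout()
plt.show()
```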