[Demo Video](https://user-images.githubusercontent.com/47744559/235339233-676e6d52-94e4-428f-b068-b3072bb63795.mp4)
This project is a thesis submission for the degree of Bachelor of Science from Cairo University - Faculty of Engineering. The goal of this project was to develop a food recognition and detection system for visually impaired individuals.
The system is designed to assist visually impaired individuals in identifying food (including oriental food) in their surroundings through a camera embedded in smart glasses and machine learning algorithms. The system can recognize and classify a wide range of plates (54 dishes, covering both oriental and international cuisine).
- **Object recognition and classification:** The system uses deep learning algorithms to recognize and classify objects in real time.
- **Audio feedback:** The system provides audio feedback to the user, identifying the food plate(s) ahead and their respective locations.
- **User-friendly interface:** The simulation interface is built with Kivy running on a Raspberry Pi 4B (Raspbian). This setup simply simulates smart glasses with an embedded camera.
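The audio feedback step above can be sketched in a few lines: map each detected bounding box to a coarse position (left, center, right) based on which third of the frame its horizontal center falls in, then compose the sentence handed to a text-to-speech engine. The function names and the three-way split are illustrative assumptions, not taken from the project code.

```python
def localize(box, frame_width):
    """Map a bounding box to a coarse position: "left", "center", or "right".

    box is (x_min, y_min, x_max, y_max) in pixels; the horizontal center
    of the box decides which third of the frame it falls in.
    """
    x_center = (box[0] + box[2]) / 2
    if x_center < frame_width / 3:
        return "left"
    if x_center < 2 * frame_width / 3:
        return "center"
    return "right"


def announce(detections, frame_width):
    """Compose the announcement sentence for the text-to-speech engine.

    detections is a list of (label, box) pairs produced by the pipeline.
    """
    if not detections:
        return "No food plates detected."
    parts = [f"{label} on the {localize(box, frame_width)}"
             for label, box in detections]
    return "Detected " + ", ".join(parts) + "."
```

For example, `announce([("koshary", (10, 10, 100, 100))], 640)` yields "Detected koshary on the left." In the real system this string would be passed to whatever TTS backend runs on the Raspberry Pi.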
This project was developed by Mostafa Sherif, Youssef Sayed, Amir Salah, and myself under the supervision of Prof. Ibrahim Sobh and Prof. Ahmed Darwish. We would like to thank Valeo Egypt for selecting our project for the 2021 Valeo Mentorship Program.
If you have any questions or feedback, please contact karim-ibrahim or check the Thesis Book and Presentation section at the end of this file.
```
\_Datasets
   \_Custom Dataset
   \_food-101
\_master [REPO]
   \_classification weights
   \_dataset manipulation
   \_detection weights
   \_images
   \_unsuccessful trials
   \_visuals
   classification_inference.py
   classification_training.py
   classification_utils.py
   detection_utils.py
   pipeline.py
```
The dataset used was based on the Food-101 dataset, a balanced dataset with a total of 101,000 images (1,000 images per class across 101 classes). Dataset processing involved excluding unpopular dishes from Food-101 and collecting/adding oriental dishes alongside the remaining classes. A total of 54 classes was included in the final dataset:
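The class-exclusion step described above can be sketched as a small script that copies only the retained Food-101 class folders into the custom dataset; collected oriental-dish images would then be added as extra class folders. The `KEPT_CLASSES` names, paths, and function name here are hypothetical placeholders, not the project's actual selection.

```python
import shutil
from pathlib import Path

# Hypothetical subset: the real project kept 54 of the 101 Food-101 classes.
KEPT_CLASSES = {"pizza", "hamburger", "sushi"}  # ...plus the rest of the 54


def build_custom_dataset(food101_root, out_root, kept=KEPT_CLASSES):
    """Copy only the retained class folders from Food-101 into out_root.

    Food-101 ships as one directory per class under images/.  Returns the
    sorted list of class names that made it into the custom dataset.
    """
    src = Path(food101_root) / "images"
    dst = Path(out_root)
    dst.mkdir(parents=True, exist_ok=True)
    for class_dir in src.iterdir():
        if class_dir.is_dir() and class_dir.name in kept:
            shutil.copytree(class_dir, dst / class_dir.name, dirs_exist_ok=True)
    return sorted(p.name for p in dst.iterdir())
```

With 54 classes at roughly 1,000 images each, this yields the ~54k-image dataset the classifier is trained on.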
A FasterRCNN object detector is used to identify the bounding boxes of the plates of food (if any) and pass them to a MobileNetV2 classifier trained on the aforementioned custom dataset. The output is the location of each bounding box and the predicted label for that box.
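The two-stage flow described above (detect plates, crop each box, classify the crop) can be sketched as follows. The detector and classifier are passed in as callables standing in for the FasterRCNN and MobileNetV2 models; the function name, the score threshold, and the NumPy-style frame slicing are assumptions for illustration, not the project's `pipeline.py`.

```python
def run_pipeline(frame, detect, classify, score_threshold=0.5):
    """Two-stage inference: detect food plates, then classify each crop.

    detect(frame)  -> list of ((x_min, y_min, x_max, y_max), score) pairs;
                      stands in for the FasterRCNN detector.
    classify(crop) -> dish label; stands in for the MobileNetV2 classifier.
    Returns a list of (label, box) pairs for the audio-feedback stage.
    """
    results = []
    for box, score in detect(frame):
        if score < score_threshold:
            continue  # drop low-confidence detections
        x0, y0, x1, y1 = box
        crop = frame[y0:y1, x0:x1]  # cut the plate region out of the frame
        results.append((classify(crop), box))
    return results
```

Keeping the models behind plain callables like this also makes the pipeline easy to unit-test with stubs before wiring in the real (and slow) networks.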
The classifier was fine-tuned on the 54k-image dataset for 70 epochs, showing the following: