Product identification in a mall or a grocery store.

To identify products in a store, this repository utilizes a CLIP model that generates feature embeddings for various items. By leveraging cosine similarity, the model compares these embeddings against a database of pre-saved features from the store inventory to accurately identify products. The implementation specifically employs several versions of CLIP, including CLIP-ViT-L-14, CLIP-ViT-B-16, and CLIP-ViT-B-32, to generate these product embeddings.
However, prior preprocessing is essential. The process begins by detecting and identifying each person in the scene and assigning them a unique ID. Once individuals are identified, the next step is to capture images of the products (using GroundingDINO) they hold, as the model requires product images for recognition. This task can be challenging. Maybe GroundingDINO will perform better after removing the background, leaving only the human holding the products in the live image. Alternatively, the system can utilize cameras in shopping baskets or monitor items at checkout to obtain product images, thereby streamlining the identification process. For demonstration purposes, products in the frames captured by CCTV are manually cropped.
In Demo 1, inventory images are sourced from the Foodi-ML dataset, ensuring that each product image is of higher resolution than the zoomed CCTV images. In contrast, Demo 2 involves a similar technique for both the inventory images and the zoomed CCTV images, where both sets are zoomed in on the products, resulting in lower resolution. This approach highlights the differences in image quality and emphasizes the challenges faced in accurately identifying products from varying sources.
Please see all results from every CLIP model, below are just from CLIP-ViT-L-14.
These are Demo1 results. The first shows which two of the best inventory images match the product. And, the second one shows the similarity matrix of the product to all inventory items.
These are Demo2 results.

Requirements:

Please check right version in requirements.csv

Contacts:

borawarlokesh26@gmail.com

Credits:

foodi-ml-dataset
GroundingDINO
BackgroundMattingV2
Video used in GroundingDINO demo:
- https://www.youtube.com/watch?v=KMJS66jBtVQ&t=1s
Images used in Product-Identification demo:

Name		Name	Last commit message	Last commit date
Latest commit History 82 Commits
BackgroundMattingV2		BackgroundMattingV2
CCTV		CCTV
Demo1		Demo1
Demo2		Demo2
GroundingDINO		GroundingDINO
.gitattributes		.gitattributes
README.md		README.md
Save_lib_v.py		Save_lib_v.py
product_idn.py		product_idn.py
requirements.csv		requirements.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Product identification in a mall or a grocery store.

Requirements:

Contacts:

Credits:

About

Releases

Packages

Languages

LokeshBorawar/Product-Identification

Folders and files

Latest commit

History

Repository files navigation

Product identification in a mall or a grocery store.

Requirements:

Contacts:

Credits:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages