Industrial and Systems Engineering, Lamar University, Beaumont, Texas, USA 77705.
World Journal of Advanced Research and Reviews, 2025, 25(01), 1837-1844
Article DOI: 10.30574/wjarr.2025.25.1.0122
Received on 03 December 2024; revised on 21 January 2025; accepted on 24 January 2025
This research presents an artificial intelligence system that addresses critical operational challenges in modern retail environments. The integrated system combines computer vision and text recognition (OCR) to automate product identification, inventory tracking, and checkout processes. It was developed and implemented in response to the retail industry's pressing need for automation amid labor shortages and rising operational costs. The system achieved a 94.6% accuracy rate in product recognition while processing 50-60 items per second, enabling real-time inventory management and automated checkout. Field testing across multiple retail locations showed a 35% reduction in inventory management time, a 40% decrease in checkout wait times, and a 25% improvement in stock accuracy. The dataset covers 538 distinct products, including challenging categories such as liquor bottles and grocery items, and the system applies optimization techniques intended to keep performance consistent across diverse retail environments. Implementation of this system can lead to substantial operational cost savings, enhanced customer experience, and improved inventory accuracy. The research demonstrates how AI-driven automation can address the retail industry's current challenges while providing a scalable foundation for future innovations in retail operations management.
Product Recognition; Computer Vision; OCR; Deep Learning; Retail Automation; YOLO; Multi-modal Learning; Vector Databases
Saumil R Patel. Multi-Modal product recognition in retail environments: Enhancing accuracy through integrated vision and OCR approaches. World Journal of Advanced Research and Reviews, 2025, 25(01), 1837-1844. Article DOI: https://doi.org/10.30574/wjarr.2025.25.1.0122.
Copyright © 2025. The author(s) retain the copyright of this article. This article is published under the terms of the Creative Commons Attribution License 4.0.