At first glance, recognizing an apple or a tray of meat seems like a simple task. However, for a computer vision system, everyday supermarket elements—such as a plastic bag, a reflective tray, or a projected shadow—can quickly become obstacles that distort what the system perceives.
In retail environments, where accuracy is critical, these small physical details can lead to identification errors, inconsistent readings, or false detections. The result is misclassified products or, even worse, incorrect operations at self-checkout terminals.
Understanding how these factors interfere with vision models—and how modern computer vision solutions, like those developed by Grabit Retail, address them—is essential to achieving truly reliable AI at the point of sale.
1. Bags: the deceptively transparent enemy
Plastic bags, especially semi-transparent ones, are one of the greatest challenges for any computer vision system. The problem isn’t just transparency: folds, reflections, wrinkles, and glare all alter the way a camera interprets the texture and contours of the product inside.
An apple inside a bag may appear as a blurry mass, with muted colors or “ghost” areas that the algorithm fails to recognize. This ultimately results in identification errors.
To solve this, modern computer vision models must be trained with datasets that include real examples of bagged products, simulating different levels of shine, opacity, and deformation. At Grabit AI, these scenarios are incorporated into training so the system learns to identify the main object regardless of its wrapping.
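As a rough illustration of this kind of training data, the sketch below synthetically "bags" a product photo with a milky overlay, a blur, and a glare boost. The transform choices and value ranges are assumptions made for the example, not Grabit AI's actual pipeline, which would also rely on photos of genuinely bagged items.

```python
import random
from PIL import Image, ImageEnhance, ImageFilter

def simulate_bag(img: Image.Image) -> Image.Image:
    """Hypothetical augmentation that mimics a semi-transparent plastic bag:
    a milky overlay for opacity, a blur for the diffusing film, and a
    brightness shift for glare."""
    img = img.convert("RGB")
    # Milky overlay: blend with a near-white layer at a random opacity.
    overlay = Image.new("RGB", img.size, (245, 245, 245))
    img = Image.blend(img, overlay, alpha=random.uniform(0.10, 0.45))
    # Diffusion: plastic film softens edges and washes out texture.
    img = img.filter(ImageFilter.GaussianBlur(radius=random.uniform(0.5, 3.0)))
    # Glare: a mild random brightness boost stands in for reflections.
    img = ImageEnhance.Brightness(img).enhance(random.uniform(1.0, 1.3))
    return img

# Expanding a training set with "bagged" variants of each product photo:
# original = Image.open("apple.jpg")
# bagged_variants = [simulate_bag(original) for _ in range(5)]
```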
2. Shadows: the most subtle enemy
A partial shadow can completely change how a product appears to a vision model. Colors darken, edges distort, and textures disappear. In environments with multiple light sources—such as produce aisles or self-checkout areas—this happens constantly.
Advanced vision systems integrate preprocessing that automatically corrects exposure and contrast in the captured images. Models are also trained on images taken under different lighting conditions, including extreme scenarios, to ensure robustness against unavoidable variations in the environment.
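To make the training-side idea concrete, here is a minimal lighting-augmentation sketch: random brightness and contrast shifts plus a soft synthetic shadow over part of the frame. The specific ranges are arbitrary values chosen for the example, not settings from any particular production pipeline.

```python
import random
import numpy as np
from PIL import Image, ImageEnhance

def vary_lighting(img: Image.Image) -> Image.Image:
    """Hypothetical lighting augmentation: global brightness/contrast jitter
    plus a soft partial shadow with a gradual edge."""
    img = ImageEnhance.Brightness(img.convert("RGB")).enhance(random.uniform(0.6, 1.4))
    img = ImageEnhance.Contrast(img).enhance(random.uniform(0.7, 1.3))

    # Darken everything to one side of a random vertical split, fading in
    # gradually so the shadow edge is soft rather than razor sharp.
    arr = np.asarray(img).astype(np.float32)
    width = arr.shape[1]
    split = random.randint(width // 4, 3 * width // 4)
    ramp = np.ones(width, dtype=np.float32)
    ramp[split:] = np.linspace(1.0, random.uniform(0.4, 0.7), width - split)
    arr *= ramp[None, :, None]
    return Image.fromarray(arr.clip(0, 255).astype(np.uint8))
```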
3. Overlapping and occlusion: when customers complicate the scene
In real-world conditions, customers rarely place a product “as in the lab.”
Bags fold, trays stack, and sometimes one item rests partially on top of another. This creates occlusion, meaning parts of the object become hidden and the model only perceives fragments.
The AI must be able to infer the complete object from the fragments it can actually see. Grabit AI’s models learn to identify patterns even when shapes or edges are not fully visible.
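A common way to train for this is occlusion augmentation: hiding random patches of each training image so the model cannot rely on any single region being visible. A minimal sketch follows; the patch sizes, counts, and grey fill are illustrative assumptions.

```python
import random
import numpy as np
from PIL import Image

def random_occlusion(img: Image.Image, max_patches: int = 3) -> Image.Image:
    """Hypothetical occlusion augmentation (in the spirit of random erasing):
    cover a few rectangles so the model learns from partial views."""
    arr = np.array(img.convert("RGB"))
    h, w = arr.shape[:2]
    for _ in range(random.randint(1, max_patches)):
        ph = random.randint(h // 8, h // 3)
        pw = random.randint(w // 8, w // 3)
        y, x = random.randint(0, h - ph), random.randint(0, w - pw)
        # A flat grey block stands in for a hand, a bag, or another item.
        arr[y:y + ph, x:x + pw] = 127
    return Image.fromarray(arr)
```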
4. The importance of the physical environment and edge computing
The environment affects not only visual capture but also the speed and stability of recognition. If the AI relies on the cloud, every frame must travel over the network before it can be analyzed, adding latency at exactly the moments when changing lighting or reflective surfaces demand a rapid second look.
With edge computing, intelligence runs directly on the terminal, processing the image in real time and adjusting parameters to local conditions. This enables more accurate detection and an immediate response, without depending on connectivity and without being thrown off by the constantly changing environment of a supermarket.
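As a simplified picture of what on-terminal processing can look like, the loop below grabs frames from a local camera, applies a quick brightness correction tuned to the current scene, and runs a classifier entirely on the device. The model file, input layout, and correction logic are assumptions made for the sketch, not a description of a specific product.

```python
import cv2
import numpy as np
import onnxruntime as ort

# Hypothetical on-device setup: an ONNX classifier and the terminal's camera.
session = ort.InferenceSession("product_classifier.onnx")
input_name = session.get_inputs()[0].name
camera = cv2.VideoCapture(0)

while True:
    ok, frame = camera.read()
    if not ok:
        break

    # Local adaptation: compensate for the scene's current brightness before
    # inference, partly neutralizing shadows or glare at the terminal.
    gain = float(np.clip(128.0 / max(frame.mean(), 1.0), 0.5, 2.0))
    frame = cv2.convertScaleAbs(frame, alpha=gain, beta=0)

    # Preprocess and classify on the device itself: no network round trip.
    # (Assumes an NHWC float input; adjust to the actual model's layout.)
    blob = cv2.resize(frame, (224, 224)).astype(np.float32)[None] / 255.0
    scores = session.run(None, {input_name: blob})[0]
    print("predicted class:", int(scores.argmax()))
```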
Conclusion
Bags, trays, and shadows may seem like small details… but their impact on computer vision can be enormous. Overcoming them requires more than good algorithms: it demands training with real-world data, optimizing optical capture, and running intelligence as close as possible to where the action happens.
With this approach, computer vision doesn’t just “see” better—it learns to understand context: to distinguish a reflection from a piece of fruit, and a shadow from a physical shape.



