Applying t-SNE or UMAP to see where this image sits relative to its assigned class.
Recommendations for automated "cleaning" of datasets based on high-loss samples.
Is 148_1000.jpg a prototypical example of its class, or is it an outlier? 148_1000.jpg
(e.g., An animal, a vehicle, a medical scan?)
(e.g., ImageNet, a local project, or a specific website?) Applying t-SNE or UMAP to see where this
The rise of deep learning relies on massive datasets where individual image quality and annotation accuracy are often assumed rather than verified.
Testing how minor augmentations (rotations, color jitters) to this image change the model's confidence. 4. Conclusion or is it an outlier? (e.g.
Summary of how individual data point audits can lead to more robust AI models.