Visual prompts using data augmentations for robust out-of-distribution image classification

less than 1 minute read

Tunable Visual prompting using data augmentation techniques such as DeepAugment, CutMix and CutOut for robust predictions for vision-language models like CLIP. Performance evaluations led to 2-3% improvement on different corruptions of CIFAR-C dataset