: It proposes a FMSE (Feature-Superimposed Extraction) model based on Vision Transformers (ViT) for detecting defects in UAV (unmanned aerial vehicle) insulator images.