Transformer

HiFT: Hierarchical Feature Transformer for Aerial Tracking

Siamese-based visual tracking methods generally execute the classification and regression of the target object based on the similarity maps. However, existing works either solely employ a single map generated by the last convolutional layer which …