Tsinghua Science and Technology


person re-identification, non-local similarity, feature learning, semantic parts


In person re-IDentification (re-ID) task, the learning of part-level features benefits from fine-grained information. To facilitate part alignment, which is a prerequisite for learning part-level features, a popular approach is to detect semantic parts with the use of human parsing or pose estimation. Such methods of semantic partition do offer cues to good part alignment but are prone to noisy part detection, especially when they are employed in an off-the-shelf manner. In response, this paper proposes a novel part feature learning method for re-ID, that suppresses the impact of noisy semantic part detection through Supervised Non-local Similarity (SNS) learning. Given several detected semantic parts, SNS first locates their center points on the convolutional feature maps for use as a set of anchors and then evaluates the similarity values between these anchors and each pixel on the feature maps. The non-local similarity learning is supervised such that: each anchor should be similar to itself and simultaneously dissimilar to any other anchors, thus yielding the SNS. Finally, each anchor absorbs features from all of the similar pixels on the convolutional feature maps to generate a corresponding part feature (SNS feature). We evaluate our method with extensive experiments conducted under both holistic and partial re-ID scenarios. Experimental results confirm that SNS consistently improves re-ID accuracy using human parsing or pose estimation, and that our results are on par with state-of-the-art methods.


Tsinghua University Press