The VLAD-VGG-YFCC Dataset
We make available a set of state-of-the-art features associated with the images of the Yahoo Flickr Creative Commons (YFCC) dataset. The features are made available under the CC BY 4.0 license.
In particular, we make available the following features:
- VGG: These features were proposed by Simonyan and Zisserman for the the ILSVRC 2014 challenge and the respective system was ranked second best in the classification task. The features are extracted with a 16-layer CNN and we exploit the output of the last fully connected layer fc7, which consists of 4,096 dimensions. This layer is selected since its neurons convey a relatively high-level encoding of the image content that is well suited for finding semantically similar images for a query.
- VGG-PCA: Due to the significant cost of the full VGG features in large scale collections, such as YFCC, we also create a PCA compressed version of VGG. The PCA matrix is computed using a sample of 250,000 YFCC images, randomly selected from the collection. We make available representations consisting of the most significant 16 and 128 dimensions from the PCA representation of VGG, denoted respectively as VGG-16 and VGG-128.
- VLAD: These are improved VLAD vectors using SURF and the multiple vocabulary aggregation technique of Jegou and Chum with four visual vocabularies (of k=128 centroids). Their initial dimensions (32,768) were reduced with PCA+whitening. VLAD-1024, VLAD-128 and VLAD-16 correspond to retaining the 1024, 128 and 16 most significant dimensions respectively.
- host: mklab.iti.gr/public/vgg-vlad-yfcc
- port: 22
- username: vgg
- password: 123
A. Popescu, E. Spyromitros-Xioufis, S. Papadopoulos, H. Le Borgne, Y. Kompatsiaris. "Towards an Automatic Evaluation of Retrieval Performance with Large Scale Image Collections". In Proceedings of the MMCommons Workshop, co-located with ACM Multimedia, Brisbane, Australia