• Corpus ID: 17912704

Placing Images with Refined Language Models and Similarity Search with PCA-reduced VGG Features

@inproceedings{KordopatisZilos2016PlacingIW,
  title={Placing Images with Refined Language Models and Similarity Search with PCA-reduced VGG Features},
  author={Giorgos Kordopatis-Zilos and Adrian Daniel Popescu and Symeon Papadopoulos and Yiannis Kompatsiaris},
  booktitle={MediaEval Benchmarking Initiative for Multimedia Evaluation},
  year={2016}
}
We describe the participation of the CERTH/CEA-LIST team in the MediaEval 2016 Placing Task. We submitted five runs to the estimation-based sub-task: one based only on text by employing a Language Model-based approach with several refinements, one based on visual content, using geospatial clustering over the most visually similar images, and three based on a hybrid scheme exploiting both visual and textual cues from the multimedia items, trained on datasets of different size and origin. The… 

Figures and Tables from this paper

Leveraging EfficientNet and Contrastive Learning for Accurate Global-scale Location Estimation

This paper addresses the problem of global-scale image geolocation by proposing a mixed classification-retrieval scheme that leverages the EfficientNet architecture and introduces a new residual architecture that is trained with contrastive learning to map input images to an embedding space that minimizes the pairwise geodesic distance of same-location images.

MM-Locate-News: Multimodal Focus Location Estimation in News

A novel dataset called Multimodal Focus Location of News (MM-Locate-News) is introduced and state-of-the-art methods on the new benchmark dataset are evaluated and novel models to predict the focus location of news using both textual and image content are suggested.

A Transformer-based Framework for POI-level Social Post Geolocation

A transformer-based general framework is presented, which builds upon pre-trained language models and considers non-textual data, for social post geolocation at the POI level and demonstrates that three variants of the proposed framework outperform multiple state-of-art baselines in terms of accuracy and distance error metrics.

Geotagging Text Content With Language Models and Feature Mining

This work presents a highly accurate geotagging approach for estimating the locations alluded by text annotations based on refined language models that are learned from massive corpora of social media annotations, and demonstrates the consistently superior geotagged accuracy and low median distance error of the proposed approach.

Leveraging Selective Prediction for Reliable Image Geolocation

This paper proposes two novel selection functions that leverage the output probability distributions of geolocation models to infer localizability at different scales, and benchmarked against the most widely used selective prediction baselines, outperforming them in all cases.

Location Extraction from Social Media

Five “best-of-class” location extraction algorithms are evaluated using an OpenStreetMap database and a language model constructed from social media tags and multiple gazetteers, and a detailed failure analysis for the approaches is performed.

Knowledge-based and data-driven approaches for geographical information access

This thesis attempts to improve the effectiveness results of GeoIA tasks by improving the detection, understanding, and use of a part of the geographical and the thematic content of queries and documents with Toponym Recognition, Toponym Disambiguation and Natural Language Processing (NLP) techniques.

Extracting localized information from a Twitter corpus for flood prevention

The goal here is to get a first estimation of the quality and precision of the geographical information featured in the collected corpus, as well as its analysis from both spatial and topical perspectives.

References

SHOWING 1-10 OF 10 REFERENCES

CERTH/CEA LIST at MediaEval Placing Task 2015

The participation of the CERTH/CEA LIST team in the Placing Task of MediaEval 2015 is described, with the best results obtained when both visual and textual features are combined, using external data for training.

CEA LIST's Participation at MediaEval 2013 Placing Task

Results show that all modifications proposed this year have a positive effect, and a “standard” based only on the training data (cues (1)+(2)) has the poorest performance.

Scalable domain adaptation of convolutional neural networks

Convolutional neural networks (CNNs) tend to become a standard approach to solve a wide array of computer vision problems. Besides important theoretical and practical advances in their design, their

Finding locations of flickr resources using language models and similarity search

A two-step approach to estimate where a given photo or video was taken, using only the tags that a user has assigned to it, to improve substantially over either language models or similarity search alone.

Very Deep Convolutional Networks for Large-Scale Image Recognition

This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

The Placing Task at MediaEval 2015

The sixth edition of the Placing Task at MediaEval introduces two new sub-tasks: (1) locale-based placing, which emphasizes the need to move away from an evaluation purely based on latitude and

In-depth Exploration of Geotagging Performance using Sampling Strategies on YFCC100M

This paper proposes an evaluation methodology based on an array of sampling strategies over a reference test collection, and a way of quantifying and summarizing the volatility of performance measurements, and demonstrates that the proposed methodology could help capture the performance of geotagging systems in a comprehensive manner that is complementary to existing evaluation approaches.

The New Data and New Challenges in Multimedia Research

The rationale behind the creation of the YFCC100M, the largest public multimedia collection that has ever been released, is explained, as well as the implications the dataset has for science, research, engineering, and development.

Geotagging Social Media Content with a Refined Language Modelling Approach

A new geotagging approach is presented that can estimate the location of a post based on its text using refined language models that are learned from massive corpora of social media content.

The new data and new challenges in multimedia research. CoRR, abs

  • The new data and new challenges in multimedia research. CoRR, abs
  • 1503