#### Filter Results:

- Full text PDF available (43)

#### Publication Year

2004

2017

- This year (1)
- Last 5 years (34)
- Last 10 years (44)

#### Publication Type

#### Co-author

#### Journals and Conferences

#### Data Set Used

#### Key Phrases

Learn More

- Philippe Weinzaepfel, Jérôme Revaud, Zaïd Harchaoui, Cordelia Schmid
- 2013 IEEE International Conference on Computer…
- 2013

Optical flow computation is a key component in many computer vision systems designed for tasks such as action detection or activity recognition. However, despite several major advances over the last decade, handling large displacement in optical flow remains an open problem. Inspired by the large displacement optical flow of Brox and Malik, our approach,… (More)

- Zeynep Akata, Florent Perronnin, Zaïd Harchaoui, Cordelia Schmid
- 2013 IEEE Conference on Computer Vision and…
- 2013

Attributes are an intermediate representation, which enables parameter sharing between classes, a must when training data is scarce. We propose to view attribute-based image classification as a label-embedding problem: each class is embedded in the space of attribute vectors. We introduce a function which measures the compatibility between an image and a… (More)

- Francis R. Bach, Zaïd Harchaoui
- NIPS
- 2007

We present a novel linear clustering framework (DIFFRAC) which relies on a linear discriminative cost function and a convex relaxation of a combinatorial optimization problem. The large convex optimization problem is solved through a sequence of lower dimensional singular value decompositions. This framework has several attractive properties: (1) although… (More)

In large video collections with clusters of typical categories, such as “birthday party” or “flash-mob”, category-specific video summarization can produce higher quality video summaries than unsupervised approaches that are blind to the video category. Given a video from a known category, our approach first efficiently performs a temporal segmentation into… (More)

Learning to localize objects with minimal supervision is an important problem in computer vision, since large fully annotated datasets are extremely costly to obtain. In this paper, we propose a new method that achieves this goal with only image-level labels of whether the objects are present or not. Our approach combines a discriminative submodular cover… (More)

- Julien Mairal, Piotr Koniusz, Zaïd Harchaoui, Cordelia Schmid
- NIPS
- 2014

An important goal in visual recognition is to devise image representations that are invariant to particular transformations. In this paper, we address this goal with a new type of convolutional neural network (CNN) whose invariance is encoded by a reproducing kernel. Unlike traditional approaches where neural networks are learned either to represent data or… (More)

- Hongzhou Lin, Julien Mairal, Zaïd Harchaoui
- NIPS
- 2015

We introduce a generic scheme for accelerating first-order optimization methods in the sense of Nesterov, which builds upon a new analysis of the accelerated proximal point algorithm. Our approach consists of minimizing a convex objective by approximately solving a sequence of well-chosen auxiliary problems, leading to faster convergence. This strategy… (More)

- Miroslav Dudík, Zaïd Harchaoui, Jérôme Malick
- AISTATS
- 2012

We consider the minimization of a smooth loss with trace-norm regularization, which is a natural objective in multi-class and multitask learning. Even though the problem is convex, existing approaches rely on optimizing a non-convex variational bound, which is not guaranteed to converge, or repeatedly perform singular-value decomposition, which prevents… (More)

- Zaïd Harchaoui, Francis R. Bach
- 2007 IEEE Conference on Computer Vision and…
- 2007

We propose a family of kernels between images, defined as kernels between their respective segmentation graphs. The kernels are based on soft matching of subtree-patterns of the respective graphs, leveraging the natural structure of images while remaining robust to the associated segmentation process uncertainty. Indeed, output from morphological… (More)

- Philippe Weinzaepfel, Zaïd Harchaoui, Cordelia Schmid
- 2015 IEEE International Conference on Computer…
- 2015

We propose an effective approach for spatio-temporal action localization in realistic videos. The approach first detects proposals at the frame-level and scores them with a combination of static and motion CNN features. It then tracks high-scoring proposals throughout the video using a tracking-by-detection approach. Our tracker relies simultaneously on… (More)