• Publications
Multimedia content processing through cross-modal association
TLDR
This paper investigates different cross-modal association methods using the linear correlation model, and introduces a novel method for cross-modal association called Cross-modal Factor Analysis (CFA), which shows several advantages in analysis performance and feature usage.
Classification of general audio data for content-based retrieval
TLDR
This work describes a scheme that classifies audio segments into seven categories: silence, single speaker speech, music, environmental noise, multiple speakers' speech, simultaneous speech and music, and speech and noise. It shows that cepstral-based features such as the Mel-frequency cepstral coefficients (MFCC) and linear prediction coefficients (LPC) provide better classification accuracy than temporal and spectral features.
Video classification based on HMM using text and faces
TLDR
This work presents a novel method that applies Hidden Markov Models to face and text trajectories to classify a given video clip into predefined categories, e.g., commercial, news, sitcom, and soap.
Applications of Video-Content Analysis and Retrieval
TLDR
Technologies and applications for video-content analysis and retrieval are surveyed and specific examples of how to manage multimedia data are given.
Text detection for video analysis
TLDR
This work describes a method for detection and representation of text in video segments that can be applied to English as well as non-English text (such as Korean) with precision and recall of 85%.
Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View
TLDR
A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. The authors believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.
Computational prediction of methylation status in human genomic sequences.
TLDR
This work describes a computational pattern recognition method that is used to predict the methylation landscape of human brain DNA and can be applied both to CpG islands and to non-CpG island regions.
Identification of a novel PARP14‐TFE3 gene fusion from 10‐year‐old FFPE tissue by RNA‐seq
TLDR
The identification of a novel TFE3 fusion partner, PARP14, in chromosome band 3q21 expands the list of TFE3 translocation partner genes and re‐emphasizes the essential oncogenic role of TFE3 fusion proteins in this tumor.
Audio-visual talking face detection
TLDR
A novel method for finding the talking face using a latent semantic indexing (LSI) approach is presented. It is shown that the accuracy of the LSI method degrades gracefully in a noisy environment, whereas the correlation method simply fails in the presence of noise.
Motion recovery for video content classification
TLDR
The specification of a language for retrieving video based on spatial as well as motion characteristics is presented. The motion detection algorithm uses the motion compensation component of the MPEG video-encoding scheme and then computes trajectories for objects of interest.