Learn More
We report the results of our experiments in the context of the NEWS 2015 Shared Task on Transliteration. We focus on methods of combining multiple base systems , and leveraging transliterations from multiple languages. We show error reductions over the best base system of up to 10% when using supplemental translitera-tions, and up to 20% when using system(More)
No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise, except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization(More)
We apply Support Vector Machines to differentiate between 11 native languages in the 2013 Native Language Identification Shared Task. We expand a set of common language identification features to include cognate interference and spelling mistakes. Our best results are obtained with a classifier which includes both the cognate and the misspelling features,(More)
When text is translated from one language into another, sentiment is preserved to varying degrees. In this paper, we use Arabic social media posts as stand-in for source language text, and determine loss in sentiment predictability when they are translated into En-glish, manually and automatically. As benchmarks , we use manually and automatically(More)
We present a shared task on automatically determining sentiment intensity of a word or a phrase. The words and phrases are taken from three domains: general English, English Twit-ter, and Arabic Twitter. The phrases include those composed of negators, modals, and degree adverbs as well as phrases formed by words with opposing polarities. For each of the(More)
Sentiment analysis research has predominantly been on English texts. Thus there exist many sentiment resources for English, but less so for other languages. Approaches to improve sentiment analysis in a resource-poor focus language include: (a) translate the focus language text into a resource-rich language such as English, and apply a powerful English(More)
Morphological segmentation is an effective sparsity reduction strategy for statistical machine translation (SMT) involving morphologically complex languages. When translating into a segmented language , an extra step is required to deseg-ment the output; previous studies have de-segmented the 1-best output from the de-coder. In this paper, we expand our(More)
Morphological segmentation is an effective strategy for addressing difficulties caused by morphological complexity. In this study, we use an English-to-Arabic test bed to determine what steps and components of a phrase-based statistical machine translation pipeline benefit the most from segmenting the target language. We test several scenarios that differ(More)
INTRODUCTION Inflammatory myofibroblastic tumor is a very rare benign tumor in children that mimics malignant tumors in its aggressiveness locally and by the possibility of recurrence after surgical resection, and causing anemia of chronic disease, which is a decrease in hemoglobin 1 to 2 g/dL below normal level in a patient with chronic illness. CASE(More)
Homozygous sickle cell disease in the eastern province of Saudi Arabia is clinically mild. Circulating fetal hemoglobin levels of 16.0 +/- 7.4% were found in these anemic patients, but only 1.09 +/- 0.97% in their sickle trait parents. To determine whether these sickle cell anemia patients inherit an increased capacity to synthesize fetal hemoglobin, a(More)