Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 222,934,148 papers from all fields of science
Search
Sign In
Create Free Account
Truecasing
Truecasing is the problem in natural language processing (NLP) of determining the proper capitalization of words where such information is…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
4 relations
Armenian alphabet
Machine translation
Natural language processing
Outline of natural language processing
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2020
2020
An Efficient Architecture for Predicting the Case of Characters using Sequence Models
Gopi Ramena
,
D. Nagaraju
,
Sukumar Moharana
,
D. Mohanty
,
Naresh Purre
International Computer Science Conference
2020
Corpus ID: 211010629
The dearth of clean textual data often acts as a bottleneck in several natural language processing applications. The data…
Expand
2018
2018
Robust parfda Statistical Machine Translation Results
Ergun Biçici
Conference on Machine Translation
2018
Corpus ID: 53237345
We build parallel feature decay algorithms (parfda) Moses statistical machine translation (SMT) models for language pairs in the…
Expand
2017
2017
Reference ResToRinG CaPitaLiZaTion in # TweeTs NEBHI ,
Kamel
2017
Corpus ID: 73560594
The rapid proliferation of microblogs such as Twitter has resulted in a vast quantity of written text becoming available that…
Expand
2016
2016
TOWARDS TRANSLATION OF EDUCATIONAL RESOURCES USING GIZA++
I. Obradovic
2016
Corpus ID: 156054237
E-learning courses are becoming progressively popular. Thanks to the Internet and new technologies, education has never been more…
Expand
2015
2015
CAPITALIZATION AND PUNCTUATION RESTORATION FOR ROMANIAN LANGUAGE
Alexandru Caranica
,
H. Cucu
,
Andi Buzo
,
C. Burileanu
2015
Corpus ID: 44188283
The text generated by an Automatic Speech Recognition system is usually characterized by low reading intelligibility…
Expand
2014
2014
NTT-NAIST syntax-based SMT systems for IWSLT 2014
Katsuhito Sudoh
,
Graham Neubig
,
Kevin Duh
,
K. Hayashi
International Workshop on Spoken Language…
2014
Corpus ID: 17862685
This paper presents NTT-NAIST SMT systems for English-German and German-English MT tasks of the IWSLT 2014 evaluation campaign…
Expand
2014
2014
Improving the Extraction of Clinical Concepts from Clinical Records
Xiao Fu
,
S. Ananiadou
2014
Corpus ID: 17064794
Essential information relevant to medical problems, tests, and treatments is often expressed in patient clinical records with…
Expand
2013
2013
Multilingual MoKi: How to Manage Multilingual Ontologies in a Wiki
M. Dragoni
,
Chiara Ghidini
,
A. Bosca
Extended Semantic Web Conference
2013
Corpus ID: 3223157
In this paper we describe an extension of the MoKi tool able to support the management of multilingual ontologies. The…
Expand
2011
2011
Hierarchical Phrase-Based MT at the Charles University for the WMT 2011 Shared Task
Daniel Zeman
WMT@EMNLP
2011
Corpus ID: 115401970
We describe our experiments with hierarchical phrase-based machine translation for the WMT 2011 Shared Task. We trained a system…
Expand
2011
2011
Using Apertium linguistic data for tokenization to improve Moses SMT performance
Sergio Ortiz Rojas
,
S. C. Vaíllo
2011
Corpus ID: 34875802
This paper describes a new method to tokenize texts, both to train a Moses SMT system and to be used during the translation…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE