Share This Author
SNACS Annotation of Case Markers and Adpositions in Hindi
We present in-progress annotation of semantic relations expressed through adpositions and case markers in a Hindi corpus. We used the multilingual SNACS annotation scheme, which has been applied to a…
PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English
- Michael Kranzlein, Emma Manning, Siyao Peng, Shira Wein, Aryaman Arora, Nathan Schneider
- 23 October 2021
We present the Prepositions Annotated with Supsersense Tags in Reddit International English (“PASTRIE”) corpus, a new dataset containing manually annotated preposition supersenses of English data…
The SIGMORPHON 2022 Shared Task on Morpheme Segmentation
The SIGMORPHON 2022 shared task on morpheme segmentation challenged systems to decompose a word into a sequence of morphemes and covered most types of morphology: compounds, derivations, and…
Supervised Grapheme-to-Phoneme Conversion of Orthographic Schwas in Hindi and Punjabi
This work presents the first statistical schwa deletion classifier for Hindi, which relies solely on the orthography as the input and outperforms previous approaches.
SIGMORPHON–UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection
The 2022 SIGMORPHON–UniMorph shared task on large scale morphological inflection generation included a wide range of typologically diverse languages: 33 languages from 11 top-level language families:…
Bhāṣācitra: Visualising the dialect geography of South Asia
Bhāṣācitra is presented, a dialect mapping system for South Asia built on a database of linguistic studies of languages of the region annotated for topic and location data that serves as a new kind of interactive bibliography for linguists of South Asian languages.
Estimating the Entropy of Linguistic Distributions
In a replication of two recent information-theoretic linguistic studies, there is evidence that the reported effect size is over-estimated due to over-reliance on poor entropy estimators.
Computational Historical Linguistics and Language Diversity in South Asia
South Asia is home to a plethora of languages, many of which severely lack access to new language technologies. This linguistic diversity also results in a research environment conducive to the study…
For the Purpose of Curry: A UD Treebank for Ashokan Prakrit
We present the first linguistically annotated treebank of Ashokan Prakrit, an early Middle IndoAryan dialect continuum attested through Emperor Ashoka Maurya’s 3rd century BCE rock and pillar edicts.…
Quasi-Passive Lower and Upper Extremity Robotic Exoskeleton for Strengthening Human Locomotion
QLUE-REX will be a feasible modular-type wearable system that incorporates orthotic elbow, knee, and ankle joints effectively in either synchronous or asynchronous modes depending on the users’ needs, utilizing human-walking analysis, data sensing and estimation technology, and measurement of the electromyography signals of user’s muscles.