作者: Raveesh Motlani
DOI: 10.18653/V1/N16-2008
关键词: Resource poor 、 Computer science 、 World Wide Web 、 Sindhi 、 Artificial intelligence 、 Transliteration 、 Natural language processing 、 Language industry 、 Language technology
摘要: Sindhi, an Indo-Aryan language with more than 75 million native speakers 1 is a resourcepoor in terms of the availability technology tools and resources. In this thesis, we discuss approaches taken to develop resources for special focus on Sindhi. The major contributions work include raw annotated datasets, POS Tagger, Morphological Analyser, Transliteration Machine Translation System.