South and Southeast Asian Natural Language Processing
This website is dedicated to Natural Language Processing (NLP) and Computational Linguistics (CL) work on South and Southeast Asian languages. Here you will find online systems for these languages, computational resources, a comprehensive contact list of people working on these languages, and more.

South and Southeast Asian Region and its Languages

South Asia comprises the countries Afghanistan, Bangladesh, Bhutan, India, Maldives, Nepal, Pakistan and Sri Lanka. Southeast Asia, on the other hand, consists of Burma, Cambodia, Laos, Thailand, Vietnam, Malaysia, Brunei, East Timor, Indonesia, the Philippines and Singapore. The following table gives an idea of the population size and the number of living languages in South and Southeast Asia.
Table 1: Population and Number of Living Languages of South and Southeast Asia

The 2241 languages described in Table 1 belong to different language families, such as Indo-Aryan, Indo-Iranian, Dravidian, Sino-Tibetan, Austro-Asiatic, Kradai and Hmong-Mien. Together, South Asia and Southeast Asia account for 34.94% of the world's population. Some of the languages of these regions have a large number of native speakers: Hindi (5th largest by number of native speakers), Bengali (6th), Punjabi (12th), Tamil (18th), Urdu (20th), etc.