Resources Developed

LINGUISTIC RESOURCES DEVELOPED

 

1.  Modern Bengali text corpus of 5 million words from 65 disciplines

2.  A Unicode-compatible normalized version of the TDIL corpus

3.  Tokenized Bengali lexical database of 2 lakh words

4.  Frequency-based Bengali lexical database of 2 lakh words

5.  Hindi-Bengali Parallel Translation Corpus of 70 thousand sentences

6.  Hindi-Bengali POS-tagged corpus of 1 lakh sentences

7.  Hindi-Bengali chunked corpus of 1 lakh sentences

8.  News Text Bengali Corpus of 2 million words

9.  Lexical Dataset of 5000 words from medical domain

10.  POS-tagged Bengali monolingual corpus of 30K sentences

11.  Chunked Bengali monolingual corpus of 30K sentences

12.  Full list of Bengali prefixes with examples

13.  Lexical database of 5000 Bengali prefixed words

14.  Full list of Bengali adjectival suffixes with examples

15.  Complete list of annotated Bengali pronouns

16.  English-Bengali bilingual database of 1000 idiomatic expressions

17.  News Text Corpus of Indian English of 10 million words

18.  POS-tagged News Text Corpus of Indian English of 10 million words

19.  Hindi News Text Corpus of 2 million words

20.  Full list of Bengali nominal suffixes and case markers

21.  Full list of Bengali verbal suffixes and conjugation markers

22.  Full list of Bengali postpositions with examples

23.  Exhaustive list of Bengali place names

24.  Exhaustive list of Bengali Basic Vocabulary

25.  Full list of consonant Grapheme Clusters used in Bengali text

26.  A large database of Scientific and Technical Terms in Bengali

27.  A large list of Bengali ‘words of multitude’ with examples

28.  1.5 lakh multidisciplinary and benchmarked normalized Bengali sentences

29.  BIS POS-tagged 60K multidisciplinary and benchmarked Bengali sentences

30.  25K English-Bengali bilingual-bidirectional translational equivalents

31.  20K English-Bengali transliterated lexical database

32.  20K English-Bengali Scientific and Technical Terms and Multiword Expressions

33.  Complete database of Bengali Spatio-Temporal Expressions