LINGUISTIC RESOURCES DEVELOPED
1. Modern Bengali text corpus of 5 million words from 65 disciplines
2. A Unicode-compatible normalized version of the TDIL corpus
3. Tokenized Bengali lexical database of 2 lakh words
4. Frequency-based Bengali lexical database of 2 lakh words
5. Hindi-Bengali Parallel Translation Corpus of 70 thousand sentences
6. Hindi-Bengali POS-tagged corpus of 1 lakh sentences
7. Hindi-Bengali chunked corpus of 1 lakh sentences
8. News Text Bengali Corpus of 2 million words
9. Lexical Dataset of 5000 words from medical domain
10. POS-tagged Bengali monolingual corpus of 30K sentences
11. Chunked Bengali monolingual corpus of 30K sentences
12. Full list of Bengali Prefixes with examples
13. Lexical database of 5000 Bengali prefixed words
14. Full list of Bengali adjectival suffixes with examples
15. Complete list of annotated Bengali pronouns
16. English-Bengali bilingual database of 1000 idiomatic expressions
17. News Text Corpus of Indian English of 10 million words
18. POS-tagged News Text Corpus of Indian English of 10 million words
19. Hindi News Text Corpus of 2 million words
20. Full list of Bengali nominal suffixes and case markers
21. Full list of Bengali verbal suffixes and conjugation markers
22. Full list of Bengali Postpositions with examples
23. Exhaustive list of Bengali place names
24. Exhaustive list of Bengali Basic Vocabulary
25. Full list of consonant Grapheme Clusters used in Bengali text
26. A large database of Scientific and Technical Terms in Bengali
27. A large list of Bengali ‘words of multitude’ with examples
28. 1.5 lakh multidisciplinary and benchmarked normalized Bengali sentences
29. BIS POS-tagged 60K multidisciplinary and benchmarked Bengali sentences
30. 25K English-Bengali bilingual-bidirectional translational equivalents
31. 20K English-Bengali transliterated lexical database
32. 20K English-Bengali Scientific and Technical Terms and Multiword Expressions
33. Complete database of Bengali Spatio-Temporal Expressions