Text independent root word identification in Hindi language using natural language processing
Abstract
In this paper, we attempt to parse Hindi words and identify the root word of an inflected word using natural language processing (NLP) techniques. Stemming is a heuristic process that chops off the ends of words to find the root word, and it often includes the removal of derivational affixes. It is used to improve retrieval effectiveness and to reduce the size of index files. The proposed approach can stem words that are not previously stored in a database. A major application of this work is interactive learning of the Hindi language and its grammar. It is also useful for building natural language translators for Hindi.
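As a rough illustration of the suffix-chopping idea described above, the following minimal sketch strips the longest matching suffix from a Hindi word without consulting any stored word list. The suffix list `SUFFIXES`, the function name `stem`, and the `min_stem_len` threshold are assumptions made for this example only; they are not the rule set proposed in this paper.

```python
# Illustrative (hypothetical) list of common Hindi inflectional endings.
# This is an assumption for the sketch, not the paper's actual rule set.
SUFFIXES = ["ियों", "ियाँ", "ाओं", "ों", "ें", "ाँ", "ी", "े", "ा", "ो"]

# Try longer suffixes first so that "ियाँ" is preferred over "ाँ".
_SUFFIXES_BY_LENGTH = sorted(SUFFIXES, key=len, reverse=True)


def stem(word: str, min_stem_len: int = 2) -> str:
    """Return the word with its longest matching suffix removed.

    The word is returned unchanged if no suffix matches or if stripping
    would leave fewer than min_stem_len code points.
    """
    for suffix in _SUFFIXES_BY_LENGTH:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem_len:
            return word[: -len(suffix)]
    return word


if __name__ == "__main__":
    for w in ["किताबें", "नदियाँ", "घर"]:
        print(w, "->", stem(w))
```

Because the sketch relies only on the word's ending, it also applies to words that have never been seen before. The result is a stem rather than a guaranteed dictionary form, so a fuller system would need additional rules to recover the actual root word.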