Please use this identifier to cite or link to this item: http://nopr.niscpr.res.in/handle/123456789/62991
metadata.dc.identifier.doi: https://doi.org/10.56042/bvaap.v31i2.6388
Title: Pre-processing in the context of natural language resources for Hindi lessons
Authors: Kishanpuri, Anjana
Yadav, Dhirendra
Petkar, Harshalata
Issue Date: Dec-2023
Publisher: NIScPR-CSIR,India
Abstract: Nowadays, text data in digital format of online and offline mode is increasing rapidly, it becomes difficult to manage and retrieve the text documents. Natural language processing (NLP) is highly dependent on efficient pre-processing of text documents such as archival, retrieval, query response, text summarization, machine translation, etc.This specialized area of natural language processing has led inspired researchers to do apply machine learning algorithms to automatically pre-process documents based on languages, developing methods to process documents based on their context.Under the present research paper, a pre-processing application has been proposed in the context of natural language processing. Pre-processing is an important function in text mining, natural language processing (NLP), and information retrieval (IR). However, no raw text data can be worked on without pre-processing. Text pre-processing ensures optimum results when executed properly. In the field of natural language processing, text pre-processing is used to extract interesting and knowledgeable information from unstructured textual data.This paper proposes a pre-processing application for the Hindi legal domain to provide a comprehensive and useful understanding of important linguistic processes such as normalization, tokenization, stop word removal and stemming.
Page(s): 135-141
ISSN: 0975-2412 (Online); 0971-7706 (Print)
Appears in Collections:BVAAP Vol.31(2) [December 2023]

Files in This Item:
File Description SizeFormat 
BVAAP 31(2) 135-141.pdf666.68 kBAdobe PDFView/Open


Items in NOPR are protected by copyright, with all rights reserved, unless otherwise indicated.