IIT Kharagpur making Sanskrit accessible with their Artificial Intelligence-based system
India Today
Researchers at IIT Kharagpur are making Sanskrit accessible with their Artificial Intelligence-based system by processing Sanskrit texts.
There has been a renewed interest in Sanskrit since the announcement of NEP 2020. Various academic institutions both at school education as well as higher education are adopting various approaches for improving the reach of the language through training programs, research, and outreach initiatives. While various digital resources have improved the accessibility and use of world languages and even regional languages, Sanskrit presents unique challenges in automated computational processing. In addition to the sheer volume and diversity, both stylistic and chronological, found in these texts, the linguistic peculiarities expressed by the language, pose several challenges in making these works accessible to the world. Researchers led by Dr. Pawan Goyal have developed a digital infrastructure for the efficient processing of Sanskrit texts, by effectively combining state-of-the-art machine learning techniques and traditional linguistic knowledge from Sanskrit. The proposed framework is based on Energy-based models and it enables the encoding of relevant linguistic information as constraints. “Processing of Sanskrit texts poses several challenges owing to the high lexical productivity of the words, free word order in poetry, euphonic assimilation of sounds at the word boundaries and phonemic orthography followed in writing. Keeping these in mind, we proposed a generic graph-based framework that takes advantage of the free word order nature of the language. Further, we make use of linguistic insights from the traditional Sanskrit grammar for learning the feature function and applying the relevant constraints.” explained Dr. Goyal. He further adds, “Our proposed framework substantially reduces the training data requirements to as low as 10%, as compared to that of the neural state-of-the-art models. In all the Sanskrit-related tasks discussed in the work, we either achieve state-of-the-art results or ours is the only data-driven solution for those tasks,”More Related News