How are Indian languages faring in the age of AI and language models?

The Hindu

Tuesday, May 30, 2023 10:54:20 AM UTC

As large language models like ChatGPT find more applications around the world, their adoption also passively spreads a prejudice against languages other than English, including Indian languages. Some researchers are working to remedy this.

“Sanskrit suits the language of computers and those learning artificial intelligence learn it,” Indian Space Research Organisation chairman S. Somanath said at an event in Ujjain on May 25. His was the latest in a line of statements exalting Sanskrit and its value for computing, made without any evidence or explanation.

But beyond Sanskrit, how are other Indian languages faring in the realm of artificial intelligence (AI), at a time when its language-based applications have taken the world by storm?

The answer is a mixed bag. There is some passive discrimination even as the languages’ fates are buoyed by public-spirited research and innovation.

Behind both seemingly intelligent chatbots and art-making computers, algorithms and data-manipulation techniques turn linguistic and visual data into mathematical objects (like vectors), and combine them in specific ways to produce the desired output. This is how ChatGPT is able to respond to your questions.

When working with a language, a machine first has to break a sentence or a word down into little bits in a process called tokenisation. These are the bits that the machine’s data-processing model will work with. For example, “there’s a star” can be tokenised to “there”, “is”, “a”, and “star”.
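As a rough illustration of that step, here is a minimal sketch of a word-level tokeniser in Python. It is not how ChatGPT or any production system tokenises text; the `CONTRACTIONS` table and the `tokenise` function are hypothetical names invented for this example, and the rule that "there's" expands to "there is" is an assumption made to match the example above.

```python
import re

# Hypothetical contraction table, for illustration only.
# A real tokeniser would need far more rules than this.
CONTRACTIONS = {"there's": "there is"}


def tokenise(text):
    """Split a sentence into lowercase word tokens."""
    # Normalise case and the curly apostrophe before matching.
    text = text.lower().replace("’", "'")
    # Expand known contractions so "there's" yields "there" and "is".
    for short, full in CONTRACTIONS.items():
        text = text.replace(short, full)
    # Keep runs of letters; spaces and punctuation act as separators.
    return re.findall(r"[a-z]+", text)


print(tokenise("there’s a star"))  # ['there', 'is', 'a', 'star']
```

The list it returns is the set of "bits" that a model would then go on to process.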

There are several tokenisation techniques. A treebank tokeniser breaks up words and sentences based on the rules that linguists use to study them. A subword tokeniser allows the model to learn a common word and the modifications to that word separately, such as “dusty” and “dustier”/“dustiest”.
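To make the subword idea concrete, here is a small sketch that splits a word into pieces by greedily matching the longest entry in a vocabulary. The vocabulary `VOCAB` and the function `split_subwords` are hypothetical and hand-written for this example; real subword tokenisers learn their vocabularies from large amounts of text, and may split these words differently.

```python
# Hypothetical, hand-picked subword vocabulary (an assumption for this sketch).
VOCAB = {"dust", "y", "ier", "iest"}


def split_subwords(word, vocab=VOCAB):
    """Greedily split a word into the longest pieces found in vocab."""
    pieces = []
    start = 0
    while start < len(word):
        # Try the longest candidate first, then progressively shorter ones.
        for end in range(len(word), start, -1):
            if word[start:end] in vocab:
                pieces.append(word[start:end])
                start = end
                break
        else:
            # No vocabulary entry matches here: fall back to one character.
            pieces.append(word[start])
            start += 1
    return pieces


for word in ["dusty", "dustier", "dustiest"]:
    print(word, "->", split_subwords(word))
```

All three words share the piece "dust", so whatever a model learns about that piece carries over to each of its variants.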

OpenAI, the maker of ChatGPT and the GPT series of large language models, uses a type of subword tokeniser called byte-pair encoding (BPE). Here’s an example of the OpenAI API using this on a statement by Gayatri Chakravorty Spivak:

