
Has Sarvam AI really beaten ChatGPT, Google Gemini? YES and NO
India Today
Sarvam AI has been at the centre of many discussions on social media in the past couple of days. The Indian AI company has outperformed ChatGPT and Google Gemini in some key areas. So, can we say Sarvam AI has beaten the best, including Google Gemini and ChatGPT? Yes and no.
It is not every day that an Indian startup makes a buzz by beating the best from the world. But when that happens, we over here in India can’t stop talking about it. That is what is happening with Sarvam AI, a company that has come out with two new tools called Vision and Bulbul. These tools are so good that they even beat ChatGPT, Anthropic Claude and Google Gemini. Result is headlines such as this one you might have read here on India Today: “India's Sarvam AI beats Google Gemini and ChatGPT, the world is impressed.”
But now that there is a conversation around Sarvam AI, different people are weighing in on the matter. Is it really the case that Sarvam AI is better than Gemini or ChatGPT?
Let’s take a closer look at the whole Sarvam AI beating ChatGPT saga. Because there are nuances involved. Sarvam AI does beat Google Gemini and ChatGPT. And yet, it also does not. Confused, let us explain.
On February 5, Sarvam AI cofounder Pratyush Kumar announced that the startup’s Sarvam Vision outperformed every major AI model on the olmOCR-Bench. This benchmark measures optical character recognition (OCR), that is the ability of an AI model to recognise and understand images, scanned documents and other visual elements. The benchmark measures whether the AI models can identify and understand complex fonts, handwriting, and other data from such inputs.
In olmOCR-Bench, Sarvam Vision had an accuracy of 84.3 per cent. The indigenous AI model outperformed the likes of OpenAI’s ChatGPT, Google’s Gemini 3 Pro and even China’s DeepSeek OCR v2. On OmniDocBench v1.5, Sarvam Vision scored 93.28 per cent. The AI particularly performed well with complex layouts, technical tables and mathematical formulas.
The Vision is apparently incredible at doing OCR on Indic scripts. That is probably because it has been trained on Indian languages and Indian way of writing. It is better familiar with Indian scripts, including scripts of regional languages. While ChatGPT, Gemini and others too have good OCR capabilities, they are not fine-tuned for Indic scripts in ways Sarvam Vision is.

Reddit is exploring biometric verification methods such as Face ID and Touch ID to ensure users are real humans, not bots, while pledging to maintain the platform's tradition of anonymity. CEO Steve Huffman said the company is planning to address the rising influence of AI-generated content and protect authentic user engagement.

In a push towards more inclusive school environments, the Central Board of Secondary Education has rolled out fresh directives on menstrual hygiene across its affiliated institutions. The move comes after a landmark ruling by the Supreme Court of India that places menstrual health within the framework of fundamental rights.











