Under fire, Sarvam AI co-founder says worries about Indic GenAI model premature | Bengaluru News

Share the Reality


Under fire, Sarvam AI co-founder says worries about Indic GenAI model premature

Chennai/Bengaluru: India’s latest home-grown generative AI language model, Sarvam-M, has drawn fire from sections of the developer community for what they describe as “under-whelming” performance. But Pratyush Kumar, co-founder of Sarvam AI, insists the scepticism is premature and betrays a misunderstanding of how AI frontier models mature.“The ecosystem is early and people are worried too early,” he tells TOI. “We are scrambling amongst ourselves when the world moves fast. We want to create an AI ecosystem where more people can positively collaborate.”Released this month, the relatively small 24-billion-parameter Sarvam-M model was trained to reason across ten Indian languages while tackling maths and coding tasks. Kumar says benchmarks on Hugging Face (a platform and open-source library primarily used for leveraging machine learning models) show the model matching or outscoring popular open-source rivals (like Meta’s Llama, Mistral Small and Gemma 3) in mathematics, programming and Indic-language comprehension. “With this we want to show that we cracked post-training (process of refining and optimising a machine learning model after its initial training phase) problems and our methodology is comparable with other models,” he explains. “We open-sourced this because we want to show that such a model can be built and encourage other people to do it.”Much of the social-media push-back has centred on relatively modest early-stage download numbers and the perception that Sarvam-M offers few breakthrough capabilities. Kumar counters that India’s sovereign-AI ambitions demand more than one blockbuster release. “These things involve both scientific explorations and resource consumption,” he says. “I think we are on the path to building state-of-the-art models.“Sarvam AI is the first startup chosen to build a frontier model under the government’s IndiaAI Mission, which is funding compute, data and research partnerships to reduce reliance on overseas platforms. Although the latest model is a private effort separate from the IndiaAI Mission, Kumar says the initiative will benefit everyone.He declines to give a timeline for the AI Mission-backed foundation model, noting that the company has yet to receive graphics-processing units (GPUs) from government suppliers. “We will open-source the foundational model,” he says, but warns that schedules depend on hardware access and collaborative research cycles.Industry weighs inSeasoned AI practitioners say early criticism overlooks the scale of what Sarvam is attempting. “Building a 24-billion-parameter model in India is not easy, especially when deep research isn’t encouraged in most universities or companies,” says Jaspreet Bindra, co-founder of consultancy AI&Beyond. “Sarvam-M demonstrates robust multilingual reasoning by supporting ten Indian languages – no other model in the world has such a strong Indic component.” Sourabh Deorah, CEO & co-founder of AdvantageClub.ai, an AI-powered employee engagement and rewards platform, says that as someone deeply involved in machine learning, he understands how challenging it is to create a 24-bn parameter model that not only handles reasoning tasks like math and programming but also delivers high-quality performance across multiple Indian languages – many of which have long been underserved in the AI space. Piyush Goel, CEO & founder of IT consulting company Beyond Key, says that the new model’s potential to drive agentic AI in education, healthcare, and automation is exciting. Agentic AI is a type of AI that makes decisions and takes actions based on context and objectives without constant human intervention. Karthikeyan G, senior director of engineering architecture at software company Ascendion, says Sarvam-M’s architecture will enable AI agents to interact among themselves (to take complex decisions) thanks to the standardised protocols being used. This will be crucial for the next stage of the AI wave.





Source link

Leave a Reply

Your email address will not be published. Required fields are marked *