
Government-backed national initiative to create sovereign foundational AI models for Indian languages, BharatGen will cover all 22 scheduled Indian languages by June 2026. Currently, the models span nine languages: Hindi, Marathi, Tamil, Malayalam, Bengali, Punjabi, Gujarati, Telugu, and Kannada. The roadmap sets a milestone of expanding coverage to 15 languages, including Assamese, Maithili, Nepali, Odia, Sanskrit, and Sindhi, by December 2025, before completing the list in mid-2026.
BharatGen operates under the Department of Science and Technology’s National Mission on Interdisciplinary Cyber-Physical Systems (NM-ICPS). The TIH Foundation for IoT and IoE at IIT Bombay serves as the central hub, managing program execution, national academic collaboration, and ecosystem partnerships for compute, data, and talent. IITM Pravartak Technologies Foundation at IIT Madras functions as an implementation partner, focusing on governance, security, and media-related applications.
The AI models address multiple modalities such as large language models for text, text-to-speech, automatic speech recognition, and vision-language systems. According to the government, BharatGen has already developed pilot applications in agriculture, governance, and defence, which are planned to be rolled out nationwide once fully deployed.
Union Minister Dr Jitendra Singh stated that while BharatGen is in its pilot phase and not yet publicly available, the plan is to deploy it across all states and districts. Partnerships with additional research institutions in Karnataka may also be explored.
See What’s Next in Tech With the Fast Forward Newsletter
Tweets From @varindiamag
Nothing to see here - yet
When they Tweet, their Tweets will show up here.