AI Costs Fall, Demand Rises
By 2030, running inference on trillion-parameter AI models will cost over 90% less than in 2025, according to Gartner.
This dramatic cost reduction will be driven by advances in semiconductors, infrastructure efficiency, and optimized model design.
Improved chip utilization, inference-specific silicon, and edge computing will further accelerate efficiency gains across AI workloads.
As a result, large language models could become up to 100 times more cost-efficient than early versions developed in 2022.
However, lower token costs will not necessarily translate into cheaper AI for enterprises using advanced systems.
Agentic AI models consume significantly more tokens per task, up to 30 times more, offsetting gains from reduced unit costs.
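The offset is simple arithmetic: even a 90% drop in per-token price is swamped by a 30x increase in tokens consumed. A minimal sketch, with hypothetical dollar figures chosen only to illustrate the article's projected ratios:

```python
# Illustrative arithmetic (hypothetical prices; only the ratios come from the
# article): a ~90% drop in per-token cost can still mean higher total spend
# when an agentic workflow consumes ~30x more tokens per task.

cost_per_million_tokens_2025 = 10.00  # hypothetical 2025 unit price, USD
cost_per_million_tokens_2030 = cost_per_million_tokens_2025 * 0.10  # 90% cheaper

tokens_per_task_simple = 2_000                          # single-call task (assumed)
tokens_per_task_agentic = tokens_per_task_simple * 30   # agentic multiplier

cost_simple_2025 = tokens_per_task_simple / 1e6 * cost_per_million_tokens_2025
cost_agentic_2030 = tokens_per_task_agentic / 1e6 * cost_per_million_tokens_2030

print(f"2025 simple task:  ${cost_simple_2025:.4f}")
print(f"2030 agentic task: ${cost_agentic_2030:.4f}")
```

Under these assumed numbers, the 2030 agentic task costs three times the 2025 simple task despite the 90% cheaper tokens.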
Ultimately, value will shift toward platforms that intelligently route workloads, using smaller, domain-specific models for routine tasks while reserving high-cost frontier models for complex, high-value reasoning.
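The routing idea described above can be sketched in a few lines. Everything here is hypothetical: the model names, prices, and the complexity heuristic are stand-ins for whatever signal a real platform would use.

```python
# A minimal sketch of workload routing (all names, prices, and heuristics are
# hypothetical): send routine tasks to a cheap domain-specific model and
# reserve the expensive frontier model for complex reasoning.

from dataclasses import dataclass

@dataclass
class Model:
    name: str
    cost_per_million_tokens: float  # USD, assumed prices

SMALL_MODEL = Model("domain-small", 0.50)
FRONTIER_MODEL = Model("frontier-xl", 15.00)

def estimate_complexity(task: str) -> float:
    """Toy heuristic: longer, multi-step prompts score as more complex."""
    steps = task.count("then") + task.count("and")
    return min(1.0, len(task) / 500 + 0.2 * steps)

def route(task: str, threshold: float = 0.5) -> Model:
    """Route routine tasks to the small model, hard ones to the frontier model."""
    return FRONTIER_MODEL if estimate_complexity(task) >= threshold else SMALL_MODEL

print(route("Summarize this memo.").name)
print(route("Plan a migration, then refactor and validate " * 5).name)
```

The design point is that the router, not the model, becomes the locus of cost control: unit prices are fixed by the provider, but token volume per task is a platform decision.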