All posts tagged: ai models

Can tech companies learn to love cheaper AI models? 

Can tech companies learn to love cheaper AI models? 

The AI boom has been built on a basic assumption: Bigger models are more powerful, and the most powerful models win. Now, the industry is about to learn what happens if that assumption starts to break.   Mounting costs have already pressured users to give smaller and cheaper models a second look. This cost-conscious model-shopping is new and it’s unclear how it will affect the industry, but the impact is likely to be significant.  One prediction, laid out best by Coinbase co-founder Brian Armstrong, is that it will result in the vast majority of tasks shifting to cheaper models.  “[D]emand for intelligence is near infinite, but 80% of workloads will be running on 99% cheaper models within 12-18 months,” Armstrong wrote on X. “20% of workloads will still run on latest gen models where IQ maxing is important.”  It’s hard to overstate what a significant shift it will be for the AI industry if Armstrong’s prediction comes true.   Before now, most AI companies have competed on quality, which has meant defaulting to the most advanced available model. If those same jobs can be handled by cheaper models without affecting quality, it would mean a massive shift in the economics of AI. And critically, much of the savings …

Multiverse Computing pushes its compressed AI models into the mainstream

Multiverse Computing pushes its compressed AI models into the mainstream

With private company defaults running at upwards of 9.2% — the highest rate in years — VC firm Lux Capital recently advised companies relying on AI to get their compute capacity commitments confirmed in writing. With financial instability rippling through the AI supply chain, Lux warned, a handshake agreement isn’t enough. But there’s another option entirely, which is to stop relying on external compute infrastructure altogether. Smaller AI models that run directly on a user’s own device — no data center, no cloud provider, no counterparty risk — are getting good enough to be worth considering. And Multiverse Computing is raising its hand. The Spanish startup has so far kept a lower profile than some of its peers, but as demand for AI efficiency grows, this is changing. After compressing models from major AI labs including OpenAI, Meta, DeepSeek and Mistral AI, it has launched both an app that showcases the capabilities of its compressed models and an API portal — a gateway that lets developers access and build with those models — that makes …

Mistral bets on ‘build-your-own AI’ as it takes on OpenAI, Anthropic in the enterprise

Mistral bets on ‘build-your-own AI’ as it takes on OpenAI, Anthropic in the enterprise

Most enterprise AI projects fail not because companies lack the technology, but because the models they’re using don’t understand their business. The models are often trained on the internet, rather than decades of internal documents, workflows, and institutional knowledge.  That gap is where Mistral, the French AI startup, sees opportunity. On Tuesday, the company announced Mistral Forge, a platform that lets enterprises build custom models trained on their own data. Mistral announced the platform at Nvidia GTC, Nvidia’s annual technology conference, which this year is focused heavily on AI and agentic models for enterprise. It’s a pointed move for Mistral, a company that has built its business on corporate clients while rivals OpenAI and Anthropic have soared ahead in terms of consumer adoption. CEO Arthur Mensch says Mistral’s laser focus on the enterprise is working: The company is on track to surpass $1 billion in annual recurring revenue this year. A big part of doubling down on enterprise is giving companies more control over their data and their AI systems, Mistral says.  “What Forge does is …

Cohere launches a family of open multilingual models

Cohere launches a family of open multilingual models

Enterprise AI company Cohere launched a new family of multilingual models on the sidelines of the ongoing India AI Summit. The models, dubbed Tiny Aya, are open-weight — meaning their underlying code is publicly available for anyone to use and modify — support over 70 languages, and can run on everyday devices like laptops without requiring an internet connection. The model, launched by the company’s research arm Cohere Labs, supports South Asian languages such as Bengali, Hindi, Punjabi, Urdu, Gujarati, Tamil, Telugu, and Marathi.  The base model contains 3.35 billion parameters — a measure of its size and complexity. Cohere has also launched TinyAya-Global, a version fine-tuned to better follow user commands, for apps that require broad language support. Regional variants round out the family: TinyAya-Earth for African languages; TinyAya-Fire for South Asian languages; and TinyAya-Water for Asia Pacific, West Asia, and Europe. Image Credits: Cohere “This approach allows each model to develop stronger linguistic grounding and cultural nuance, creating systems that feel more natural and reliable for the communities they are meant to serve. …