All posts tagged: Scaling

Train-to-Test scaling explained: How to optimize your end-to-end AI compute budget for inference

Train-to-Test scaling explained: How to optimize your end-to-end AI compute budget for inference

The standard guidelines for building large language models (LLMs) optimize only for training costs and ignore inference costs. This poses a challenge for real-world applications that use inference-time scaling techniques to increase the accuracy of model responses, such as drawing multiple reasoning samples from a model at deployment. To bridge this gap, researchers at University of Wisconsin-Madison and Stanford University have introduced Train-to-Test (T2) scaling laws, a framework that jointly optimizes a model’s parameter size, its training data volume, and the number of test-time inference samples. In practice, their approach proves that it is compute-optimal to train substantially smaller models on vastly more data than traditional rules prescribe, and then use the saved computational overhead to generate multiple repeated samples at inference. For enterprise AI application developers who are training their own models, this research provides a proven blueprint for maximizing return on investment. It shows that AI reasoning does not necessarily require spending huge amounts on frontier models. Instead, smaller models can yield stronger performance on complex tasks while keeping per-query inference costs manageable …

Mind the scaling gap: female ambition needs better funding, not more advice

Mind the scaling gap: female ambition needs better funding, not more advice

A male founder’s pitchdeck is taken at face value – and more likely to be backed – despite often being higher risk. Female entrepreneurs’ forecasts, meanwhile, are ‘discounted’ by investors for appearing conservative,even though they are more likely to be met. This is one of many depressing truths that are holding back women in business that were discussed on Sunday’s International Women’s Day. It’s not all grim. Research published today in the Pathway Report by ScaleUp Institute found Scotland’s 139 scaling female founded businesses are generating £1.5 billion in revenues and employing more than 16,000 people. There’s growing investment into female-founded Scottish scale ups – up 80% on last year – but this remains concentrated in a small number of firms. “Many continue to rely on bootstrapping and traditional finance rather than growth equity. Too often women are scaling in spite of the system not because of it,” the report found. In a bid to tacking a more practical approach to levelling the business playing field at ScaleUp Standard, this week, Stripe and Stare’s co-founder …

Scaling agentic AI means trusting your data – here’s what most CDOs are investing in

Scaling agentic AI means trusting your data – here’s what most CDOs are investing in

J Studios /DigitalVision via getty Images Follow ZDNET: Add us as a preferred source on Google. ZDNET’s key takeaways Half of agentic AI adopters cite data quality and retrieval issues as deployment barriers. 76% of data leaders report that governance has not kept pace with the rise in AI use. 86% plan to increase investment in data management to support AI growth. A new survey of 600 chief data officers (CDOs) found that 69% of companies with revenues of $500M+ are using generative AI in their operations, up from 48% in 2025. Although AI adoption is increasing, the report found that data and AI literacy are a concern. Of the CDOs surveyed, 75% believe their workforce needs upskilling in data literacy, and 74% in AI literacy to responsibly use AI or AI outputs in day-to-day operations. Improved data and AI literacy will increase AI adoption in business.    Also: AI agents are fast, loose, and out of control, MIT study finds The report, from Informatica, Wakefield Research, and Deloitte, noted that although skill set is a challenge, trust …

AI Agents are delivering real ROI — Here’s what 1,100 developers and CTOs reveal about scaling them

AI Agents are delivering real ROI — Here’s what 1,100 developers and CTOs reveal about scaling them

Presented by DigitalOcean From refactoring codebases to debugging production code, AI agents are already proving their value. But scaling them in production remains the exception, not the rule. In DigitalOcean’s 2026 Currents research report, based on a survey of more than 1,100 developers, CTOs, and founders, 67% of organizations using agents report productivity gains. Meanwhile, 60% of respondents say applications and agents represent the greatest long-term value in the AI stack. Yet, only 10% are scaling agents in production.  The top blocker? Forty-nine percent cite the high cost of inference. It’s not just the price of a single API call. It’s the compounding cost as agents chain tasks and run autonomously. Nearly half of respondents now spend 76–100% of their AI budget on inference alone. This is a problem DigitalOcean is working to solve. What’s needed is infrastructure designed around inference economics: predictable performance, cost control under load, and fewer moving parts. That’s how 2026 becomes the year agents graduate from pilot to product. 52% of companies are actively implementing AI solutions (including agents) Just …

Apple Reportedly Scaling Back This Long-Rumored iOS 27 Feature

Apple Reportedly Scaling Back This Long-Rumored iOS 27 Feature

iOS 27 will no longer include a long-rumored feature known as Apple Health+ inside Apple, according to Bloomberg‘s Mark Gurman. Apple Health+ was supposed to be a virtual health coach that could give users AI-powered health recommendations in the Apple Health app, based on their personal health data, the report said. The feature would have provided users with detailed health reports, videos that explained medical conditions and offered wellness tips, and more. “The major new service would have combined new surveys and health assessments with data from Apple Watches and external lab reports,” the report added. It is unclear if Apple Health+ would have been a paid subscription service. The feature is being scaled back instead of outright canceled. The report said some of the components of Apple Health+, such as suggestions based on existing Health app data, will be “repurposed and introduced as early as this year.” Apple Health+ was initially rumored to be an iOS 26 feature, so it was seemingly in development for a long time. But now, only bits and pieces …

How OpenAI is scaling the PostgreSQL database to 800 million users

How OpenAI is scaling the PostgreSQL database to 800 million users

While vector databases still have many valid use cases, organizations including OpenAI are leaning on PostgreSQL to get things done. In a blog post on Thursday, OpenAI disclosed how it is using the open-source PostgreSQL database. OpenAI runs ChatGPT and its API platform for 800 million users on a single-primary PostgreSQL instance — not a distributed database, not a sharded cluster. One Azure PostgreSQL Flexible Server handles all writes. Nearly 50 read replicas spread across multiple regions handle reads. The system processes millions of queries per second while maintaining low double-digit millisecond p99 latency and five-nines availability. The setup challenges conventional scaling wisdom and offers enterprise architects insight into what actually works at massive scale. The lesson here isn’t to copy OpenAI’s stack. It’s that architectural decisions should be driven by workload patterns and operational constraints — not by scale panic or fashionable infrastructure choices. OpenAI’s PostgreSQL setup shows how far proven systems can stretch when teams optimize deliberately instead of re-architecting prematurely. “For years, PostgreSQL has been one of the most critical, under-the-hood data …

Salesforce Research: Across the C-suite, trust is the key to scaling agentic AI

Salesforce Research: Across the C-suite, trust is the key to scaling agentic AI

Presented by Salesforce In 2025, Salesforce conducted a series of C-suite research studies to capture if and how top decision-makers are building an agentic AI strategy. While the research shows positive signals like agent adoption is expected to surge 327% over the next two years, the dominant one is clear: leaders may be racing to deploy AI agents, but unlocking real value hinges on trust in data, systems, employees, and, above all, the leadership guiding the change. Trust is the connective tissue that determines whether companies can actually scale AI agents and unlock the value they’re projecting. At Salesforce, this trust imperative is operationalized through Agentforce. The Agentforce 360 Platform, the foundational layer of the company’s agentic platform, embeds trust directly into how agents reason, act, and collaborate with humans. This ensures leaders can implement agentic AI at scale. “As organizations scale AI agents, trust becomes the accelerator,” says Joe Inzerillo, chief digital officer of Salesforce. “When leaders trust their data, their systems, and their governance, AI moves from experimentation to enterprise impact. Trust isn’t …

Scaling leadership, inside and out: Reflections from 2025

Scaling leadership, inside and out: Reflections from 2025

As 2025 comes to a close, I’ve been reflecting on what it means to practice leadership while helping others develop it. I’m Charlotte Sharpe, Managing Director of Research and Innovation at Big Think+. My role is to drive alignment between our content and platform, ensuring that what we build, design, and deliver truly serves our clients—organizations that are bringing leadership development to life within their own cultures. Across this year, our team’s work has revolved around three ideas: clarity, collaboration, and storytelling. Together, they’ve shaped how we scale leadership: both inside Big Think+ and across the organizations we partner with.  1. Clarity Scales One of our most important realizations this year is that clarity is a form of leadership. The ability to define what we mean, decide what matters, and move forward even when information is incomplete has been essential to every major milestone we reached. In her Big Think+ lesson “Systematic Strategies for Making Hard Calls,” Suzy Welch reminds us that decisiveness is not about being impulsive. It’s about using structure to make confident …

Scaling solid-state battery manufacturing for Europe

Scaling solid-state battery manufacturing for Europe

The SOLiD project is developing next-generation lithium-metal batteries, combining safety and performance with recyclability for Europe’s clean energy future. On the path to a climate-neutral future, few technologies are as indispensable and as closely scrutinised as the battery. Powering everything from smartphones to electric vehicles (EVs), batteries sit at the heart of the energy transition. But as demand increases, so do concerns about sustainability, cost, and the physical limits of today’s dominant lithium-ion technology. SOLiD is an ambitious EU-funded project aimed at redesigning the rules of battery manufacturing. Launched in September 2022, SOLiD brings together 14 partners across nine countries, ranging from industrial powerhouses to nimble SMEs and leading research institutes. The goal of the project is to create a sustainable pilot-scale process for the next generation of solid-state lithium-metal batteries, which promise higher energy density, improved safety, longer lifespan, and recyclability by design. Three years after SOLiD’s outset, the project has reached a turning point. Its breakthroughs in electrode coating, electrolyte formulation, and digital quality control are paving the way for Europe to scale …

Scaling Europe’s hydrogen refuelling infrastructure

Scaling Europe’s hydrogen refuelling infrastructure

In 2026, the H2REF-DEMO project will demonstrate and test its hydrogen compression technology, advancing towards the delivery of reliable hydrogen refuelling systems for heavy-duty vehicles. Europe’s transition to a net-zero economy depends on clean, reliable hydrogen infrastructure capable of supporting both light and heavy-duty mobility. To enable the hydrogen sector to thrive and meet the demands of the future, there is a clear need for continued investment, research, and innovation in hydrogen technologies. An example of such research and innovation is the H2REF-DEMO project, which aims to develop cost-effective and reliable hydrogen fuel cell vehicle refuelling systems. Demonstration: From laboratory to full-scale prototype H2REF-DEMO is co-ordinated by the French Technical Center for Mechanical Industries, CETIM, and will run for a total of 42 months (2023–2026). The project brings together industry leaders and research institutes to validate a hydraulic compression and refuelling system capable of delivering hydrogen at unprecedented flow rates of 150 kg per hour, with consumption below 3.5 kWh per kilogram. In its final year, the project will install and test a full-scale hydrogen …