The hidden scaling cliff that’s about to break your agent rollouts

Bylaszlocsaba June 26, 2025June 27, 2025

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more

Enterprises that want to build and scale agents also need to embrace another reality: agents aren’t built like other software.

Agents are “categorically different” in how they’re built, how they operate, and how they’re improved, according to Writer CEO and co-founder May Habib. This means ditching the traditional software development life cycle when dealing with adaptive systems.

“Agents don’t reliably follow rules,” Habib said on Wednesday while on stage at VB Transform. “They are outcome-driven. They interpret. They adapt. And the behavior really only emerges in real-world environments.”

Knowing what works — and what doesn’t work — comes from Habib’s experience helping hundreds of enterprise clients build and scale enterprise-grade agents. According to Habib, more than 350 of the Fortune 1000 are Writer customers, and more than half of the Fortune 500 will be scaling agents with Writer by the end of 2025.

Using non-deterministic tech to produce powerful outputs can even be “really nightmarish,” Habib said — especially when trying to scale agents systemically. Even if enterprise teams can spin up agents without product managers and designers, Habib thinks a “PM mindset” is still needed for collaborating, building, iterating and maintaining agents.

“Unfortunately or fortunately, depending on your perspective, IT is going to be left holding the bag if they don’t lead their business counterparts into that new way of building.”

>>See all our Transform 2025 coverage here<<

Why goal-based agents is the right approach

One of the shifts in thinking includes understanding the outcome-based nature of agents. For example, she said that many customers request agents to assist their legal teams in reviewing or redlining contracts. But that’s too open-ended. Instead, a goal-oriented approach means designing an agent to reduce the time spent reviewing and redlining contracts.

“In the traditional software development life cycle, you are designing for a deterministic set of very predictable steps,” Habib said. “It’s input in, input out in a more deterministic way. But with agents, you’re seeking to shape agentic behavior. So you are seeking less of a controlled flow and much more to give context and guide decision-making by the agent.”

Another difference is building a blueprint for agents that instructs them with business logic, rather than providing them with workflows to follow. This includes designing reasoning loops and collaborating with subject experts to map processes that promote desired behaviors.

While there’s a lot of talk about scaling agents, Writer is still helping most clients with building them one at a time. That’s because it’s important first to answer questions about who owns and audits the agent, who makes sure it stays relevant and still checks if it’s still producing desired outcomes.

“There is a scaling cliff that folks get to very, very quickly without a new approach to building and scaling agents,” Habib said. “There is a cliff that folks are going to get to when their organization’s ability to manage agents responsibly really outstrips the pace of development happening department by department.”

QA for agents vs software

Quality assurance is also different for agents. Instead of an objective checklist, agentic evaluation includes accounting for non-binary behavior and assessing how agents act in real-world situations. That’s because failure isn’t always obvious — and not as black and white as checking if something broke. Instead, Habib said it’s better to check if an agent behaved well, asking if fail-safes worked, evaluating outcomes and intent: “The goal here isn’t perfection It is behavioral confidence, because there is a lot of subjectivity in this here.”

Businesses that don’t understand the importance of iteration end up playing “a constant game of tennis that just wears down each side until they don’t want to play anymore,” Habib said. It’s also important for teams to be okay with agents being less than perfect and more about “launching them safely and running fast and iterating over and over and over.”

Despite the challenges, there are examples of AI agents already helping bring in new revenue for enterprise businesses. For example, Habib mentioned a major bank that collaborated with Writer to develop an agent-based system, resulting in a new upsell pipeline worth $600 million by onboarding new customers into multiple product lines.

New version controls for AI agents

Agentic maintenance is also different. Traditional software maintenance involves checking the code when something breaks, but Habib said AI agents require a new kind of version control for everything that can shape behavior. It also requires proper governance and ensuring that agents remain useful over time, rather than incurring unnecessary costs.

Because models don’t map cleanly to AI agents, Habib said maintenance includes checking prompts, model settings, tool schemas and memory configuration. It also means fully tracing executions across inputs, outputs, reasoning steps, tool calls and human interactions.

“You can update a [large language model] LLM prompt and watch the agent behave completely differently even though nothing in the git history actually changed,” Habib said. “The model links shift, retrieval indexes get updated, tool APIs evolve and suddenly the same prompt does not behave as expected…It can feel like we are debugging ghosts.”

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.

Uncategorized

Here are the laptops I’d tell any parent to consider for their back-to-school student
Bylaszlocsaba July 26, 2025July 26, 2025

Buying GuideIf your back-to-school planning for this year calls for a new laptop, here are your best bets.If your back-to-school planning for this year calls for a new laptop, here are your best bets.by

Read More Here are the laptops I’d tell any parent to consider for their back-to-school student
Uncategorized

Razer Pro Click V2 Vertical Review: A Hybrid Gaming Mouse
Bylaszlocsaba July 26, 2025July 26, 2025

Switching to a vertical mouse is a hard sell. Having to change how you use a mouse completely can be an intimidating task, especially with how unnatural the new hand position feels at first—you’re going entirely against the muscle memory you’ve spent years building up.One of the largest challenges to the switch is the initial loss of pointer accuracy. If you’re in an office setting, you may find yourself wandering around a bit or struggling to move your new mouse as quickly as you did before. But in a slow-paced setting like that, all you struggle with is a few mis-clicks or slightly slower navigation. If you try to make this transition with gaming, it’s far more jarring, and the consequences are much more immediately noticeable.But even if it’s difficult to adapt to, could vertical mice be the future of gaming? Razer’s new Pro Click V2 Vertical Edition is a hybrid productivity and gaming vertical mouse. Vertical mice typically cater to office workers, but the focus on gaming performance makes the $120 Pro Click V2 one of a kind.Desk PresenceThe Pro Click V2 Vertical looks, more than anything else, like a modern gaming mouse. It has the textured exterior, metallic highlights, and slightly organic, H.R. Giger-esque curvature typical of Razer’s design language. But everything has been shifted around. The curved, cutting thumb rest sits on top of the mouse instead of on the side. A flare juts out from the right side as a place to rest the underside of your hand. The gunmetal highlight sits at the peak of the mouse rather than between the two buttons. Even the USB port is vertical, a humorous attention to detail.It’s intentionally designed as a gaming mouse that just happens to be vertical. Aesthetically, the only downside is the minimal RGB lighting. With only one section of lighting that runs along the bottom of the mouse, RGB lighting fans might feel disappointed. Still, it’s bright, reactive, and has great color accuracy. It’s more than enough for me, especially with how customizable it is with Razer’s Chroma software.The Pro Click V2 Vertical has the same specs as the standard Pro Click V2, with a 1,000-Hz polling rate, a 2.4-GHz dongle that can be stored on the underside, Bluetooth multi-device connectivity, and a reprogrammable button on top. The only features lost are the mouse wheel’s horizontal scrolling and toggleable non-ratcheted rotation.This mouse includes two major productivity features: app-specific profiles and multi-device connectivity, and both work effortlessly. Razer Synapse immediately detected different software and changed the active profile in response, and pressing the button on the underside of the mouse swapped between paired devices instantaneously.Beyond that, Razer Synapse is as impressive as always. I consistently find the software to be one of the best and most intuitive on the market, and that’s the case here. All of the menus are simple and efficient, the settings can be changed in real time, and the adjustments all have tooltips and explanations to tell you exactly what you’re changing.Annoyingly, Razer Synapse has advertisements on the homepage, something I’ve complained about when reviewing SteelSeries products in the past. However, unlike Steelseries GG, these “recommendations” can be permanently disabled in the app’s settings.Performance and PracticeThe overall hand position of the Pro Click V2 Vertical is natural, but incredibly upright. While some vertical mice, like those from Logitech or Hansker, find a middle ground between a standard and truly “vertical” hand position, Razer opted for a nearly perpendicular shape. While this is technically an ideal ergonomic shape, it will be harder to adapt if you’re moving directly from a standard mouse, and might not be as comfortable during the adjustment period.It felt unnatural for the first week or so, and required practice to use comfortably and confidently. Once I had acclimated, my speed and accuracy were nearly at the same level as a standard mouse, although consistent use still felt clunky and unfamiliar compared to the horizontal mice I’d been using for most of my life.

Read More Razer Pro Click V2 Vertical Review: A Hybrid Gaming Mouse
Uncategorized

Gridcare thinks more than 100 GW of data center capacity is hiding in the grid
Bylaszlocsaba May 27, 2025July 27, 2025

Hyperscalers and data center developers are in a pickle: They all want to add computing power tomorrow, but utilities frequently play hard to get, citing years-long waits for grid connections.

“All the AI data centers are struggling to get connected,” Amit Narayan, founder and CEO of Gridcare, told TechCrunch. “They’re so desperate. They are looking for solutions, which may or may not happen. Certainly not in the five-year timelines they cite.”

That has led many data centers to pursue what’s called “behind the meter” power sources — basically, they build their own power plants, a costly endeavor that hints at just how desperate they are for electricity.

But Narayan knew there was plenty of slack in the system, even if utilities themselves haven’t discovered it yet. He has studied the grid for the last 15 years, first as a Stanford researcher then as a founder of another company. “How do we create more capacity when everyone thinks that there is no capacity on the grid?” he said.

Narayan said that Gridcare, which has been operating in stealth, has already discovered several places where extra capacity exists, and it’s ready to play matchmaker between data centers and utilities.

Gridcare recently closed an oversubscribed $13.5 million seed round, the company told TechCrunch. The round was led by Xora, Temasek’s deep tech venture firm, with participation from Acclimate Ventures, Aina Climate AI Ventures, Breakthrough Energy Discovery, Clearvision, Clocktower Ventures, Overture Ventures, Sherpalo Ventures, and WovenEarth.

For Narayan and his colleagues at Gridcare, the first step to finding untapped capacity was to map the existing grid. Then the company used generative AI to help forecast what changes might be implemented in the coming years. It also layers on other details, including the availability of fiber optic connections, natural gas, water, extreme weather, permitting, and community sentiment around data center construction and expansion.

Techcrunch event

San Francisco
|
October 27-29, 2025

“There are 200,000-plus scenarios that you have to consider every time you’re running this study,” Narayan said.

To make sure it’s not running afoul of regulations, Gridcare then takes that data and weighs it against federal guidelines that dictate grid usage. Once it finds a spot, it starts talking with the relevant utility to verify the data.

“We’ll find out where the maximum bang for the buck is,” Narayan said.

At the same time, Gridcare works with hyperscalers and data center developers to identify where they are looking to expand operations or build new ones. “They have already told us what they’re willing to do. We know the parameters under which they can operate,” he said.

That’s when the matchmaking begins.

Gridcare sells its services to data center developers, charging them a fee based on how many megawatts of capacity the startup can unlock for them. “That fee is significant for us, but it’s negligible for data centers,” Narayan said.

For some data centers, the price of admission might be forgoing grid power for a few hours here and there, relying on on-site backup power instead. For others, the path might be clearer if their demand helps green-light a new grid-scale battery installation nearby. In the future, the winner might be the developer that is willing to pay more. Utilities have already approached Gridcare inquiring about auctioning access to newfound capacity.

Regardless of how it happens, Narayan thinks that Gridcare can unlock more than 100 gigawatts of capacity using its approach. “We don’t have to solve nuclear fusion to do this,” he said.

Update: Corrected spare capacity on the grid to gigawatts from megawatts.

Read More Gridcare thinks more than 100 GW of data center capacity is hiding in the grid
Uncategorized

Google DeepMind’s new AI can help historians understand ancient Latin inscriptions
Bylaszlocsaba July 23, 2025July 23, 2025

Google DeepMind has unveiled new artificial-intelligence software that could help historians recover the meaning and context behind ancient Latin engravings. Aeneas can analyze words written in long-weathered stone to say when and where they were originally inscribed. It follows Google’s previous archaeological tool Ithaca, which also used deep learning to reconstruct and contextualize ancient text, in its case Greek. But while Ithaca and Aeneas use some similar systems, Aeneas also promises to give researchers jumping-off points for further analysis. To do this, Aeneas takes in partial transcriptions of an inscription alongside a scanned image of it. Using these, it gives possible dates and places of origins for the engraving, along with potential fill-ins for any missing text. For example, a slab damaged at the start and continuing with … us populusque Romanus would likely prompt Aeneas to guess that Senat comes before us to create the phrase Senatus populusque Romanus, “The Senate and the people of Rome.” This is similar to how Ithaca works. But Aeneas also cross-references the text with a stored database of almost 150,000 inscriptions, which originated everywhere from modern-day Britain to modern-day Iraq, to give possible parallels—other catalogued Latin engravings that feature similar words, phrases, and analogies.
This database, alongside a few thousand images of inscriptions, makes up the training set for Aeneas’s deep neural network. While it may seem like a good number of samples, it pales in comparison to the billions of documents used to train general-purpose large language models like Google’s Gemini. There simply aren’t enough high-quality scans of inscriptions to train a language model to learn this kind of task. That’s why specialized solutions like Aeneas are needed. The Aeneas team believes it could help researchers “connect the past,” said Yannis Assael, a researcher at Google DeepMind who worked on the project. Rather than seeking to automate epigraphy—the research field dealing with deciphering and understanding inscriptions—he and his colleagues are interested in “crafting a tool that will integrate with the workflow of a historian,” Assael said in a press briefing.
Their goal is to give researchers trying to analyze a specific inscription many hypotheses to work from, saving them the effort of sifting through records by hand. To validate the system, the team presented 23 historians with inscriptions that had been previously dated and tested their workflows both with and without Aeneas. The findings, which were published today in Nature, showed that Aeneas helped spur research ideas among the historians for 90% of inscriptions and that it led to more accurate determinations of where and when the inscriptions originated. In addition to this study, the researchers tested Aeneas on the Monumentum Ancyranum, a famous inscription carved into the walls of a temple in Ankara, Turkey. Here, Aeneas managed to give estimates and parallels that reflected existing historical analysis of the work, and in its attention to detail, the paper claims, it closely matched how a trained historian would approach the problem. “That was jaw-dropping,” Thea Sommerschield, an epigrapher at the University of Nottingham who also worked on Aeneas, said in the press briefing. However, much remains to be seen about Aeneas’s capabilities in the real world. It doesn’t guess the meaning of texts, so it can’t interpret newly found engravings on its own, and it’s not clear yet how useful it will be to historians’ workflows in the long term, according to Kathleen Coleman, a professor of classics at Harvard. The Monumentum Ancyranum is considered to be one of the best-known and most well-studied inscriptions in epigraphy, raising the question of how Aeneas will fare on more obscure samples. Google DeepMind has now made Aeneas open-source, and the interface for the system is freely available for teachers, students, museum workers, and academics. The group is working with schools in Belgium to integrate Aeneas into their secondary history education. “To have Aeneas at your side while you’re in the museum or at the archaeological site where a new inscription has just been found—that is our sort of dream scenario,” Sommerschield said.

Read More Google DeepMind’s new AI can help historians understand ancient Latin inscriptions
Uncategorized

Walmart cracks enterprise AI at scale: Thousands of use cases, one framework
Bylaszlocsaba June 26, 2025June 27, 2025

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Walmart continues to make strides in cracking the code on deploying agentic AI at enterprise scale. Their secret? Treating trust as an engineering requirement, not some compliance checkbox you tick at the…

Read More Walmart cracks enterprise AI at scale: Thousands of use cases, one framework
Uncategorized

Our first long-duration energy storage partnership
Bylaszlocsaba July 25, 2025

Electricity powers modern life. And we’re accelerating a wide range of technologies, from enhanced geothermal to advanced nuclear to even fusion technologies, that can enable a future where on-demand electricity needs are met with clean energy, every hour of every day.Today, we’re adding another technology to our portfolio: long duration energy storage (LDES). Through a new long-term partnership with Energy Dome, we plan to support multiple commercial projects globally to deploy their LDES technology.Energy Dome’s novel CO₂ Battery can store excess clean energy and then dispatch it back to the grid for 8-24 hours, bridging the gap between when renewable energy is generated and when it is needed. With this commercial partnership, as well as an investment in the company, we believe these projects can unlock new clean energy for grids where we operate before 2030, helping meet near-term electricity system needs and moving us closer to our 24/7 carbon-free energy goal.By bringing this first-of-a-kind LDES technology to market faster, we aim to rapidly bring its potential to communities everywhere — making reliable, affordable electricity available around the clock and supporting the resilience of grids as they integrate growing amounts of renewable energy sources.Why it’s importantLithium-ion batteries, which typically store and dispatch power for 4 hours or less, have been critical for adding electricity capacity to grids and managing short-term fluctuations in renewable generation — when the sun isn’t shining or the wind isn’t blowing. Google’s support for these shorter-duration batteries has helped the grids we rely on, from Belgium to Nevada, meet peak electricity demand and reduce the need to ramp up fossil fuel power plants.But what if we could store and dispatch clean energy for more than a few hours, or even a full day? Studies by the Electric Power Research Institute show that LDES technologies can cost-effectively integrate a growing volume of renewables onto power systems and contribute to more flexible, reliable grids. The LDES Council estimates that deploying up to 8 terawatts (TW) of LDES by 2040 could result in $540 billion in annual savings globally, thanks in part to their ability to optimize grids.How the technology worksEnergy Dome’s novel approach to energy storage uses carbon dioxide (CO₂) held in a unique dome-shaped battery. When there’s an abundance of renewable energy on the grid, the system uses that power to compress CO₂ gas into a liquid. When the grid needs more clean power, the liquid CO₂ expands back into a hot gas under pressure, creating a powerful force — much like steam escaping a pressure cooker — which spins a turbine. This spinning turbine generates carbon-free energy that can flow directly back into the grid for durations ranging from 8 to 24 hours.Energy Dome has already signed contracts to build commercial scale projects in Italy, the U.S., and India. And their technology has already proven successful, having injected electrons into the Italian grid for more than three years, thanks to their commercial demonstration facility and now with their full-scale 20 megawatt (MW) commercial plant in Sardinia, Italy.Why scale is crucialLDES has the potential to commercialize much faster than some of the other advanced clean energy technologies in our portfolio. This means we can use it in the near term to help the electricity system grow more flexibly and reliably, alongside other tools we’re developing such as data center demand response.By supporting multiple commercial deployments of Energy Dome’s technology globally, we aim to bring this technology to scale faster and at lower costs. Beyond our long-term collaboration with Energy Dome, we plan to support a growing range of LDES technologies under development through both commercial agreements that can catalyze wider market adoption of more mature technologies, like Energy Dome’s, as well as earlier-stage investments.To remove barriers to the deployment and commercialization of LDES and other advanced carbon-free energy technologies, we’re also advocating for clean energy policies, ensuring that energy markets fully value firm, flexible carbon-free technologies, and advancing policy measures that enable infrastructure essential for grid decarbonization and energy security.We’re excited to take this first step with Energy Dome to unlock the full potential of LDES. Our partnership will strengthen grid resilience while enabling us to power our technologies, grow our economies and keep the lights on in our homes with 24/7 clean energy.

Read More Our first long-duration energy storage partnership

Why goal-based agents is the right approach

QA for agents vs software

New version controls for AI agents

Similar Posts

Leave a Reply Cancel reply