Walmart cracks enterprise AI at scale: Thousands of use cases, one framework



Walmart continues to crack the code on deploying agentic AI at enterprise scale. Its secret? Treating trust as an engineering requirement, not a compliance checkbox ticked at the end.

During the “Trust in the Algorithm: How Walmart’s Agentic AI Is Redefining Consumer Confidence and Retail Leadership” session at VB Transform 2025, Walmart’s VP of emerging technology, Desirée Gosby, explained how the retail giant operationalizes thousands of AI use cases. A primary objective for the retailer is to consistently maintain and strengthen the confidence of its 255 million weekly shoppers.

“We see this as a pretty big inflection point, very similar to the internet,” Gosby told industry analyst Susan Etlinger during Tuesday’s morning session. “It’s as profound in terms of how we’re actually going to operate, how we actually do work.”

The session delivered valuable lessons from Walmart’s AI deployments. Implicit throughout the discussion was the retail giant’s continual search for ways to apply distributed-systems architecture principles and avoid creating technical debt.

See all our Transform 2025 coverage here.

Four-stakeholder framework structures AI deployment

Walmart’s AI architecture rejects horizontal platforms in favor of targeted stakeholder solutions. Each group receives purpose-built tools that address its specific operational frictions.

Customers engage Sparky for natural language shopping. Field associates get inventory and workflow optimization tools. Merchants access decision-support systems for category management. Sellers receive business integration capabilities. “And then, of course, we’ve got developers, and really, you know, giving them the superpowers and charging them up with, you know, the new agentic tools,” Gosby explained.

“We have hundreds, if not thousands, of different use cases across the company that we’re bringing to life,” Gosby revealed. The scale demands architectural discipline that most enterprises lack.

The segmentation acknowledges that each team at Walmart needs purpose-built tools for its specific job. Store associates managing inventory need different tools than merchants analyzing regional trends. Generic platforms fail because they ignore operational reality. Walmart’s specificity drives adoption through relevance, not mandate.

Trust economics are driving AI adoption at Walmart

Walmart discovered that trust is built through value delivery, not mandatory training programs whose value associates sometimes question.

Gosby’s example resonated as she explained her mother’s shopping evolution from weekly store visits to COVID-era deliveries, illustrating exactly how natural adoption works. Each step provided an immediate, tangible benefit. No friction, no forced change management, yet the progression happened faster than anyone could have predicted.

“She’s been interacting with AI through that whole time,” Gosby explained. “The fact that she was able to go to the store and get what she wanted, it was on the shelf. AI was used to do that.”

Gosby’s mother’s experience also points to the benefits customers stand to gain from Walmart’s predictive commerce vision. “Instead of having to go weekly, figure out what groceries you need to have delivered, what if it just showed up for you automatically?” That’s the essence of predictive commerce: delivering value at scale to every Walmart customer.

“If you’re adding value to their lives, helping them remove friction, helping them save money and live better, which is part of our mission, then the trust comes,” Gosby stated. Associates follow the same pattern. When AI actually improves their work, saves them time and helps them excel, adoption happens naturally and trust is earned.

Fashion cycles compress from months to weeks

Walmart’s Trend to Product system quantifies the operational value of AI. The platform synthesizes social media signals, customer behavior and regional patterns to slash product development from months to weeks.

“Trend to Product has gotten us down from months to weeks to getting the right products to our customers,” Gosby revealed. The system creates products in response to real-time demand rather than historical data.

The months-to-weeks compression transforms Walmart’s retail economics. Inventory turns accelerate. Markdown exposure shrinks. Capital efficiency multiplies. The company maintains price leadership while matching any competitor’s speed-to-market capabilities. Every high-velocity category can benefit from using AI to shrink time-to-market and deliver quantifiable gains.

How Walmart uses MCP to create a scalable agent architecture

Walmart’s approach to agent orchestration draws directly from its hard-won experience with distributed systems. The company uses Model Context Protocol (MCP) to standardize how agents interact with existing services.

“We break down our domains and really looking at how do we wrap those things as MCP protocol, and then exposing those things that we can then start to orchestrate different agents,” Gosby explained. The strategy transforms existing infrastructure rather than replacing it.

The architectural philosophy runs deeper than protocols. “The change that we’re seeing today is very similar to what we’ve seen when we went from monoliths to distributed systems. We don’t want to repeat those mistakes,” Gosby stated.

Gosby outlined the execution requirements: “How do you decompose your domains? What MCP servers should you have? What sort of agent orchestration should you have?” At Walmart, these represent daily operational decisions, not theoretical exercises.

“We’re looking to take our existing infrastructure, break it down, and then recompose it into the agents that we want to be able to build,” Gosby explained. This standardization-first approach enables flexibility. Services built years ago now power agentic experiences through proper abstraction layers.
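To make the pattern concrete, here is a minimal sketch of what wrapping an existing domain service as an MCP server can look like, using the open-source MCP Python SDK. The inventory endpoint, tool name and response fields are illustrative assumptions for this article, not details Walmart disclosed.

```python
# Hypothetical sketch: exposing an existing inventory service as MCP tools
# that orchestrated agents can call. Endpoint, tool name and fields are
# assumptions for illustration, not Walmart's actual implementation.
import httpx
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("inventory-domain")

INVENTORY_API = "https://internal.example.com/inventory"  # placeholder endpoint


@mcp.tool()
def check_store_stock(sku: str, store_id: str) -> dict:
    """Return the on-hand quantity for a SKU at a given store."""
    resp = httpx.get(f"{INVENTORY_API}/{store_id}/{sku}", timeout=10)
    resp.raise_for_status()
    data = resp.json()
    return {"sku": sku, "store_id": store_id, "on_hand": data.get("on_hand", 0)}


if __name__ == "__main__":
    # The existing service stays in place; agents reach it through this wrapper.
    mcp.run()
```

The point of the wrapper is that the underlying service does not change: the MCP layer is the abstraction that lets years-old infrastructure participate in agentic workflows.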

Merchant expertise becomes enterprise intelligence

Walmart leverages decades of employee knowledge, making it a core component of its growing AI capabilities. The company systematically captures category expertise from thousands of merchants, creating a competitive advantage no digital-first retailer can match.

“We have thousands of merchants who are excellent at what they do. They are experts in the categories that they support,” Gosby explained. “We have a cheese merchant who knows exactly what wine goes or what cheese pairing, but that data isn’t necessarily captured in a structured way.”

AI operationalizes this knowledge. “With the tools that we have, we can capture that expertise that they have and really bring that to bear for our customers,” Gosby said. The application is specific: “When they’re trying to figure out, hey, I need to throw the party, what kind of appetizers should I have?”

The strategic advantage compounds. Decades of merchant expertise become accessible through natural language queries. Digital-first retailers lack this human knowledge foundation. Walmart’s 2.2 million associates represent proprietary intelligence that algorithms cannot synthesize independently.
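As a loose illustration of the idea, the hypothetical sketch below shows how tacit pairing expertise might be captured as structured records that an agent can query. The schema and entries are invented for this example, not Walmart’s actual data model.

```python
# Hypothetical sketch: capturing a merchant's tacit pairing expertise as
# structured records an agent can retrieve for a customer's occasion.
from dataclasses import dataclass


@dataclass
class PairingNote:
    category: str        # merchant's category, e.g. "cheese"
    item: str
    pairs_with: list[str]
    occasion: str        # context the merchant would recommend it for


KNOWLEDGE_BASE = [
    PairingNote("cheese", "aged gouda", ["cabernet sauvignon", "fig jam"], "party appetizers"),
    PairingNote("cheese", "brie", ["champagne", "baguette"], "party appetizers"),
]


def suggest_for(occasion: str) -> list[PairingNote]:
    """Surface merchant-curated pairings for a customer's stated occasion."""
    return [note for note in KNOWLEDGE_BASE if note.occasion == occasion]


for note in suggest_for("party appetizers"):
    print(f"{note.item}: pairs with {', '.join(note.pairs_with)}")
```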

New metrics measure autonomous success

Walmart pioneers measurement systems designed for autonomous AI rather than human-driven processes. Traditional funnel metrics fail when agents handle end-to-end workflows.

“In an agentic world, we’re starting to work through this, and it’s going to change,” Gosby said. “The metrics around conversion and things like that, those are not going to change, but we’re going to be looking at goal completion.”

The shift reflects operational reality. “Did we actually achieve what is the ultimate goal that our associate, that our customers, are actually solving?” Gosby asked. The question reframes success measurement.

“At the end of the day, it’s a measure of, are we delivering the benefit? Are we delivering the value that we expect, and then working back from there to basically figure out the right metrics?” Gosby explained. Problem resolution matters more than process compliance. How AI is helping customers achieve their goals is prioritized over conversion funnels.
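As a simple illustration of the shift, the sketch below contrasts a goal-completion rate with a traditional conversion rate over a handful of hypothetical agent sessions. The field names and session records are assumptions for this example, not Walmart’s metrics pipeline.

```python
# Hypothetical sketch: goal completion vs. funnel conversion for agent sessions.
from dataclasses import dataclass


@dataclass
class AgentSession:
    stated_goal: str      # e.g. "reorder weekly groceries"
    goal_completed: bool  # did the agent fully resolve the customer's goal?
    converted: bool       # legacy funnel metric: was an order placed?


def goal_completion_rate(sessions: list[AgentSession]) -> float:
    """Share of sessions where the customer's stated goal was fully resolved."""
    return sum(s.goal_completed for s in sessions) / len(sessions) if sessions else 0.0


def conversion_rate(sessions: list[AgentSession]) -> float:
    """Legacy metric: share of sessions that ended in a transaction."""
    return sum(s.converted for s in sessions) / len(sessions) if sessions else 0.0


sessions = [
    AgentSession("reorder weekly groceries", goal_completed=True, converted=True),
    AgentSession("find a gluten-free birthday cake", goal_completed=False, converted=True),
    AgentSession("plan party appetizers", goal_completed=True, converted=False),
    AgentSession("check delivery window", goal_completed=True, converted=False),
]

print(f"goal completion: {goal_completion_rate(sessions):.0%}")  # 75%
print(f"conversion:      {conversion_rate(sessions):.0%}")       # 50%
```

The two numbers can diverge: a session that ends in a purchase can still fail the customer’s goal, and a session with no transaction can resolve it completely.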

Enterprise lessons from Walmart’s AI transformation

Walmart’s Transform 2025 session delivers actionable intelligence for enterprise AI deployment. The company’s operational approach provides a framework that has been validated at scale.

  • Apply architectural discipline from day one. The shift from monoliths to distributed systems taught Walmart the lessons it now applies to AI: build proper foundations before scaling and define a systematic approach that prevents expensive rework.
  • Match solutions to specific user needs. One-size-fits-all AI fails every time. Store associates need different tools than merchants. Suppliers require different capabilities than developers. Walmart’s targeted approach drives adoption.
  • Build trust through proven value. Start with clear wins that deliver measurable results. Walmart moved from basic inventory management to predictive commerce step by step. Each success earns insights and knowledge for the next.
  • Turn employee knowledge into enterprise assets. Decades of specialist expertise exists within your organization. Walmart systematically captures merchant intelligence and operationalizes it across 255 million weekly transactions. This institutional knowledge creates competitive advantage no algorithm can replicate from scratch.
  • Measure what matters in autonomous systems. Conversion rates miss the point when AI handles entire workflows. Focus on problem resolution and value delivery. Walmart’s metrics evolved to match operational reality.
  • Standardize before complexity hits. Integration failures killed more projects than bad code ever did. Walmart’s protocol decisions prevent the chaos that derails most AI initiatives. Structure enables speed.

“It always comes back to basics,” Gosby advised. “Take a step back and first understand what problems do you really need to solve for your customers, for our associates. Where is there friction? Where is there manual work that you can now start to think differently about?”

Walmart’s blueprint scales beyond retail

Walmart demonstrates how enterprise AI succeeds through engineering discipline and systematic deployment. The company processes millions of daily transactions across 4,700 stores by treating each stakeholder group as a distinct challenge requiring tailored, real-time solutions.

“It’s permeating everything it is that we do,” Gosby explained. “But at the end of the day, the way that we look at it is we always start with our customers and our members and really understanding how it’s going to impact them.”

Their framework applies across industries. Financial services organizations balancing customer needs with regulatory requirements, healthcare systems coordinating patient care across providers, and manufacturers managing complex supply chains all face similar multi-stakeholder challenges. Walmart’s approach provides a tested methodology for addressing this complexity.

“Our customers are trying to solve a problem for themselves. Same thing for our associates,” Gosby stated. “Did we actually solve that problem with these new tools?” This focus on problem resolution rather than technology deployment drives measurable outcomes. Walmart’s scale validates the approach for any enterprise ready to move beyond pilot programs.
