Connect with us

Tech

An exclusive tour of Amazon’s Trainium lab, the chip that’s won over Anthropic, OpenAI, even Apple 

Shortly after Amazon CEO Andy Jassy announced AWS’s groundbreaking $50 billion investment deal with OpenAI, Amazon invited me on a private tour of the chip development lab at the heart of the deal, at (mostly*) its own expense. 

Industry experts are watching Amazon’s Trainium chip, created at that facility, for its implications for lower-cost AI inference and, potentially, a dent in Nvidia’s near monopoly.  

Curious, I agreed to go.  

My tour guides for the day were the lab’s director, Kristopher King (pictured below right) and director of engineering Mark Carroll (below left), as well as the team’s PR person who arranged the visit, Doron Aronson (pictured with yours truly later in the story). 

ASW Chip lab leaders Mark Carroll, Kristopher King
AWS Chip lab leaders Mark Carroll and Kristopher King.Image Credits:TechCrunch/Julie Bort

AWS has been Anthropic’s major cloud platform since the AI lab’s early days — a relationship significant enough to survive Anthropic later adding Microsoft as a cloud partner as well, and Amazon’s growing partnership with OpenAI.

The OpenAI deal makes AWS the exclusive provider of the model maker’s new AI agent builder, Frontier, which could become an important part of OpenAI’s business if agents become as big as Silicon Valley thinks they will. We’ll see if that exclusivity stands exactly as announced. The Financial Times reported this week that Microsoft may believe OpenAI’s deal with Amazon violates its own deal with OpenAI, namely with Redmond getting access to all of OpenAI’s models and tech.

What makes AWS so appealing to OpenAI? As part of this deal, the cloud giant has agreed to supply OpenAI with 2 gigawatts of Trainium computing capacity. This is a giant commitment, given that Anthropic and Amazon’s own Bedrock service are already consuming Trainium chips faster than Amazon can produce them. 

Techcrunch event

San Francisco, CA
|
October 13-15, 2026

There are 1.4 million Trainium chips deployed across all three generations, and Anthropic’s Claude runs on over 1 million of the Trainium2 chips deployed, the company said.

It’s worth noting that while Trainium was originally geared toward faster, cheaper model training (a bigger priority a couple of years ago), it’s now tuned and used for inference as well. Inference — the process of actually running an AI model to generate responses — is currently the biggest performance bottleneck in the industry. 

Case in point: Trainium2 handles the majority of the inference traffic on Amazon’s Bedrock service, which supports the building of AI applications by Amazon’s many enterprise customers and allows the apps to use multiple models.

“Our customer base is just expanding as fast as we can get capacity out there,” King said. “Bedrock could be as big as EC2 one day,” he added, referring to AWS’s behemoth compute cloud service. 

Amazon's Trainium3 chip
Amazon’s Trainium3 chip.Image Credits:Amazon

Trainium vs. Nvidia

Beyond offering an alternative to Nvidia’s backlogged, hard-to-acquire GPUs, Amazon says its new chips running on its new specialty Trn3 UltraServers cost up to 50% less to run for comparable performance than using classic cloud servers. 

Along with Trainium3, released in December, this AWS team also built new Neuron switches, and Carroll says that combo is transformative.

“What that gives us is something huge,” Carroll said. The switches allow every Trainium3 chip to talk to every other chip in a mesh configuration, reducing latency. “That’s why Trainium3 is breaking all kinds of records,” particularly in “price per power,” he said. 

When trillions of tokens a day are involved, such improvements add up.  

In fact, Amazon’s chip team was lauded by Apple in 2024. In a rare moment of openness for the secretive company, Apple’s director of AI publicly described how it used another of the team’s chips — Graviton, a low-power, ARM-based server CPU and the first breakout chip this team designed. Apple also lauded Inferentia — a chip specifically designed for inference — and gave a nod to Trainium, which was new at the time. 

These chips represent the classic Amazon playbook: See what people want to buy, then build an in-house alternative that competes on price. 

The catch for chips, historically, has been switching costs. Applications written for Nvidia’s chips must be re-architected to work with others — a time-consuming process that discourages developers from switching.

But the AWS chip team proudly told me that Trainium now supports PyTorch, a popular open source framework for building AI models. That includes many of the ones hosted on Hugging Face, a vast library where developers share open source models.

The transition, Carroll told me, requires “basically a one-line change, and then recompile, and then run on Trainium.” In other words, Amazon is attempting to chip away at Nvidia’s market dominance wherever possible.

AWS has also this month announced a partnership with Cerebras Systems, integrating that company’s inference chip on servers running Trainium for what Amazon promises will be superpowered, low-latency AI performance. 

But Amazon’s ambitions go beyond the chips themselves. It also designs the server that hosts the chips. Besides the networking components, this team has designed “Nitro,” a hardware-software combo that provides virtualization tech (which allows many instances of software to run separately on the same server); new state-of-the-art liquid cooling technology; and the server sleds (pictured below) that host this gear. 

All of that is to control cost and performance. 

AWS Austin chip lab tour, sled with components
AWS Austin chip lab tour, sled with components.Image Credits:TechCrunch/Julie Bort

Working 24/7 on the “bring-up” 

Amazon’s custom chip-designing unit was born when the cloud giant bought Israeli chip designer Annapurna Labs in January 2015 for about $350 million. So this team has now had more than 10 years designing chips for AWS. The unit has retained its Annapurna roots and name — its logo is everywhere in the office. 

This chip lab is located in a shiny, chrome-windowed building in Austin’s upscale “The Domain” district, a walkable area filled with shops and restaurants that’s sometimes called Austin’s Silicon Valley

The offices have your classic tech corporate vibe: desks in cubicles, gathering spots, and conference rooms. But tucked away at the back of a high floor in the building is the actual lab, with sweeping views of the city.  

The shelving-filled lab, about the size of two large conference rooms, is a noisy industrial space thanks to the fans on the equipment. It looks like a cross between a high school shop class and a Hollywood set for a high-end lab, except the engineers are dressed in jeans, not white lab coats.

ASW Chip Lab
AWS Austin Chip Lab.Image Credits:TechCrunch/Julie Bort
ASW Austin chip lab
AWS Austin chip lab.Image Credits:TechCrunch/Julie Bort

Note that this is not where the chips are manufactured, so no white hazmat suits were necessary. The Trainium3 is a state-of-the-art 3-nanometer chip, produced by TSMC, arguably the leader in 3-nanometer manufacturing, with other chips produced by Marvell. 

But this is the room where the magic of the “bring-up” occurs.  

“A silicon bring-up is when you get the chip for the first time, and it’s like a big overnight party. You stay here, like a lock-in,” King explains. After 18 months of work, the chip is activated for the first time to verify it works as designed. The team even filmed some of the Trainium3 bring-up and posted it on YouTube.

Spoiler alert: It’s never problem-free.  

For Trainium3, the prototype chip was originally air-cooled, like previous versions. The current chip is now liquid-cooled, which offers energy advantages and was quite an engineering feat.

During the bring-up, the dimensions for how the chip attached to the air-cooling heat sink were off, so the chip couldn’t be activated. 

Unfazed, the team “immediately got a grinder and just started grinding off the metal,” King said. Because they didn’t want the noise disrupting the bring-up pizza party atmosphere, they snuck off and did the grinding in a conference room.  

Staying up all night and solving problems “is what silicon bring-up is all about,” King said. 

The lab even has a welding station, where hardware lab engineer and master welder Isaac Guevara demonstrated welding tiny integrated circuit components through a microscope. This is such insanely difficult work that senior leader Carroll openly admitted he couldn’t do it, to the guffaws of Guevara and the rest of the engineers in the room. 

ASW Chip tour welding station
AWS Austin chip lab tour, welding station.Image Credits:TechCrunch/Julie Bort

The lab also contains both custom-made and commercial tools for testing and analyzing issues with chips. Here’s signal engineer Arvind Srinivasan demonstrating how the lab tests each tiny component on the chip:

AWS Austin chip lab tour, testing equipment
AWS Austin chip lab tour, testing equipment.Image Credits:TechCrunch/Julie Bort

Sleds are the star of the lab 

But the star of the lab is an entire row showcasing each generation of the “sleds” the team designed. 

AWS Austin chip lab tour wall of sleds
AWS Austin chip lab tour wall of sleds.Image Credits:TechCrunch/Julie Bort

Sleds are the trays that house the Trainium AI chips, Graviton CPU chips, and supporting boards and components. Stack them together on a rack with the networking component, also custom-designed by this team, and you get the systems that are at the heart of Anthropic Claude’s success. 

Here’s the sled that was shown off during the AWS re:invent conference in December: 

AWS Austin chip lab tour, Tranium3 sled
AWS Austin chip lab tour, Trainium3 sled.Image Credits:TechCrunch/Julie Bort

Proven by Anthropic and OpenAI

I expected my guides to crow about the OpenAI deal during the tour. But they didn’t. 

The reticence could have been related to the aforementioned potential legal haze that might hang over the deal. But the sense I got was that these boots-on-the-ground engineers (who are currently designing the next version, Trainium4) haven’t had much chance to work with OpenAI yet. Their day-to-day work has so far been focused on Anthropic’s and Amazon’s needs.

Currently, the biggest chunk of Trainium2 chips is deployed in Project Rainier — one of the world’s largest AI compute clusters — which went live in late 2025 with 500,000 chips. It’s used by Anthropic. 

But there was a wall monitor in the main office displaying a quote about how OpenAI will be using Trainium. The pride was there, if subtle.  

In addition to this lab, the team also has its own private data center for quality and testing purposes. A short drive away, it doesn’t run customer workloads, so it’s housed at a co-location facility, not an AWS data center.

Security is tight: There are strict protocols to enter the building and to access Amazon’s area within.

The data center’s cooling system is so loud that earplugs are mandatory, and the air is thick with the acrid smell of heated metal. It’s not a pleasant place for the average person to hang out. 

AWS Austin chip lab tour data center
Here’s me and Aronson at the AWS Austin chip lab data center, protecting our ears next to live servers.Image Credits:TechCrunch / Julie Bort

At this data center, there are rows and rows of servers filled with sleds that integrate all of Amazon’s newest custom chips: Graviton CPU, liquid-cooled Trainium3, Amazon Nitro, all happily computing away. The liquid runs on a closed system, meaning it is reused, which should also help reduce the environmental impact, the engineers said. 

Here’s what a current Trn3 UltraServer looks like: Multiple sleds are on top and bottom, with the Neuron switches in the middle. Hardware development engineer David Martinez-Darrow is seen here performing maintenance on a sled:

AWS Austin chip lab tour data center
AWS Austin chip lab tour data center.Image Credits:TechCrunch/Julie Bort

While attention on the team has always been high, the scrutiny has really ratcheted up as of late. 

Amazon CEO Andy Jassy keeps a close eye on this lab, publicly bragging about its products like a proud dad. In December, he said Trainium was already a multibillion-dollar business for AWS and called it one piece of AWS tech he’s most excited about. He also gave the chip a shout-out when announcing the OpenAI agreement.  

The team feels the pressure, too. Engineers will work 24/7 for three to four weeks around each bring-up event to fix any issues so the chips can be mass-produced and put into data centers.

“It’s very important that we get as fast as possible to prove that it’s actually going to work,” Carroll said. “So far, we’ve been doing really well.” 

*Disclosure: Amazon provided airfare and covered the cost of one night at a local hotel. Honoring its Leadership Principle of Frugality, this was a back-of-the-plane middle seat and a modest room. TechCrunch picked up the other associated travel costs like Ubers and luggage fees. (Yes, I checked a bag for an overnight trip. I’m high maintenance that way.) 

source

Continue Reading
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Tech

Anthropic releases Opus 4.8 with new ‘dynamic workflow’ tool

On Thursday, Anthropic released Opus 4.8, the newest version of its most advanced publicly available model. The model is available everywhere, with standard pricing at the same level as the previous Opus release.

The new model comes just 41 days after Opus 4.7 was released, a much faster upgrade cycle than normal for Anthropic. (The most recent Sonnet and Haiku models are three and seven months old, respectively.) The fast turnaround may have something to do with the chilly reception to Opus 4.7, which some users found disappointing

That interval has also seen significant new releases for OpenAI’s Codex and Google’s Gemini Flash model, increasing the pressure on Anthropic to keep pace.

Opus 4.8 comes with the expected best-in-class benchmark results, but there’s also particular attention to how the model manages bad or uncertain data. In the launch post, Anthropic’s early testers found that the new model is “more likely to flag uncertainties about its work and less likely to make unsupported claims.”

Echoing this point, a testimonial from Bridgewater associates said the biggest difference in the upgrade was “Opus 4.8’s tendency to proactively flag issues with the inputs and outputs of an analysis, something other models routinely missed and left to the users to catch.”

Together with the new model, Anthropic launched a feature called Dynamic Workflows, which will be available in research preview. The system is designed to help larger models like Opus manage complex tasks across hundreds of parallel subagents.

“Claude Code alongside Opus 4.8 can now carry out codebase-scale migrations across hundreds of thousands of lines of code from kickoff to merge, with the existing test suite as its bar,” the post explains.

Anthropic is still holding back its most advanced Mythos model after a tentative preview last month raised cybersecurity concerns. However, the company hinted in today’s Opus release that the Mythos preview period might soon end, once necessary safeguards are complete.

“We’re making swift progress on developing these safeguards and expect to be able to bring Mythos-class models to all our customers in the coming weeks,” the company wrote.

When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

source

Continue Reading

Tech

Corgi announces $106M raise at $2.6B valuation — double what it was worth 3 weeks ago

Insurance tech Corgi on Thursday announced a $106 million Series B1 raise, valuing the company at $2.6 billion, just three weeks after announcing a $160 million Series B at a $1.3 billion valuation and four months after its $108 million Series A. The company offers insurance, working specifically with startups in areas like tech, cyber, and general liability; it counts Deel and Artisan among its customers. 

Even in the current go-go dealmaking environment, that sequencing is remarkable. While startups raising back-to-back rounds at steep step-ups have become almost routine, a company whose valuation doubles in three weeks is unusual enough to raise questions, particularly given the investor set in both rounds is the same.

Asked what material event justified that kind of jump in such a short window, investor Kanyi Maqubela of Kindred Ventures cited the company’s momentum. It’s an explanation may satisfy some, but the practice more generally is starting to attract scrutiny in LP circles. “There’s growing distrust of internal markups,” said one LP who backs numerous venture funds and asked not to be named. Said this person of exit mechanisms specifically, “[I]f a company [is] just getting re-priced upward with no real liquidity event, LPs notice.”

The specific concern is that a fund that invests at one valuation, then marks it up three weeks later can make portfolio performance look stronger on paper than the underlying business may justify.

In this case, Maqubela suggested, that’s not an issue for Kindred’s limited partners, nor for Corgi’s other investors, which include Prime Capital, Leblon Capital, Alumni Ventures, and Y Combinator.

“LPs really like exits above all,” Maqubela said in a message to TechCrunch. “They discount the value of markups since those aren’t always reflective of reality.” He added that in this case, revenue growth rationalized the new round.

Founded in 2024 by Emily Yuan and Nico Laqua, Corgi says it’s building coverage for what it calls “newer categories” of risk while also addressing an often underserved market among legacy insurance carriers — startups and the unique liability problems they face, including those related to AI.

“Corgi covers anything from when an AI system causes financial loss, misinformation, operational failures, or compliance issues,” Laqua told TechCrunch. “Many legacy policies either exclude these risks or handle them ambiguously. 

Corgi is not alone in the insurtech market; Vouch, which is backed by Y Combinator, operates in a similar space.

When asked about the back-to-back rounds, Laqua said that insurance is a “highly capital-intensive industry,” and that “demand has accelerated quickly across new product lines and partnerships.” Building an AI-native platform compounds those costs further.  

“We’re best known for our business insurance products, but the additional capital will be used to expand into new insurance categories, scale the AI underwriting platform, grow embedded distribution partnerships, and continue growing our team,” Laqua said.

Corgi has now raised $378 million in total funding from its investors.

Correction: The title of this headline originally misstated the valuation due to an editing error.

When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

source

Continue Reading

Tech

Startup Battlefield 200 application deadline extended to June 8 after overwhelming demand

Founders, the battlefield is still open, but not for much longer.

After overwhelming demand from founders around the world, TechCrunch has extended the Startup Battlefield 200 application deadline to June 8. If you thought you missed your opportunity to pitch live on the Disrupt Stage in October at San Francisco’s Moscone West, this is your final chance to step into one of tech’s most competitive startup arenas.

Nominate a standout startup or submit your application before the deadline.

TechCrunch Startup Battlefield
Image Credits:Kimberly White / Getty Images

What is Startup Battlefield 200?

Startup Battlefield 200 is where ambitious early-stage startups go from unknown to impossible to ignore. Selected founders will take the spotlight at TechCrunch Disrupt 2026, pitching live in front of elite investors, influential media, and the global startup ecosystem. One startup will walk away with $100,000 in equity-free funding, but every company selected gains visibility that can reshape its trajectory.

More than 1,700 startups have participated in Startup Battlefield over the years. Together, they’ve raised more than $32 billion and produced over 250 exits, including acquisitions by companies like Microsoft, Google, Salesforce, Uber, and Amazon.

This is the same competition that helped launch companies like Dropbox, Discord, Mint, Fitbit, and Trello. More than 1,500 startups have competed in Startup Battlefield, and many have gone on to become category-defining businesses.

Why founders are still racing to apply

Competition for Startup Battlefield 200 has intensified as founders look for ways to stand out in a crowded fundraising environment. The extension gives more startups the opportunity to enter, but expectations are higher than ever.

Selected startups receive:

  • A free exhibit table for all three days of Disrupt.
  • Four complimentary Disrupt passes.
  • Branding and visibility inside the Disrupt event app.
  • Press exposure and lead-generation opportunities.
  • Access to founder-only masterclasses.
  • The opportunity to pitch live on the Disrupt Stage.
  • Direct feedback from leading venture capitalists.
  • A chance to win $100,000 in equity-free funding.
Salva Health Co-Founder & CEO Valentina Agudelo Vargas, winner of the Startup Battlefield 2024, poses onstage during TechCrunch Disrupt 2024 Day 3 at Moscone Center on October 30, 2024 in San Francisco.
Image Credits:Kimberly White / Getty Images

Who should apply

TechCrunch is looking for bold early-stage startups with a working MVP and a vision capable of disrupting an industry. Bootstrapped, pre-seed, and seed-stage startups are encouraged to apply. Select Series A startups in capital-intensive sectors may also qualify.

If you are building something category-changing, this is your chance to prove it on one of the biggest stages in tech.

The clock is still ticking

The deadline extension was driven by overwhelming demand, but the battlefield will not stay open forever. Thousands of startups are competing for a limited number of spots, and every application is reviewed closely by the TechCrunch team.

This is your opportunity to get in front of investors, customers, media, and future partners all in one place. Nominate or apply before June 8 and fight for your place among the next generation of breakout startups.

Startup Battlefield 200 2026
Image Credits:TechCrunch

When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

source

Continue Reading