Category

AI Industry

Company news, funding, and the business of building AI.

11 Articles
Latest
Google Assembles Four-Partner Custom Chip Supply Chain for AI Inference

Google is building out a custom AI chip supply chain with four design partners — Broadcom, MediaTek, Marvell, and Intel — and a roadmap extending to TSMC's 2-nanometre process in late 2027, according to reporting from Bloomberg and The Next Web ahead of Google Cloud Next. The seventh-generation Ironwood TPU, Google's first inference-specific chip, is now generally available on Google Cloud, with Anthropic committing to up to one million TPUs. The next-generation split assigns Broadcom's "Sunfish" to training workloads and MediaTek's "Zebrafish" to inference, with Marvell in talks for a memory processing unit.

Apr 21 · 4 min read · Verified across 11 sources
AI Industry

Google Eyes New Chips to Speed Up AI Results, Challenging Nvidia

Google is developing custom AI chips aimed at accelerating inference — the process of generating responses from trained models — as part of a push to reduce dependence on Nvidia GPUs, according to Bloomberg. The report says Google is in talks with Marvell Technology to co-design new silicon, expanding beyond its longstanding Broadcom partnership on Tensor Processing Units. The move targets the inference segment of AI compute, where hyperscalers are increasingly looking for cost and latency advantages over general-purpose GPUs. Multiple outlets, including Benzinga and Business Today, have corroborated the Marvell discussions. Neither Google nor Marvell has issued an official statement.

Apr 21 · 2:00am
21 sources
AI Industry

Trump-Branded Fermi America Data Center Project Stalls as CEO Exits

Axios reports that Fermi America, the Texas Panhandle AI data center project co-founded by former Energy Secretary Rick Perry and bearing President Trump's name, is facing delays, cooling supply chain issues, and the lack of a publicly confirmed anchor tenant. CEO Toby Neugebauer departed Friday, sending shares lower after a 75% six-month decline.

Apr 20 · 9:00am
1 source
AI Industry

Google in Talks With Marvell for Custom AI Inference Chips, Diversifying Beyond Broadcom

Google is discussing a chip design partnership with Marvell Technology that would add a third custom silicon partner alongside Broadcom and MediaTek, according to The Information as reported by The Next Web. The talks cover a memory processing unit and an inference-optimised TPU, and follow Broadcom's through-2031 TPU agreement signed days earlier.

Apr 20 · 4:30am
2 sources
AI Industry

Truck-Deployable AI Data Centers Cut Build Times from Years to Months

Companies including Duos Edge AI and LG CNS are shipping pre-fabricated, GPU-packed compute pods that can be operational in roughly six months — a fraction of the two-to-three years a conventional data center requires. Each unit holds up to 576 Nvidia GPUs, can be networked with others on a concrete pad, and costs around $25 million for a five-megawatt deployment. The approach is drawing wider industry attention as AI hardware sits idle waiting for permanent facilities.

Apr 8 · 11:29am
1 source
AI Industry

Nvidia Unveils Groq 3 LPU at GTC, Signaling Shift to AI Inference Computing

At Nvidia's GTC conference in San Jose this week, CEO Jensen Huang announced the Groq 3 language processing unit (LPU), a chip built specifically for AI inference. The announcement came just two and a half months after Nvidia licensed intellectual property from startup Groq in a deal valued at $20 billion. The chip features an SRAM-based architecture delivering 150 terabytes per second of memory bandwidth, seven times that of Nvidia's own Rubin GPU, and will ship as part of a combined compute tray alongside Vera Rubin chips.

Apr 8 · 11:22am
2 sources
AI Industry

Meta Lays Out Its AI Stack: Custom Chips, Open Models, and 'Personal Superintelligence'

Meta has published a comprehensive overview of its current AI portfolio, highlighting four custom MTIA chips shipped in two years, a long-term hardware partnership with AMD, the Llama 4 open model family, and a new AI video feature called Vibes. The company frames its overarching mission as delivering 'personal superintelligence' to individuals — a deliberate contrast to competitors it accuses of seeking centralised control over AI output.

Apr 7 · 8:43am
1 source
AI Industry

GGML and llama.cpp Join Hugging Face to Secure the Future of Local AI

GGML and llama.cpp, the open-source libraries powering much of the local AI inference ecosystem, have joined Hugging Face in a move designed to ensure their long-term development and sustainability. The partnership brings two of the most widely used tools for running large language models on consumer hardware under the Hugging Face umbrella, signalling a consolidation of the open-source AI infrastructure stack.

Mar 26 · 6:23pm
2 sources
AI Industry

Mistral AI Launches Forge Platform and Mistral Small 4 in Triple Product Release

Mistral AI unveiled three significant releases on March 16–17, 2026: Forge, an enterprise platform for building proprietary AI models; Mistral Small 4, its latest compact research model; and a founding partnership with NVIDIA's Nemotron Coalition. The cluster of announcements signals Mistral's push to compete across enterprise deployment, open-source development, and hardware-level AI infrastructure simultaneously.

Mar 26 · 6:22pm
2 sources
AI Industry

Google's February 2026 AI Roundup: What the Company Announced This Month

Google published its monthly AI update summary for February 2026 via the Google AI Blog, consolidating product and research announcements from across the company. The post signals Google's continued practice of bundling AI news into digest-style releases, though the specific details of the announcements were not available in the source material at time of publication.

Mar 26 · 6:20pm
2 sources
AI Industry

Hugging Face Spring 2026 Report Signals Maturing Open-Source AI Ecosystem

Hugging Face has published its Spring 2026 State of Open Source report, offering a snapshot of activity across its platform. The report tracks model uploads, dataset growth, and community engagement trends that reflect the broader health of the open-source AI movement. As proprietary and open models continue to compete for developer mindshare, Hugging Face's platform data provides one of the clearest windows into where open-source AI development is heading.

Mar 26 · 6:20pm
2 sources
