Apple Selects Google Gemini as AI Model Provider for Next-Gen Siri
AI Daily, 13 Jan

Description: Apple announces a multi-year partnership with Google, worth approximately $1 billion annually, to power the next generation of Siri using Gemini's 1.2-trillion-parameter model. We break down the three-component architecture (Query Planner, Knowledge Search, Summarizer), how Apple's Private Cloud Compute maintains privacy guarantees while running Google's model, the implications for OpenAI's ChatGPT integration, and what platform engineers should learn from Apple's "infrastructure first" approach to AI.

Episode URL: /aidaily/00015

Summary:
- Apple partners with Google to use Gemini (1.2T parameters) for Siri, 8x larger than current Apple Intelligence model
- Three-component architecture: Query Planner (Gemini), Knowledge Search (on-device), Summarizer (Gemini)
- Privacy preserved via Private Cloud Compute (PCC): Gemini runs on Apple's servers with no persistent storage
- Deal is non-exclusive (~$1B/year), allowing Apple to add other providers like Anthropic later
- Tim Cook mentioned continued interest in Model Context Protocol (MCP) for AI-app integration
- Key lesson: Build the infrastructure layer first, then plug in best-in-class models
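
The three-component split and the "infrastructure first" lesson in the summary can be illustrated with a small sketch. Everything named here (ModelProvider, EchoProvider, LocalIndex, Assistant) is hypothetical and invented for illustration; it is not Apple's or Google's API, and the stand-in provider only echoes its prompt so the example runs offline. The point it tries to show is that the planner and summarizer sit behind one interface, so a different model could be plugged in without touching the on-device search step.

```python
from dataclasses import dataclass
from typing import Protocol


class ModelProvider(Protocol):
    """Hypothetical interface that any hosted model could implement."""

    def complete(self, prompt: str) -> str: ...


@dataclass
class EchoProvider:
    """Stand-in for a real model: tags the prompt with the provider name."""

    name: str

    def complete(self, prompt: str) -> str:
        return f"[{self.name}] {prompt}"


class LocalIndex:
    """Toy on-device knowledge search: keyword overlap with local notes."""

    def __init__(self, notes: list[str]) -> None:
        self.notes = notes

    def search(self, query: str) -> list[str]:
        words = set(query.lower().split())
        return [n for n in self.notes if words & set(n.lower().split())]


@dataclass
class Assistant:
    """Wires the three components; the providers are swappable."""

    planner: ModelProvider      # Query Planner (remote model)
    index: LocalIndex           # Knowledge Search (on-device)
    summarizer: ModelProvider   # Summarizer (remote model)

    def answer(self, query: str) -> str:
        plan = self.planner.complete(f"plan: {query}")
        hits = self.index.search(query)
        context = "; ".join(hits)
        return self.summarizer.complete(f"summarize: {plan} | {context}")


if __name__ == "__main__":
    index = LocalIndex(["dentist appointment friday", "buy milk"])
    first = Assistant(EchoProvider("model-a"), index, EchoProvider("model-a"))
    print(first.answer("when is my dentist appointment"))

    # Swapping the provider changes only the constructor arguments.
    second = Assistant(EchoProvider("model-b"), index, EchoProvider("model-b"))
    print(second.answer("when is my dentist appointment"))
```

In this sketch the local index never leaves the Assistant object, which loosely mirrors the summary's claim that knowledge search stays on-device while planning and summarizing go to the hosted model.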

Duration: 10:58
Speakers: Jordan, Alex
Target Audience: Platform Engineers, SREs, DevOps Engineers, AI/ML Engineers

Episodes (70)

Anthropic Blocks Third-Party Claude Code Tools: The $200 vs $1,000 Arbitrage Explained

On January 9, 2026, thousands of developers woke up to find their AI coding workflows completely broken. Anthropic blocked third-party CLI wrappers like OpenCode without warning - and the economics be...

10 Jan 23min

ChatGPT Health & FlashAttention in Your Browser: llama.cpp WebGPU Deep Dive

Today's deep dive: llama.cpp brings FlashAttention to WebGPU, enabling datacenter-grade LLM inference in your browser. In this 16-minute episode of AI Daily, Jordan and Alex break down how the llama.c...

9 Jan 16min

SpikySpace: Neuromorphic AI for Ultra-Efficient Time Series Forecasting

Today's deep dive: SpikySpace combines Spiking Neural Networks with State-Space Models to achieve 98% energy reduction for time series forecasting on neuromorphic hardware. In this 21-minute episode o...

8 Jan 21min

Failure-Driven Fine-Tuning: How Logics-STEM Patches LLM Reasoning Gaps

Today's deep dive: Logics-STEM shows how to debug and patch your fine-tuned models like software. In this 19-minute episode of AI Daily, Jordan and Alex break down a new approach to LLM fine-tuning th...

7 Jan 19min

Architecture Beats Model Scale: JourneyBench Proves Smaller LLMs Can Outperform GPT-4

A smaller model with smart architecture just beat GPT-4 using a massive static prompt. Here's why that changes eve...

6 Jan 18min

Vector Search Gets Smarter: Milvus 2.6.8 Deep Dive

Milvus 2.6.8 drops with search highlighting for RAG explainability, smarter query optimization, and enterprise-grade fixes. Here's what you need to know. In this 15-minute episode of AI Daily, Jordan ...

5 Jan 17min
