How Azure AI Search powers RAG in ChatGPT and global scale apps

How Azure AI Search powers RAG in ChatGPT and global scale apps

Millions of people use Azure AI Search every day without knowing it. You can enable your apps with the same search that enables retrieval-augmented generation (RAG) capabilities when you build Custom GPTs or attach files in your ChatGPT prompts.

Pablo Castro, Microsoft CVP and Distinguished Engineer Azure AI Search, joins Jeremy Chapman to share how with Azure AI Search, you can create custom applications that retrieve the most relevant information quickly and accurately, even from billions of records.

Manage massive-scale datasets while maintaining high-quality search results with ultra-compact, binary quantized vector search indexes that use Matryoshka Representation Learning (MRL) and oversampling to equal the search accuracy of vector indexes up to 96 times larger. These approaches drive significant cost savings by optimizing your vector indexes without compromising quality.

► QUICK LINKS:
00:00 - RAG powered by Azure AI Search
00:50 - Azure AI Search role in ChatGPT
02:01
- Azure AI Search use case - AT&T
03:27 - Start in Azure Portal
04:35 - Massive scale and vector index
06:08 - Scalar & Binary Quantization
07:21 - Martyoshka technique
09:07 - Oversampling
11:31
- How to build an app using Azure AI Search
13:00 - See it in action
14:28 - Enable binary quantization with oversampling
14:54
- Wrap up

► Link References

Get sample code on GitHub at https://aka.ms/SearchQuantizationSample

Check out search solutions at https://aka.ms/AzureAISearch

► Unfamiliar with Microsoft Mechanics?

As Microsoft's official video series for IT, you can watch and share valuable content and demos of current and upcoming tech from the people who build it at Microsoft.

• Subscribe to our YouTube: https://www.youtube.com/c/MicrosoftMechanicsSeries

• Talk with other IT Pros, join us on the Microsoft Tech Community: https://techcommunity.microsoft.com/t5/microsoft-mechanics-blog/bg-p/MicrosoftMechanicsBlog

• Watch or listen from anywhere, subscribe to our podcast: https://microsoftmechanics.libsyn.com/podcast

► Keep getting this insider knowledge, join us on social:

• Follow us on Twitter: https://twitter.com/MSFTMechanics

• Share knowledge on LinkedIn: https://www.linkedin.com/company/microsoft-mechanics/

• Enjoy us on Instagram: https://www.instagram.com/msftmechanics/

• Loosen up with us on TikTok: https://www.tiktok.com/@msftmechanics

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(100)

Agent 365 | Identity & Access Controls in Entra

Agent 365 | Identity & Access Controls in Entra

Take control of every AI agent, managed or not, running in your environment using Agent 365 and Microsoft Entra. Surface agents across AWS Bedrock, Google Vertex, Databricks, and Salesforce in one reg...

9 Jun 8min

Introducing Azure HorizonDB - PostgreSQL

Introducing Azure HorizonDB - PostgreSQL

Run enterprise Postgres workloads on Azure HorizonDB with around 3x the throughput of self-managed deployments — zone-resilient by default, no architectural trade-offs. Call AI models directly from SQ...

3 Jun 13min

Agent 365 | Security Operations in Defender

Agent 365 | Security Operations in Defender

Surface every AI agent in your tenant and expose the ones throwing security signals — across both the IT and SOC view. Triage high-severity alerts as IT in the Microsoft 365 admin center, then pivot i...

29 Mai 7min

Microsoft Entra Tenant Governance | Find Configuration Drift

Microsoft Entra Tenant Governance | Find Configuration Drift

Ensure your tenant configuration doesn't drift from defined security and compliance requirements with Microsoft Entra Tenant Governance. Capture configuration as code across 200+ resource types in Ent...

27 Mai 8min

Automate evaluations | Microsoft Foundry

Automate evaluations | Microsoft Foundry

Build AI agents that meet your standards for quality, safety, and performance using Microsoft Foundry. Trace every run end-to-end, generate synthetic datasets to stress-test on demand, fire automated ...

21 Mai 9min

Microsoft Excel Beginners Tutorial (2026)

Microsoft Excel Beginners Tutorial (2026)

This is the Microsoft Excel guide and tutorial for beginners. If you're new to and getting started with Excel or coming from another app, in this video we teach the basics of Excel, the user interface...

18 Mai 12min

Work IQ | Data, Context, Skills & Tools for Copilot and Your Agents

Work IQ | Data, Context, Skills & Tools for Copilot and Your Agents

Ground every Microsoft 365 Copilot response in your real work data. Pull context from SharePoint, OneDrive, Teams, email, and meetings — all through Work IQ. Draft Word documents that carry your exist...

13 Mai 9min

Azure Arc | On-prem + Multi-cloud Management

Azure Arc | On-prem + Multi-cloud Management

Managing Servers, and Kubernetes across on-prem, and multiple clouds, can quickly become complex, especially when you're juggling multiple tools. In this video, we explore how Azure Arc simplifies hyb...

8 Mai 14min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
aftenpodden-usa
fotballpodden-2
forklart
popradet
stopp-verden
det-store-bildet
lydartikler-fra-aftenposten
rss-gukild-johaug
nokon-ma-ga
dine-penger-pengeradet
hanna-de-heldige
rss-espen-lee-usensurert
rss-ness
aftenbla-bla
rss-utenrikskomiteen-med-bogen-og-grasvik
frokostshowet-pa-p5
e24-podden
rss-penger-polser-og-politikk