Arjun Patel on Vector Databases and the Future of Semantic Search
Data Driven21 Tammi 2025

Arjun Patel on Vector Databases and the Future of Semantic Search

Today, we delve into the intriguing world of vector databases, retrieval augmented generation, and a surprising twist—origami.

Our special guest, Arjun Patel, a developer advocate at Pinecone, will be walking us through his mission to make vector databases and semantic search more accessible. Alongside his impressive technical expertise, Arjun is also a self-taught origami artist with a background in statistics from the University of Chicago. Together with co-host Frank La Vigne, we explore Arjun’s unique journey from making speech coaching accessible with AI at Speeko to detecting AI-generated content at Appen.

In this episode, get ready to unravel the mysteries of natural language processing, understand the impact of the attention mechanism in transformers, and discover how AI can even assist in the art of paper folding. From discussing the nuances of RAG systems to sharing personal insights on learning and technology, we promise a session that’s both enlightening and entertaining. So sit back, relax, and get ready to fold your way into the fascinating layers of AI with Arjun Patel on Data Driven.


Show Notes

00:00 Arjun Patel: Bridging AI & Education

04:39 Traditional NLP and Geometric Models

08:40 Co-occurrence and Meaning in Text

13:14 Masked Language Modeling Success

16:50 Understanding Tokenization in AI Models

18:12 "Understanding Large Language Models"

22:43 Instruction-Following vs Few-Shot Learning

26:43 "Rel AI: Open Source Data Tool"

31:14 "Retrieval-Augmented Generation Explained"

33:58 "Pinecone: Efficient Vector Database"

37:31 "AI Found Me: Intern to Innovator"

41:10 "Impact of Code Generation Models"

45:25 Personalized Learning Path Technology

46:57 Mathematical Complexity in Origami Design

50:32 "Data, AI, and Origami Insights"

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(300)

Token Economy and AI Agents: The Final Hurdles in Enterprise Deployment

Token Economy and AI Agents: The Final Hurdles in Enterprise Deployment

In this episode, Frank La Vigne sits down with his Red Hat colleague Christopher Newland for a deep dive into the evolving challenges and opportunities at the intersection of AI, open source, and ente...

15 Touko 42min

The Neuroresilient Leader: Staying Human in the Age of Artificial Intelligence

The Neuroresilient Leader: Staying Human in the Age of Artificial Intelligence

In this episode, hosts Frank La Vigne and Candace Gillhoolley sit down with Angus Nelson—author, podcaster, and expert on leadership and human potential in the age of AI. Together, they dive deep into...

4 Touko 56min

Governance, Architecture, and the Evolving Role of Data Engineers in the AI Age

Governance, Architecture, and the Evolving Role of Data Engineers in the AI Age

In this week’s show, Frank La Vigne sits down with data and analytics engineer Wasim Rana for a deep dive into the realities of building, managing, and securing data infrastructure in modern businesse...

27 Huhti 49min

Alternative Data and Its Impact on Modern Investing

Alternative Data and Its Impact on Modern Investing

Today, we journey into the fast-evolving world of prediction markets, KPI trading, and the new frontiers of retail finance. Joining us is Candace, alongside our guest from Benzinga, a fintech innovato...

21 Huhti 51min

Andy Boettcher on Why This is the Age of the Chaotic Brain

Andy Boettcher on Why This is the Age of the Chaotic Brain

this episode, Frank sits down with Andy Boettcher, the Chief Innovation Officer at DoubleTrack, for a candid conversation about embracing the power (and chaos!) of modern AI tools.Together, they dive ...

8 Huhti 52min

Synthetic Populations and the Future of Decision Intelligence

Synthetic Populations and the Future of Decision Intelligence

In this episode of Data Driven, Frank and Andy dive into the future of market intelligence with Dr. Jill Axline, co-founder and CEO of Mavera—a company building synthetic populations that simulate rea...

29 Tammi 50min

Microsoft Fabric Unpacked: AI, Data Sovereignty, and a Bit of Clippy Nostalgia

Microsoft Fabric Unpacked: AI, Data Sovereignty, and a Bit of Clippy Nostalgia

In today’s show, BAILeY, your semi-sentient hostess with the mostest metadata, teams up with Frank La Vigne to welcome the ever-insightful Andrew Brust for a deep dive into the evolving Microsoft data...

12 Tammi 54min

Celebrating 400 Episodes – How AI Turbocharges Coding, Podcasting, and Creativity

Celebrating 400 Episodes – How AI Turbocharges Coding, Podcasting, and Creativity

Welcome to a milestone episode of Data Driven! In episode 400, hosts BAILeY, Frank La Vigne, and Andy Leonard gather to reflect on nearly a decade at the forefront of podcasting about data, AI, and th...

8 Tammi 1h

Suosittua kategoriassa Tiede

rss-mita-tulisi-tietaa
rss-poliisin-mieli
tiedekulma-podcast
rss-tiedetta-vai-tarinaa
utelias-mieli
docemilia
sotataidon-ytimessa
filocast-filosofian-perusteet
menologeja-tutkimusmatka-vaihdevuosiin
rss-bios-podcast
rss-ranskaa-raakana
rss-duodecim-lehti
rss-duokkari-ekstra
rss-astetta-parempi-elama-podcast
rss-metsantuntijat-podcast
rss-ilmasto-kriisissa
rss-ylistys-elaimille
rss-sosiopodi