Arjun Patel on Vector Databases and the Future of Semantic Search
Data Driven, 21 January 2025

Today, we delve into the world of vector databases, retrieval-augmented generation, and a surprising twist: origami.

Our special guest, Arjun Patel, a developer advocate at Pinecone, walks us through his mission to make vector databases and semantic search more accessible. Arjun has a background in statistics from the University of Chicago and is also a self-taught origami artist. Together with co-host Frank La Vigne, we explore Arjun’s journey from making speech coaching accessible with AI at Speeko to detecting AI-generated content at Appen.

In this episode, get ready to unravel the mysteries of natural language processing, understand the impact of the attention mechanism in transformers, and discover how AI can even assist in the art of paper folding. From discussing the nuances of RAG systems to sharing personal insights on learning and technology, we promise a session that’s both enlightening and entertaining. So sit back, relax, and get ready to fold your way into the fascinating layers of AI with Arjun Patel on Data Driven.


Show Notes

00:00 Arjun Patel: Bridging AI & Education

04:39 Traditional NLP and Geometric Models

08:40 Co-occurrence and Meaning in Text

13:14 Masked Language Modeling Success

16:50 Understanding Tokenization in AI Models

18:12 Understanding Large Language Models

22:43 Instruction-Following vs Few-Shot Learning

26:43 Rel AI: Open Source Data Tool

31:14 Retrieval-Augmented Generation Explained

33:58 Pinecone: Efficient Vector Database

37:31 AI Found Me: Intern to Innovator

41:10 Impact of Code Generation Models

45:25 Personalized Learning Path Technology

46:57 Mathematical Complexity in Origami Design

50:32 Data, AI, and Origami Insights
