Arjun Patel on Vector Databases and the Future of Semantic Search
Data Driven21 Jan 2025

Arjun Patel on Vector Databases and the Future of Semantic Search

Today, we delve into the intriguing world of vector databases, retrieval augmented generation, and a surprising twist—origami.

Our special guest, Arjun Patel, a developer advocate at Pinecone, will be walking us through his mission to make vector databases and semantic search more accessible. Alongside his impressive technical expertise, Arjun is also a self-taught origami artist with a background in statistics from the University of Chicago. Together with co-host Frank La Vigne, we explore Arjun’s unique journey from making speech coaching accessible with AI at Speeko to detecting AI-generated content at Appen.

In this episode, get ready to unravel the mysteries of natural language processing, understand the impact of the attention mechanism in transformers, and discover how AI can even assist in the art of paper folding. From discussing the nuances of RAG systems to sharing personal insights on learning and technology, we promise a session that’s both enlightening and entertaining. So sit back, relax, and get ready to fold your way into the fascinating layers of AI with Arjun Patel on Data Driven.


Show Notes

00:00 Arjun Patel: Bridging AI & Education

04:39 Traditional NLP and Geometric Models

08:40 Co-occurrence and Meaning in Text

13:14 Masked Language Modeling Success

16:50 Understanding Tokenization in AI Models

18:12 "Understanding Large Language Models"

22:43 Instruction-Following vs Few-Shot Learning

26:43 "Rel AI: Open Source Data Tool"

31:14 "Retrieval-Augmented Generation Explained"

33:58 "Pinecone: Efficient Vector Database"

37:31 "AI Found Me: Intern to Innovator"

41:10 "Impact of Code Generation Models"

45:25 Personalized Learning Path Technology

46:57 Mathematical Complexity in Origami Design

50:32 "Data, AI, and Origami Insights"

Episoder(300)

The Fast-Moving Train of AI - Sovereignty, Acceleration, & Lessons from History

The Fast-Moving Train of AI - Sovereignty, Acceleration, & Lessons from History

On this episode of Data Driven, hosts Frank La Vigne and Leonard celebrate a major milestone: the 30th anniversary of Franksworld.com, one of the OGs of tech blogging that’s survived multiple browser ...

13 Okt 20251h 15min

Compute, Carbon, and Cashflow Silicon Data’s Big Bet on GPU Markets

Compute, Carbon, and Cashflow Silicon Data’s Big Bet on GPU Markets

Welcome to another episode of Data Driven, where we dive deep into how data and AI are shaping—sometimes shaking—the modern world. In this episode, hosts Frank La Vigne, Andy Leonard, and Carmen Li si...

1 Okt 202550min

Why Simulating Reality Is the Key to Advancing Artificial Intelligence

Why Simulating Reality Is the Key to Advancing Artificial Intelligence

In this episode, we're joined once again by Christopher Nuland, technical marketing manager at Red Hat, whose globe-trotting schedule rivals the complexity of a Kubernetes deployment. Christopher sits...

25 Sep 202553min

Dr Ido Zamberg on The Role of AI in Modern Healthcare Delivery From Databases to Defibrillators

Dr Ido Zamberg on The Role of AI in Modern Healthcare Delivery From Databases to Defibrillators

Welcome to another episode of Data Driven! Today, hosts Frank La Vigne and Andy Leonard, are joined by Dr. Ido Zamberg—a rare breed who’s equally comfortable rebooting servers and saving lives. Dr. Za...

25 Aug 202552min

Thanos Diakakis on Surviving the Software Apocalypse – AI, Agile, and Good Engineering

Thanos Diakakis on Surviving the Software Apocalypse – AI, Agile, and Good Engineering

On this episode of Data Driven, we venture into the ever-shifting landscape of software engineering, AI-assisted coding, and the sometimes chaotic future of development teams with special guest Thanos...

20 Aug 202558min

Dr Mike Orkin on Blackjack, Lightning, and Apophenia: The Surprising Psychology of Probability

Dr Mike Orkin on Blackjack, Lightning, and Apophenia: The Surprising Psychology of Probability

On this episode of Data Driven, we’re shuffling up some probability, statistics, and a bit of Las Vegas magic with Dr. Michael Orkin—a renowned statistician, data scientist, and former advisor to casi...

12 Aug 20251h 8min

From Cold War to Code Wars: Unpacking America’s Bold AI Strategy

From Cold War to Code Wars: Unpacking America’s Bold AI Strategy

Welcome to another episode of Data Driven, where we delve deep into the crossroads of data, technology, and the ever-shifting world of geopolitics. In this packed episode, hosts Frank La Vigne and Bai...

30 Jul 20251h 5min

Dr Alan Bekker on Multimodal Avatars, Education, and Authentic Digital Connections

Dr Alan Bekker on Multimodal Avatars, Education, and Authentic Digital Connections

In today’s conversation, hosts BAILeY and Frank La Vigne sit down with Dr. Alan Becker, co-founder and CEO of E Self AI and former co-founder of Voca AI, which was acquired by Snap in 2020. Dr. Becker...

23 Jul 202557min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
jss
rekommandert
forskningno
sinnsyn
tomprat-med-gunnar-tjomlid
villmarksliv
rss-paradigmepodden
rss-nysgjerrige-norge
liberal-halvtime
nevropodden
fjellsportpodden
kvinnehelsepodden
diagnose
tidlose-historier
rss-inn-til-kjernen-med-sunniva-rose
psykopoden
nordnorsk-historie
rss-hoyt-lavt-med-ida-tonseth