177: Vector Databases

177: Vector Databases

Intro topic: Buying a Car

News/Links:

Book of the Show


Patreon Plug https://www.patreon.com/programmingthrowdown?ty=h


Tool of the Show

Topic: Vector Databases (~54 min)

  • How computers represent data traditionally
    • ASCII values
    • RGB values
  • How traditional compression works
    • Huffman encoding (tree structure)
    • Lossy example: Fourier Transform & store coefficients
  • How embeddings are computed
    • Pairwise (contrastive) methods
    • Forward models (self-supervised)
  • Similarity metrics
  • Approximate Nearest Neighbors (ANN)
  • Sub-Linear ANN
    • Clustering
    • Space Partitioning (e.g. K-D Trees)
  • What a vector database does
    • Perform nearest-neighbors with many different similarity metrics
    • Store the vectors and the data structures to support sub-linear ANN
    • Handle updates, deletes, rebalancing/reclustering, backups/restores
  • Examples
    • pgvector: a vector-database plugin for postgres
    • Weaviate, Pinecone
    • Milvus

★ Support this podcast on Patreon ★

Jaksot(185)

Unit Testing

Unit Testing

This show covers unit testing, a way to put your code through the ringer before you show it to your users. Tools of the show: JsFiddle and towel.blinkenlights.nl. Books of the show: The Lean Startup (Kindle: http://amzn.to/157xbEl ), (Hardcover: http://amzn.to/12HwaDp) and Ender’s Game (Kindle: http://amzn.to/VcfVtD), (Paperback: http://amzn.to/Wg32hx) ★ Support this podcast on Patreon ★

22 Helmi 20131h 14min

JVM Languages (Jython, Clojure, Scala)

JVM Languages (Jython, Clojure, Scala)

This show covers some JVM languages, languages that are built on top of Java. Tools of the show: Ripple Emulator and Battle of the Bulge. Books of the show: Reinforcement Learning: An Introduction http://amzn.to/X6DpwS and Ready Player One http://amzn.to/Yg3zzP ★ Support this podcast on Patreon ★

29 Tammi 20131h

Hadoop

Hadoop

This show covers Hadoop, a set of several languages and libraries for working with big data. Tools of the show: Emacs and Chrome Browser Sync. Books of the show: Hadoop: The Definitive Guide http://tinyurl.com/cp3mw32 and Anathem http://tinyurl.com/cas8bux ★ Support this podcast on Patreon ★

26 Joulu 20121h 7min

LaTeX

LaTeX

This show covers LaTeX, a digital typesetting language. Tools of the show: Cocos2d-X and Snapseed. Books of the show: Algorithms in C++ http://tinyurl.com/agbc8t7 and Head First Design Patterns http://tinyurl.com/ayxb7q6 ★ Support this podcast on Patreon ★

9 Joulu 20121h 6min

Mailbag

Mailbag

This show covers some of the top questions asked by the fans. ★ Support this podcast on Patreon ★

19 Marras 20121h 20min

Query Languages

Query Languages

This show covers Query Languages, programming languages designed at accessing data from databases. Tools of the biweek: Pixen and Downcast ★ Support this podcast on Patreon ★

28 Loka 20121h 27min

BASIC

BASIC

This show covers Beginner's All-purpose Symbolic Instruction Code (BASIC), a starter language for many programmers. Tools of the biweek: Dungeon Crawl Stone Soup and Wireshark ★ Support this podcast on Patreon ★

22 Syys 201252min

Go

Go

This show covers Go, an interpreted, low-level language that has native coroutine support. News:100000 DPI printer, Joyent Cloud ending lifetime support, Apple buys fingerprint company, Twitter caps 3rd party apps. ★ Support this podcast on Patreon ★

6 Syys 20121h 12min

Suosittua kategoriassa Politiikka ja uutiset

rss-ootsa-kuullut-tasta
aikalisa
tervo-halme
ootsa-kuullut-tasta-2
politiikan-puskaradio
otetaan-yhdet
rss-podme-livebox
rss-kiina-ilmiot
viisupodi
rss-polikulaari-humanisti-vastaa-ja-muut-ts-podcastit
et-sa-noin-voi-sanoo-esittaa
rss-vaalirankkurit-podcast
rss-raha-talous-ja-politiikka
aihe
linda-maria
rss-kovin-paikka
rss-suomen-lehdiston-podcast
rss-kaikki-uusiksi
rss-merja-mahkan-rahat
rss-50100-podcast