#516: Accelerating Python Data Science at NVIDIA

#516: Accelerating Python Data Science at NVIDIA

Python’s data stack is getting a serious GPU turbo boost. In this episode, Ben Zaitlen from NVIDIA joins us to unpack RAPIDS, the open source toolkit that lets pandas, scikit-learn, Spark, Polars, and even NetworkX execute on GPUs. We trace the project’s origin and why NVIDIA built it in the open, then dig into the pieces that matter in practice: cuDF for DataFrames, cuML for ML, cuGraph for graphs, cuXfilter for dashboards, and friends like cuSpatial and cuSignal. We talk real speedups, how the pandas accelerator works without a rewrite, and what becomes possible when jobs that used to take hours finish in minutes. You’ll hear strategies for datasets bigger than GPU memory, scaling out with Dask or Ray, Spark acceleration, and the growing role of vector search with cuVS for AI workloads. If you know the CPU tools, this is your on- ramp to the same APIs at GPU speed.

Episoder(541)

#494: Update on Flet: Python + Flutter UIs

#494: Update on Flet: Python + Flutter UIs

As Python developers, we're incredibly lucky to have over half a million packages that we can use to build our applications with over at PyPI. However, when it comes to choosing a UI framework, the op...

21 Feb 20251h

#493: Quarto: Open-source technical publishing

#493: Quarto: Open-source technical publishing

In this episode, I'm joined by JJ Allaire, founder and executive chairman at Posit, and Carlos Scheidegger, a software engineer at Posit, to explore Quarto, an open-source tool revolutionizing technic...

9 Feb 20251h 5min

#492: Great Tables

#492: Great Tables

Join me as I chat with Rich Iannone and Michael Chow from Posit where we explore the transformative power of data tables with the Great Tables library. We'll cover practical applications of Great Tabl...

30 Jan 20251h 4min

#491: DuckDB and Python: Ducks and Snakes living together

#491: DuckDB and Python: Ducks and Snakes living together

Join me for an insightful conversation with Alex Monahan, who works on documentation, tutorials, and training at DuckDB Labs. We explore why DuckDB is gaining momentum among Python and data enthusiast...

27 Des 20241h 2min

#490: Django Ninja

#490: Django Ninja

If you're a Django developer, I'm sure you've heard so many people raving about FastAPI and Pydantic. But you really love Django and don't want to switch. Then you might want to give Django Ninja a se...

24 Des 20241h 4min

#489: Anaconda Toolbox for Excel and more with Peter Wang

#489: Anaconda Toolbox for Excel and more with Peter Wang

Peter Wang has been pushing Python forward since the early days of its data science roots. We're lucky to have him back on the show. We're going to talk about the Anaconda Toolbox for Excel as well as...

20 Des 20241h 9min

#488: Multimodal data with LanceDB

#488: Multimodal data with LanceDB

LanceDB is a developer-friendly, open source database for AI. It's used by well-known companies such as Midjourney and Character.ai. We have Chang She, the CEO and cofounder of LanceDB on to give us a...

12 Des 202458min

#487: Building Rust Extensions for Python

#487: Building Rust Extensions for Python

There has been a lot of changes in the low-level Python space these days. The biggest has to be how many projects have rewritten core performance-intensive sections in Rust. Or even the wholesale adop...

1 Des 20241h 2min

Populært innen Teknologi

lydartikler-fra-aftenposten
romkapsel
tomprat-med-gunnar-tjomlid
teknisk-sett
energi-og-klima
shifter
nasjonal-sikkerhetsmyndighet-nsm
smart-forklart
rss-ki-praten
rss-impressions-2
pedagogisk-intelligens
rss-ai-forklart
hans-petter-og-co
elektropodden
rss-polypod
rss-bits-and-bytes-for-advokater
rss-alt-som-gar-pa-strom
rss-for-alarmen-gar
rss-snakk-om-sikkerhet
rss-praktisk-proptech