Gradient Dissent: Conversations on AI29 Okt 2020

Ines & Sofie — Building Industrial-Strength NLP Pipelines

Sofie and Ines walk us through how the new spaCy library helps build end to end SOTA natural language processing workflows.

Ines Montani is the co-founder of Explosion AI, a digital studio specializing in tools for AI technology. She's a core developer of spaCy, one of the leading open-source libraries for Natural Language Processing in Python and Prodigy, a new data annotation tool powered by active learning. Before founding Explosion AI, she was a freelance front-end developer and strategist.

https://twitter.com/_inesmontani

Sofie Van Landeghem is a Natural Language Processing and Machine Learning engineer at Explosion.ai. She is a Software Engineer at heart, with an absurd love for quality assurance and testing, introducing proper levels of abstraction, and ensuring code robustness and modularity.

She has more than 12 years of experience in Natural Language Processing and Machine Learning, including in the pharmaceutical industry and the food industry.

https://twitter.com/oxykodit

https://spacy.io/

https://prodi.gy/

https://thinc.ai/

https://explosion.ai/

Topics covered:

0:00 Sneak peek

0:35 intro

2:29 How spaCy was started

6:11 Business model, open source

9:55 What was spaCy designed to solve?

12:23 advances in NLP and modern practices in industry

17:19 what differentiates spaCy from a more research focused NLP library?

19:28 Multi-lingual/domain specific support

23:52 spaCy V3 configuration

28:16 Thoughts on Python, Syphon, other programming languages for ML

33:45 Making things clear and reproducible

37:30 prodigy and getting good training data

44:09 most underrated aspect of ML

51:00 hardest part of putting models into production

Visit our podcasts homepage for transcripts and more episodes!

www.wandb.com/podcast

Get our podcast on Apple, Spotify, and Google!

Apple Podcasts: bit.ly/2WdrUvI

Spotify: bit.ly/2SqtadF

Google:tiny.cc/GD_Google

We started Weights and Biases to build tools for Machine Learning practitioners because we care a lot about the impact that Machine Learning can have in the world and we love working in the trenches with the people building these models. One of the most fun things about these building tools has been the conversations with these ML practitioners and learning about the interesting things they’re working on. This process has been so fun that we wanted to open it up to the world in the form of our new podcast called Gradient Dissent. We hope you have as much fun listening to it as we had making it!

Join our bi-weekly virtual salon and listen to industry leaders and researchers in machine learning share their research:

tiny.cc/wb-salon

Join our community of ML practitioners where we host AMA's, share interesting projects and meet other people working in Deep Learning:

bit.ly/wb-slack

Our gallery features curated machine learning reports by researchers exploring deep learning techniques, Kagglers showcasing winning models, and industry leaders sharing best practices.

app.wandb.ai/gallery

Oppdag Premium

Prøv 14 dager gratis

Kjøp Premium

Episoder(136)

Adrien Treuille — Building Blazingly Fast Tools That People Love

Adrien shares his journey from making games that advance science (Eterna, Foldit) to creating a Streamlit, an open-source app framework enabling ML/Data practitioners to easily build powerful and inte...

4 Des 202045min

Peter Norvig – Singularity Is in the Eye of the Beholder

We're thrilled to have Peter Norvig join us to talk about the evolution of deep learning, his industry-defining book, his work at Google, and what he thinks the future holds for machine learning resea...

20 Nov 202047min

Robert Nishihara — The State of Distributed Computing in ML

The story of Ray and what lead Robert to go from reinforcement learning researcher to creating open-source tools for machine learning and beyondRobert is currently working on Ray, a high-performance d...

13 Nov 202035min

Daeil Kim — The Unreasonable Effectiveness of Synthetic Data

Supercharging computer vision model performance by generating years of training data in minutes.Daeil Kim is the co-founder and CEO of AI.Reverie(https://aireverie.com/), a startup that specializes in...

15 Okt 202037min

Joaquin Candela — Definitions of Fairness

Joaquin chats about scaling and democratizing AI at Facebook, while understanding fairness and algorithmic bias.---Joaquin Quiñonero Candela is Distinguished Tech Lead for Responsible AI at Facebook, ...

1 Okt 20201h 19min

Richard Socher — The Challenges of Making ML Work in the Real World

Richard Socher, ex-Chief Scientist at Salesforce, joins us to talk about The AI Economist, NLP protein generation and biggest challenge in making ML work in the real world.Richard Socher was the Chief...

29 Sep 202050min

Zack Chase Lipton — The Medical Machine Learning Landscape

How Zack went from being a musician to professor, how medical applications of Machine Learning are developing, and the challenges of counteracting bias in real world applications.Zachary Chase Lipton ...

17 Sep 202059min

Reklamefrie Premium-podkaster

Hør populære podkaster som Storefri med Mikkel og Herman, Ida med hjertet i hånden, Krimpodden og mye mye mer

Skap din egen podkastboble

I appen skaper du ditt eget bibliotek med favoritter, og vi gir deg også anbefalinger til podkaster du ikke kan gå glipp av.

Prøv 14 dager gratis

Dersom du er ny Podme-bruker får du 14 dager gratis prøveperiode når du oppretter abonnement

Premium

99 kr/ måned

Tilgang til alle våre Premium-podkaster
Alle podkaster fra VG, Aftenposten, BT og SA
Reklamefritt Premium-innhold
Ingen bindingstid. Avslutt når du ønsker

Prøv 14 dager gratis

Premium

129 kr/ måned

Tilgang til alle Premium-podkaster
Alle podkaster fra VG, Aftenposten, BT og SA
Reklamefritt Premium-innhold
Ingen bindingstid. Avslutt når du ønsker
En Ekstra bruker

Prøv 14 dager gratis

Populært innen Business og økonomi

tid-er-penger-en-podcast-med-peter-warren

Historiene og stemmene du vil høre

Ubegrenset tilgang til alle dine favorittpodkaster og lydbøker

Les mer