Ines & Sofie — Building Industrial-Strength NLP Pipelines

Ines & Sofie — Building Industrial-Strength NLP Pipelines

Sofie and Ines walk us through how the new spaCy library helps build end to end SOTA natural language processing workflows.

Ines Montani is the co-founder of Explosion AI, a digital studio specializing in tools for AI technology. She's a core developer of spaCy, one of the leading open-source libraries for Natural Language Processing in Python and Prodigy, a new data annotation tool powered by active learning. Before founding Explosion AI, she was a freelance front-end developer and strategist.

https://twitter.com/_inesmontani


Sofie Van Landeghem is a Natural Language Processing and Machine Learning engineer at Explosion.ai. She is a Software Engineer at heart, with an absurd love for quality assurance and testing, introducing proper levels of abstraction, and ensuring code robustness and modularity.


She has more than 12 years of experience in Natural Language Processing and Machine Learning, including in the pharmaceutical industry and the food industry.

https://twitter.com/oxykodit


https://spacy.io/

https://prodi.gy/

https://thinc.ai/

https://explosion.ai/


Topics covered:

0:00 Sneak peek

0:35 intro

2:29 How spaCy was started

6:11 Business model, open source

9:55 What was spaCy designed to solve?

12:23 advances in NLP and modern practices in industry

17:19 what differentiates spaCy from a more research focused NLP library?

19:28 Multi-lingual/domain specific support

23:52 spaCy V3 configuration

28:16 Thoughts on Python, Syphon, other programming languages for ML

33:45 Making things clear and reproducible

37:30 prodigy and getting good training data

44:09 most underrated aspect of ML

51:00 hardest part of putting models into production


Visit our podcasts homepage for transcripts and more episodes!

www.wandb.com/podcast


Get our podcast on Apple, Spotify, and Google!

Apple Podcasts: bit.ly/2WdrUvI

Spotify: bit.ly/2SqtadF

Google:tiny.cc/GD_Google


We started Weights and Biases to build tools for Machine Learning practitioners because we care a lot about the impact that Machine Learning can have in the world and we love working in the trenches with the people building these models. One of the most fun things about these building tools has been the conversations with these ML practitioners and learning about the interesting things they’re working on. This process has been so fun that we wanted to open it up to the world in the form of our new podcast called Gradient Dissent. We hope you have as much fun listening to it as we had making it!


Join our bi-weekly virtual salon and listen to industry leaders and researchers in machine learning share their research:

tiny.cc/wb-salon


Join our community of ML practitioners where we host AMA's, share interesting projects and meet other people working in Deep Learning:

bit.ly/wb-slack


Our gallery features curated machine learning reports by researchers exploring deep learning techniques, Kagglers showcasing winning models, and industry leaders sharing best practices.

app.wandb.ai/gallery

Episoder(136)

Pete Warden — Practical Applications of TinyML

Pete Warden — Practical Applications of TinyML

Pete is the Technical Lead of the TensorFlow Micro team, which works on deep learning for mobile and embedded devices.Lukas and Pete talk about hacking a Raspberry Pi to run AlexNet, the power and siz...

21 Okt 202153min

Pieter Abbeel — Robotics, Startups, and Robotics Startups

Pieter Abbeel — Robotics, Startups, and Robotics Startups

Pieter is the Chief Scientist and Co-founder at Covariant, where his team is building universal AI for robotic manipulation. Pieter also hosts The Robot Brains Podcast, in which he explores how far hu...

7 Okt 202157min

Chris Albon — ML Models and Infrastructure at Wikimedia

Chris Albon — ML Models and Infrastructure at Wikimedia

In this episode we're joined by Chris Albon, Director of Machine Learning at the Wikimedia Foundation.Lukas and Chris talk about Wikimedia's approach to content moderation, what it's like to work in a...

23 Sep 202156min

Emily M. Bender — Language Models and Linguistics

Emily M. Bender — Language Models and Linguistics

In this episode, Emily and Lukas dive into the problems with bigger and bigger language models, the difference between form and meaning, the limits of benchmarks, and why it's important to name the la...

9 Sep 20211h 12min

Jeff Hammerbacher — From data science to biomedicine

Jeff Hammerbacher — From data science to biomedicine

Jeff talks about building Facebook's early data team, founding Cloudera, and transitioning into biomedicine with Hammer Lab and Related Sciences.(Read more: http://wandb.me/gd-jeff-hammerbacher)---Jef...

26 Aug 202156min

Josh Bloom — The Link Between Astronomy and ML

Josh Bloom — The Link Between Astronomy and ML

Josh explains how astronomy and machine learning have informed each other, their current limitations, and where their intersection goes from here. (Read more: http://wandb.me/gd-josh-bloom)---Josh is ...

20 Aug 20211h 8min

Xavier Amatriain — Building AI-powered Primary Care

Xavier Amatriain — Building AI-powered Primary Care

Xavier shares his experience deploying healthcare models, augmenting primary care with AI, the challenges of "ground truth" in medicine, and robustness in ML.---Xavier Amatriain is co-founder and CTO ...

30 Jul 202150min

Spence Green — Enterprise-scale Machine Translation

Spence Green — Enterprise-scale Machine Translation

Spence shares his experience creating a product around human-in-the-loop machine translation, and explains how machine translation has evolved over the years.---Spence Green is co-founder and CEO of L...

16 Jul 202143min

Populært innen Business og økonomi

stopp-verden
lydartikler-fra-aftenposten
dine-penger-pengeradet
e24-podden
rss-penger-polser-og-politikk
rss-borsmorgen-okonominyhetene
pengepodden-2
utbytte
finansredaksjonen
livet-pa-veien-med-jan-erik-larssen
pengesnakk
morgenkaffen-med-finansavisen
rss-politisk-preik
liberal-halvtime
lederpodden
stormkast-med-valebrokk-stordalen
rss-sunn-okonomi
rss-markedspuls-2
tid-er-penger-en-podcast-med-peter-warren
lederskap-nhhs-podkast-om-ledelse