Ines & Sofie — Building Industrial-Strength NLP Pipelines

Ines & Sofie — Building Industrial-Strength NLP Pipelines

Sofie and Ines walk us through how the new spaCy library helps build end to end SOTA natural language processing workflows.

Ines Montani is the co-founder of Explosion AI, a digital studio specializing in tools for AI technology. She's a core developer of spaCy, one of the leading open-source libraries for Natural Language Processing in Python and Prodigy, a new data annotation tool powered by active learning. Before founding Explosion AI, she was a freelance front-end developer and strategist.

https://twitter.com/_inesmontani


Sofie Van Landeghem is a Natural Language Processing and Machine Learning engineer at Explosion.ai. She is a Software Engineer at heart, with an absurd love for quality assurance and testing, introducing proper levels of abstraction, and ensuring code robustness and modularity.


She has more than 12 years of experience in Natural Language Processing and Machine Learning, including in the pharmaceutical industry and the food industry.

https://twitter.com/oxykodit


https://spacy.io/

https://prodi.gy/

https://thinc.ai/

https://explosion.ai/


Topics covered:

0:00 Sneak peek

0:35 intro

2:29 How spaCy was started

6:11 Business model, open source

9:55 What was spaCy designed to solve?

12:23 advances in NLP and modern practices in industry

17:19 what differentiates spaCy from a more research focused NLP library?

19:28 Multi-lingual/domain specific support

23:52 spaCy V3 configuration

28:16 Thoughts on Python, Syphon, other programming languages for ML

33:45 Making things clear and reproducible

37:30 prodigy and getting good training data

44:09 most underrated aspect of ML

51:00 hardest part of putting models into production


Visit our podcasts homepage for transcripts and more episodes!

www.wandb.com/podcast


Get our podcast on Apple, Spotify, and Google!

Apple Podcasts: bit.ly/2WdrUvI

Spotify: bit.ly/2SqtadF

Google:tiny.cc/GD_Google


We started Weights and Biases to build tools for Machine Learning practitioners because we care a lot about the impact that Machine Learning can have in the world and we love working in the trenches with the people building these models. One of the most fun things about these building tools has been the conversations with these ML practitioners and learning about the interesting things they’re working on. This process has been so fun that we wanted to open it up to the world in the form of our new podcast called Gradient Dissent. We hope you have as much fun listening to it as we had making it!


Join our bi-weekly virtual salon and listen to industry leaders and researchers in machine learning share their research:

tiny.cc/wb-salon


Join our community of ML practitioners where we host AMA's, share interesting projects and meet other people working in Deep Learning:

bit.ly/wb-slack


Our gallery features curated machine learning reports by researchers exploring deep learning techniques, Kagglers showcasing winning models, and industry leaders sharing best practices.

app.wandb.ai/gallery

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(136)

AI in electronics: Quilter’s journey in PCB design

AI in electronics: Quilter’s journey in PCB design

In this episode of Gradient Dissent, Sergiy Nesterenko, CEO of Quilter, joins host Lukas Biewald to discuss the groundbreaking use of reinforcement learning in PCB design. Learn how Quilter automates ...

6 Jun 202443min

The Future of AI in Coding with Codeium CEO Varun Mohan

The Future of AI in Coding with Codeium CEO Varun Mohan

In this episode of Gradient Dissent, Varun Mohan, Co-Founder & CEO of Codeium, joins host Lukas Biewald to discuss the transformative power of AI in coding. They explore how Codeium evolved from GPU v...

23 Mai 202454min

Shaping AI Benchmarks with Together AI Co-Founder Percy Liang

Shaping AI Benchmarks with Together AI Co-Founder Percy Liang

In this episode of Gradient Dissent, Together AI co-founder and Stanford Associate Professor Percy Liang joins host, Lukas Biewald, to discuss advancements in AI benchmarking and the pivotal role that...

9 Mai 202453min

Accelerating drug discovery with AI: Insights from Isomorphic Labs

Accelerating drug discovery with AI: Insights from Isomorphic Labs

In this episode of Gradient Dissent, Isomorphic Labs Chief AI Officer Max Jaderberg, and Chief Technology Officer Sergei Yakneen join our host Lukas Biewald to discuss the advancements in biotech and ...

25 Apr 20241h 10min

Redefining AI Hardware for Enterprise with SambaNova’s Rodrigo Liang

Redefining AI Hardware for Enterprise with SambaNova’s Rodrigo Liang

🚀 Discover the cutting-edge AI hardware development for enterprises in this episode of Gradient Dissent, featuring Rodrigo Liang, CEO of SambaNova Systems. Rodrigo Liang’s journey from Oracle to foun...

11 Apr 202453min

Navigating the Vector Database Landscape with Pinecone's Edo Liberty

Navigating the Vector Database Landscape with Pinecone's Edo Liberty

🚀 This episode of Gradient Dissent welcomes Edo Liberty, the mind behind Pinecone's revolutionary vector database technology.As a former leader at Amazon AI Labs and Yahoo's New York lab, Edo Liberty...

28 Mar 20241h 6min

Transforming Data into Business Solutions with Salesforce AI CEO, Clara Shih

Transforming Data into Business Solutions with Salesforce AI CEO, Clara Shih

🚀 In this episode of Gradient Dissent, we explore the revolutionary impact of AI across industries with Clara Shih, CEO of Salesforce AI and Founder of Hearsay Systems. Dive into Salesforce AI's cutt...

14 Mar 202458min

Upgrading Your Health: Navigating AI's Future In Healthcare with John Halamka of Mayo Clinic Platform

Upgrading Your Health: Navigating AI's Future In Healthcare with John Halamka of Mayo Clinic Platform

In the newest episode of Gradient Dissent, we explore the intersecting worlds of AI and Healthcare with John Halamka, President of the Mayo Clinic Platform.Journey with us down John Halamka's remarkab...

29 Feb 20241h 4min

Populært innen Business og økonomi

stopp-verden
lydartikler-fra-aftenposten
dine-penger-pengeradet
rss-penger-polser-og-politikk
e24-podden
rss-borsmorgen-okonominyhetene
rss-skravla-gar
livet-pa-veien-med-jan-erik-larssen
finansredaksjonen
rss-pa-konto
pengesnakk
pengepodden-2
tid-er-penger-en-podcast-med-peter-warren
utbytte
morgenkaffen-med-finansavisen
rss-markedspuls-2
liberal-halvtime
lederpodden
rss-sunn-okonomi
okonomiamatorene