Ines & Sofie — Building Industrial-Strength NLP Pipelines

Ines & Sofie — Building Industrial-Strength NLP Pipelines

Sofie and Ines walk us through how the new spaCy library helps build end to end SOTA natural language processing workflows. Ines Montani is the co-founder of Explosion AI, a digital studio specializing in tools for AI technology. She's a core developer of spaCy, one of the leading open-source libraries for Natural Language Processing in Python and Prodigy, a new data annotation tool powered by active learning. Before founding Explosion AI, she was a freelance front-end developer and strategist. https://twitter.com/_inesmontani Sofie Van Landeghem is a Natural Language Processing and Machine Learning engineer at Explosion.ai. She is a Software Engineer at heart, with an absurd love for quality assurance and testing, introducing proper levels of abstraction, and ensuring code robustness and modularity. She has more than 12 years of experience in Natural Language Processing and Machine Learning, including in the pharmaceutical industry and the food industry. https://twitter.com/oxykodit https://spacy.io/ https://prodi.gy/ https://thinc.ai/ https://explosion.ai/ Topics covered: 0:00 Sneak peek 0:35 intro 2:29 How spaCy was started 6:11 Business model, open source 9:55 What was spaCy designed to solve? 12:23 advances in NLP and modern practices in industry 17:19 what differentiates spaCy from a more research focused NLP library? 19:28 Multi-lingual/domain specific support 23:52 spaCy V3 configuration 28:16 Thoughts on Python, Syphon, other programming languages for ML 33:45 Making things clear and reproducible 37:30 prodigy and getting good training data 44:09 most underrated aspect of ML 51:00 hardest part of putting models into production Visit our podcasts homepage for transcripts and more episodes! www.wandb.com/podcast Get our podcast on Apple, Spotify, and Google! Apple Podcasts: bit.ly/2WdrUvI Spotify: bit.ly/2SqtadF Google:tiny.cc/GD_Google We started Weights and Biases to build tools for Machine Learning practitioners because we care a lot about the impact that Machine Learning can have in the world and we love working in the trenches with the people building these models. One of the most fun things about these building tools has been the conversations with these ML practitioners and learning about the interesting things they’re working on. This process has been so fun that we wanted to open it up to the world in the form of our new podcast called Gradient Dissent. We hope you have as much fun listening to it as we had making it! Join our bi-weekly virtual salon and listen to industry leaders and researchers in machine learning share their research: tiny.cc/wb-salon Join our community of ML practitioners where we host AMA's, share interesting projects and meet other people working in Deep Learning: bit.ly/wb-slack Our gallery features curated machine learning reports by researchers exploring deep learning techniques, Kagglers showcasing winning models, and industry leaders sharing best practices. app.wandb.ai/gallery

Avsnitt(134)

Anantha Kancherla — Building Level 5 Autonomous Vehicles

Anantha Kancherla — Building Level 5 Autonomous Vehicles

As Lyft’s VP of Engineering, Software at Level 5, Autonomous Vehicle Program, Anantha Kancherla has a birds-eye view on what it takes to make self-driving cars work in the real world. He previously wo...

12 Aug 202044min

Bharath Ramsundar — Deep Learning for Molecules and Medicine Discovery

Bharath Ramsundar — Deep Learning for Molecules and Medicine Discovery

Bharath created the deepchem.io open-source project to grow the deep drug discovery open source community, co-created the moleculenet.ai benchmark suite to facilitate development of molecular algorith...

5 Aug 202055min

Chip Huyen — ML Research and Production Pipelines

Chip Huyen — ML Research and Production Pipelines

Chip Huyen is a writer and computer scientist currently working at a startup that focuses on machine learning production pipelines. Previously, she’s worked at NVIDIA, Netflix, and Primer. She helped ...

29 Juli 202043min

Peter Skomoroch — Product Management for AI

Peter Skomoroch — Product Management for AI

👨🏻‍💻Our guest on this episode of Gradient Dissent is Peter Skomoroch! Peter is the former head of data products at Workday and LinkedIn. Previously, he was the cofounder and CEO of venture-backed d...

22 Juli 20201h 27min

Josh Tobin — Productionizing ML Models

Josh Tobin — Productionizing ML Models

Josh Tobin is a researcher working at the intersection of machine learning and robotics. His research focuses on applying deep reinforcement learning, generative models, and synthetic data to problems...

8 Juli 202048min

Miles Brundage — Societal Impacts of Artificial Intelligence

Miles Brundage — Societal Impacts of Artificial Intelligence

Miles Brundage researches the societal impacts of artificial intelligence and how to make sure they go well. In 2018, he joined OpenAI, as a Research Scientist on the Policy team. Previously, he was a...

1 Juli 20201h 2min

Hamel Husain — Building Machine Learning Tools

Hamel Husain — Building Machine Learning Tools

Hamel Husain is a Staff Machine Learning Engineer at Github. He has extensive experience building data analytics and predictive modeling solutions for a wide range of industries, including: hospitalit...

24 Juni 202036min

Peter Welinder — Deep Reinforcement Learning and Robotics

Peter Welinder — Deep Reinforcement Learning and Robotics

Peter Welinder is a research scientist and roboticist at OpenAI. Before that, he was an engineer at Dropbox and ran the machine learning team, and before that, he co-founded Anchovi Labs a startup usi...

17 Juni 202054min

Populärt inom Business & ekonomi

framgangspodden
varvet
badfluence
rss-jossan-nina
rss-borsens-finest
avanzapodden
svd-tech-brief
rss-svart-marknad
uppgang-och-fall
fill-or-kill
rss-dagen-med-di
borsmorgon
kapitalet-en-podd-om-ekonomi
affarsvarlden
rss-kort-lang-analyspodden-fran-di
tabberaset
lastbilspodden
24fragor
bathina-en-podcast
borslunch-2