Delivering Neural Speech Services at Scale with Li Jiang - #522

Today we’re joined by Li Jiang, a distinguished engineer at Microsoft working on Azure Speech. In our conversation with Li, we discuss his 27-year journey at Microsoft, where he has worked on, among other things, audio and speech recognition technologies. We explore his thoughts on the advancements in speech recognition over the past few years and the challenges and advantages of using end-to-end versus hybrid models. We also discuss the trade-offs between accuracy or quality and the runtime characteristics required of a service provider, in the context of engineering and delivering a service at the scale of Azure Speech. Finally, we walk through the data collection process for customizing a voice for TTS, which languages are currently supported, the responsibility of managing threats like deepfakes, the future of services like these, and much more! The complete show notes for this episode can be found at twimlai.com/go/522.

Episodes (779)

Exploring Causality and Community with Suzana Ilić - #419

In this special #TWIMLfest episode, we’re joined by Suzana Ilić, a computational linguist at Causaly and founder of Machine Learning Tokyo (MLT). Suzana joined us as a keynote speaker to discuss the ...

16 Oct 2020, 54 min

Decolonizing AI with Shakir Mohamed - #418

In this special #TWIMLfest edition of the podcast, we’re joined by Shakir Mohamed, a Senior Research Scientist at DeepMind. Shakir is also a leader of Deep Learning Indaba, a non-profit organization ...

14 Oct 2020, 54 min

Spatial Analysis for Real-Time Video Processing with Adina Trufinescu

Today we’re joined by Adina Trufinescu, Principal Program Manager at Microsoft, to discuss some of the computer vision updates announced at Ignite 2020.  We focus on the technical innovations that we...

8 Oct 2020, 39 min

How Deep Learning has Revolutionized OCR with Cha Zhang - #416

Today we’re joined by Cha Zhang, a Partner Engineering Manager at Microsoft Cloud & AI.  Cha’s work at MSFT is focused on exploring ways that new technologies can be applied to optical character reco...

5 Oct 2020, 57 min

Machine Learning for Food Delivery at Global Scale - #415

In this special edition of the show, we discuss the various ways in which machine learning plays a role in helping businesses overcome their challenges in the food delivery space.  A few weeks ago Sam...

2 Oct 2020, 57 min

Open Source at Qualcomm AI Research with Jeff Gehlhaar and Zahra Koochak - #414

Today we're joined by Jeff Gehlhaar, VP of Technology at Qualcomm, and Zahra Koochak, Staff Machine Learning Engineer at Qualcomm AI Research.  If you haven’t had a chance to listen to our first inte...

30 Sep 2020, 42 min

Visualizing Climate Impact with GANs w/ Sasha Luccioni - #413

Today we’re joined by Sasha Luccioni, a Postdoctoral Researcher at the MILA Institute, and moderator of our upcoming TWIMLfest Panel, ‘Machine Learning in the Fight Against Climate Change.’  We were ...

28 Sep 2020, 41 min

ML-Powered Language Learning at Duolingo with Burr Settles - #412

Today we’re joined by Burr Settles, Research Director at Duolingo. Most would acknowledge that one of the most effective ways to learn is one on one with a tutor, and Duolingo’s main goal is to replic...

24 Sep 2020, 55 min
