Delivering Neural Speech Services at Scale with Li Jiang - #522

Delivering Neural Speech Services at Scale with Li Jiang - #522

Today we’re joined by Li Jiang, a distinguished engineer at Microsoft working on Azure Speech. In our conversation with Li, we discuss his journey across 27 years at Microsoft, where he’s worked on, among other things, audio and speech recognition technologies. We explore his thoughts on the advancements in speech recognition over the past few years, the challenges, and advantages, of using either end-to-end or hybrid models. We also discuss the trade-offs between delivering accuracy or quality and the kind of runtime characteristics that you require as a service provider, in the context of engineering and delivering a service at the scale of Azure Speech. Finally, we walk through the data collection process for customizing a voice for TTS, what languages are currently supported, managing the responsibilities of threats like deep fakes, the future for services like these, and much more! The complete show notes for this episode can be found at twimlai.com/go/522.

Episoder(779)

Are We Being Honest About How Difficult AI Really Is? w/ David Ferrucci - TWiML Talk #268

Are We Being Honest About How Difficult AI Really Is? w/ David Ferrucci - TWiML Talk #268

Today we’re joined by David Ferrucci, Founder, CEO, and Chief Scientist at Elemental Cognition, a company focused on building natural learning systems that understand the world the way people do, to d...

23 Mai 201950min

Gauge Equivariant CNNs, Generative Models, and the Future of AI with Max Welling - TWiML Talk #267

Gauge Equivariant CNNs, Generative Models, and the Future of AI with Max Welling - TWiML Talk #267

Today we’re joined by Max Welling, research chair in machine learning at the University of Amsterdam, and VP of Technologies at Qualcomm, to discuss:  • Max’s research at Qualcomm AI Research and the...

20 Mai 20191h 3min

Can We Trust Scientific Discoveries Made Using Machine Learning? with Genevera Allen - TWiML Talk #266

Can We Trust Scientific Discoveries Made Using Machine Learning? with Genevera Allen - TWiML Talk #266

Today we’re joined by Genevera Allen, associate professor of statistics in the EECS Department at Rice University. Genevera caused quite the stir at the American Association for the Advancement of S...

16 Mai 201942min

Creative Adversarial Networks for Art Generation with Ahmed Elgammal - TWiML Talk #265

Creative Adversarial Networks for Art Generation with Ahmed Elgammal - TWiML Talk #265

Today we’re joined by Ahmed Elgammal, a professor in the department of computer science at Rutgers, and director of The Art and Artificial Intelligence Lab. We discuss his work on AICAN, a creative ad...

13 Mai 201938min

Diagnostic Visualization for Machine Learning with YellowBrick w/ Rebecca Bilbro - TWiML Talk #264

Diagnostic Visualization for Machine Learning with YellowBrick w/ Rebecca Bilbro - TWiML Talk #264

Today we close out our PyDataSci series joined by Rebecca Bilbro, head of data science at ICX media and co-creator of the popular open-source visualization library YellowBrick. In our conversation, ...

10 Mai 201941min

Librosa: Audio and Music Processing in Python with Brian McFee - TWiML Talk #263

Librosa: Audio and Music Processing in Python with Brian McFee - TWiML Talk #263

Today we continue our PyDataSci series joined by Brian McFee, assistant professor of music technology and data science at NYU, and creator of LibROSA, a python package for music and audio analysis. B...

9 Mai 201938min

Practical Natural Language Processing with spaCy and Prodigy w/ Ines Montani - TWiML Talk #262

Practical Natural Language Processing with spaCy and Prodigy w/ Ines Montani - TWiML Talk #262

In this episode of PyDataSci, we’re joined by Ines Montani, Cofounder of Explosion, Co-developer of SpaCy and lead developer of Prodigy. Ines and I caught up to discuss her various projects, includin...

7 Mai 201948min

Scaling Jupyter Notebooks with Luciano Resende - TWiML Talk #261

Scaling Jupyter Notebooks with Luciano Resende - TWiML Talk #261

Today we're joined by Luciano Resende, an Open Source AI Platform Architect at IBM, to discuss his work on Jupyter Enterprise Gateway. In our conversation, we address challenges that arise while usin...

6 Mai 201933min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
aftenpodden-usa
forklart
popradet
stopp-verden
det-store-bildet
bt-dokumentar-2
rss-gukild-johaug
dine-penger-pengeradet
nokon-ma-ga
lydartikler-fra-aftenposten
fotballpodden-2
hanna-de-heldige
frokostshowet-pa-p5
rss-penger-polser-og-politikk
aftenbla-bla
e24-podden
rss-dannet-uten-piano
rss-ness