From search trees to neural nets, a deep dive into natural language processing

We chatted with three guests:

Miguel Jetté: Head of AI R&D

Josh Dong: AI Engineering Manager

Jenny Drexler: Senior Speech Scientist

When Jetté was studying mathematics in the early 2000s, his focus was on computational biology, and more specifically on phylogenetic trees and DNA sequences. He wanted to understand the evolution of certain traits and the forces that explain why our bones are a certain length or our brains a certain size. As it turned out, the algorithms and techniques he learned in this field mapped very well to the emerging discipline of automatic speech recognition, or ASR.

During this period, Montreal was emerging as a hotbed for artificial intelligence, and Jetté found himself working for Nuance, the company behind the original implementation of Siri. That experience led him to several positions in the world of speech recognition, and he eventually landed at Rev, where he founded the company’s AI department.

Jetté describes Rev as an “Uber for Transcription.” Anyone can sign up for the platform and earn money by listening to audio submitted by clients and transcribing the speech into text. This means the company has a tremendous dataset of raw audio that has been annotated by human beings and, in many cases, assessed a second time by the client. For someone looking to build an AI system that masters the domain of speech-to-text, this was a goldmine.

Jetté built the earliest version of Rev’s AI, but it was up to our second guest, Josh Dong, to productize and scale that system. He helped the department transition from older technologies like Perl to more popular languages like Python. He also focused on practical concerns like modularity and reusable components. To combine machine learning and DevOps, Dong added Docker containers and a testing pipeline. If you’re interested in the nuts and bolts of keeping a system like Rev’s running at tremendous scale, you’ll want to check out this part of the show.

We also explore the future and promise of this technology in our time with Jenny Drexler. She explains how Rev is moving from a hybrid model, one that combines Jetté’s older statistical techniques with Dong’s newer machine learning approach, to a new system that will be ML from end to end. This will open the door to powerful applications, like a single system that can convert speech to text across multiple languages in a single piece of audio.

“One of the things that's really cool about these end-to-end models is that basically, whatever data you have, it can learn to handle it. So a very similar architecture can do sequence-to-sequence learning with different kinds of sequences. The model architecture that you might use for speech recognition can actually look very similar to what you might use for translation. And you can use that same architecture to, say, feed in audio in lots of different languages and be able to do transcription for any of them within one model. It's much harder with the hybrid models to sort of put all the right pieces together to make that happen,” explains Drexler.

If you’re interested in learning more about the past, present, and future of artificial intelligence that can understand our spoken language and learn how to respond, check out the full episode. If you want to learn more about Rev or check out some of the positions they have open, you can find their careers page here.
