From search trees to neural nets, a deep dive into natural language processing

From search trees to neural nets, a deep dive into natural language processing

We chatted with three guests:

Miguel Jetté: Head of AI R&D

Josh Dong: AI Engineering Manager

Jenny Drexler: Senior Speech Scientist

When Jette was studying mathematics in the early 2000s, his focus was on computational biology, and more specifically, phylogenetic trees, and DNA sequences. He wanted to understand the evolution of certain traits and the forces that explain why our bones are a certain length or our brains a certain size. As it turned out, the algorithms and techniques he learned in this field mapped very well to the emerging discipline of automatic speech recognition, or ASR.

During this period, Montreal was emerging as a hotbed for artificial intelligence, and Jette found himself working for Nuance, the company behind the original implementation of Siri. That experience led him to several positions in the world of speech recognition, and he eventually landed at Rev, where he founded the company’s AI department.

Jette describes Rev as an “Uber for Transcription.” Anyone can sign up for the platform and earn money by listening to audio submitted by clients and transcribing the speech into text. This means the company has a tremendous dataset of raw audio that has been annotated by human beings and, in many cases, assessed a second time by the client. For someone looking to build an AI system that mastered the domain of speech to text, this was a goldmine.

Jette built the earliest version of Rev’s AI, but it was up to our second guest, Josh Dong, to productize and scale that system. He helped the department transition from older technologies like Perl to more popular languages like Python. He also focused on practical concerns like modularity and reusable components. To combine machine learning and DevOps, Dong added Docker containers and a testing pipeline. If you’re interested in the nuts and bolts of keeping a system like Rev’s running at tremendous scale, you’ll want to check out this part of the show.

We also explore some of the fascinating future and promise this technology holds in our time with Jenny Drexler. She explains how Rev is moving from a hybrid model—one that combines Jette’s older statistical techniques with Dong’s newer machine learning approach—to a new system that will be ML from end-to-end. This will open up the door for powerful applications, like a single system that can convert speech text across multiple languages in a single piece of audio.

“One of the things that's really cool about these end to end models is that basically, whatever data you have, it can learn to handle it. So a very similar architecture can do sequence to sequence learning with different kinds of sequences. The model architecture that you might use for speech recognition can actually look very similar to what you might use for translation. And you can use that same architecture, to say, feed in audio in lots of different languages and be able to do transcription for any of them within one model. It's much harder with the hybrid models to sort of put all the right pieces together to make that happen,” explains Drexler.

If you’re interested in learning more about the past, present, and future of artificial intelligence that can understand our spoken language and learn how to respond, check out the full episode. If you want to learn more about Rev or check out some of the positions they have open, you can find their careers page here.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Avsnitt(925)

How do you fact-check an AI?

How do you fact-check an AI?

Vectara is a platform-as-a-service that allows users to build AI assistants and agents. Get started with their docs.This interview was recorded at HumanX last month. They are gearing up for next year’...

11 Apr 202526min

“There is a real cost to moving fast”: Using AI to accelerate drug discovery

“There is a real cost to moving fast”: Using AI to accelerate drug discovery

They also: Explore key challenges engineering leaders face, including data capacity, relevance, and throttling issues. Highlight how emerging AI tools and applications are transforming software engine...

10 Apr 202525min

WBIT #6: Be curious, ask questions, and don’t argue with JavaScript

WBIT #6: Be curious, ask questions, and don’t argue with JavaScript

When this episode was recorded, Jesse worked for WaveSeven Consulting, which provides business advisory and project delivery support for media and entertainment companies. He now works for ClickUp, sa...

9 Apr 202544min

Bottom of the first: A veteran VC’s take on the AI landscape

Bottom of the first: A veteran VC’s take on the AI landscape

Tomasz is a general partner at Theory Ventures, a venture capital firm focused on early-stage software companies.He’s a coauthor of Winning with Data, a deep dive into how big data has changed busines...

8 Apr 202528min

Is AI a bubble or a revolution? The answer is yes.

Is AI a bubble or a revolution? The answer is yes.

2024 was a defining year for AI investment. Read the HumanX/Crunchbase report.You can learn more about HumanX or register for next year’s event, April 7-9, 2026 in San Francisco.Follow Stefan on Linke...

4 Apr 202533min

Boots on the ground: Holistic AI and Audioshake at HumanX

Boots on the ground: Holistic AI and Audioshake at HumanX

Holistic AI is an AI governance platform that helps the enterprise adopt and scale AI.Audioshake uses AI to mix, master, and separate music and other audio content.Learn more about HumanX here. Feelin...

1 Apr 202524min

“Are AI agents ready for the enterprise?”

“Are AI agents ready for the enterprise?”

Deepak works on Amazon Q Developer, a GenAI-powered coding assistant that includes autonomous agents.Thinking, Fast and Slow by psychologist Daniel Kahneman is one of those books that’s a classic for ...

28 Mars 202528min

AI is shifting focus from syntax to critical thinking

AI is shifting focus from syntax to critical thinking

They also: Emphasize the critical role of customer feedback in shaping products, highlighting how continuous feedback loops drive innovation and improvement. Explore how AI is empowering non-technical...

27 Mars 202536min

Populärt inom Business & ekonomi

framgangspodden
varvet
badfluence
rss-jossan-nina
svd-tech-brief
rss-borsens-finest
rss-svart-marknad
uppgang-och-fall
rss-dagen-med-di
borsmorgon
avanzapodden
fill-or-kill
lastbilspodden
rss-inga-dumma-fragor-om-pengar
kapitalet-en-podd-om-ekonomi
tabberaset
rss-kort-lang-analyspodden-fran-di
bathina-en-podcast
affarsvarlden
market-makers