From search trees to neural nets, a deep dive into natural language processing

We chatted with three guests:

Miguel Jetté: Head of AI R&D

Josh Dong: AI Engineering Manager

Jenny Drexler: Senior Speech Scientist

When Jetté was studying mathematics in the early 2000s, his focus was on computational biology, and more specifically on phylogenetic trees and DNA sequences. He wanted to understand the evolution of certain traits and the forces that explain why our bones are a certain length or our brains a certain size. As it turned out, the algorithms and techniques he learned in this field mapped very well to the emerging discipline of automatic speech recognition, or ASR.

During this period, Montreal was emerging as a hotbed for artificial intelligence, and Jetté found himself working for Nuance, the company whose speech recognition technology powered the original implementation of Siri. That experience led him to several positions in the world of speech recognition, and he eventually landed at Rev, where he founded the company’s AI department.

Jetté describes Rev as an “Uber for Transcription.” Anyone can sign up for the platform and earn money by listening to audio submitted by clients and transcribing the speech into text. This means the company has a tremendous dataset of raw audio that has been annotated by human beings and, in many cases, assessed a second time by the client. For someone looking to build an AI system that masters the domain of speech-to-text, this was a goldmine.

Jetté built the earliest version of Rev’s AI, but it was up to our second guest, Josh Dong, to productize and scale that system. He helped the department transition from older technologies like Perl to more popular languages like Python. He also focused on practical concerns like modularity and reusable components. To combine machine learning and DevOps, Dong added Docker containers and a testing pipeline. If you’re interested in the nuts and bolts of keeping a system like Rev’s running at tremendous scale, you’ll want to check out this part of the show.

In our time with Jenny Drexler, we also explore the future and promise this technology holds. She explains how Rev is moving from a hybrid model (one that combines Jetté’s older statistical techniques with Dong’s newer machine learning approach) to a new system that will be ML from end to end. This will open the door to powerful applications, like a single system that can convert speech to text across multiple languages in a single piece of audio.

“One of the things that's really cool about these end to end models is that basically, whatever data you have, it can learn to handle it. So a very similar architecture can do sequence to sequence learning with different kinds of sequences. The model architecture that you might use for speech recognition can actually look very similar to what you might use for translation. And you can use that same architecture, to say, feed in audio in lots of different languages and be able to do transcription for any of them within one model. It's much harder with the hybrid models to sort of put all the right pieces together to make that happen,” explains Drexler.

If you’re interested in learning more about the past, present, and future of artificial intelligence that can understand our spoken language and learn how to respond, check out the full episode. If you want to learn more about Rev or check out some of the positions they have open, visit their careers page.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
