From search trees to neural nets, a deep dive into natural language processing

From search trees to neural nets, a deep dive into natural language processing

We chatted with three guests:

Miguel Jetté: Head of AI R&D

Josh Dong: AI Engineering Manager

Jenny Drexler: Senior Speech Scientist

When Jette was studying mathematics in the early 2000s, his focus was on computational biology, and more specifically, phylogenetic trees, and DNA sequences. He wanted to understand the evolution of certain traits and the forces that explain why our bones are a certain length or our brains a certain size. As it turned out, the algorithms and techniques he learned in this field mapped very well to the emerging discipline of automatic speech recognition, or ASR.

During this period, Montreal was emerging as a hotbed for artificial intelligence, and Jette found himself working for Nuance, the company behind the original implementation of Siri. That experience led him to several positions in the world of speech recognition, and he eventually landed at Rev, where he founded the company’s AI department.

Jette describes Rev as an “Uber for Transcription.” Anyone can sign up for the platform and earn money by listening to audio submitted by clients and transcribing the speech into text. This means the company has a tremendous dataset of raw audio that has been annotated by human beings and, in many cases, assessed a second time by the client. For someone looking to build an AI system that mastered the domain of speech to text, this was a goldmine.

Jette built the earliest version of Rev’s AI, but it was up to our second guest, Josh Dong, to productize and scale that system. He helped the department transition from older technologies like Perl to more popular languages like Python. He also focused on practical concerns like modularity and reusable components. To combine machine learning and DevOps, Dong added Docker containers and a testing pipeline. If you’re interested in the nuts and bolts of keeping a system like Rev’s running at tremendous scale, you’ll want to check out this part of the show.

We also explore some of the fascinating future and promise this technology holds in our time with Jenny Drexler. She explains how Rev is moving from a hybrid model—one that combines Jette’s older statistical techniques with Dong’s newer machine learning approach—to a new system that will be ML from end-to-end. This will open up the door for powerful applications, like a single system that can convert speech text across multiple languages in a single piece of audio.

“One of the things that's really cool about these end to end models is that basically, whatever data you have, it can learn to handle it. So a very similar architecture can do sequence to sequence learning with different kinds of sequences. The model architecture that you might use for speech recognition can actually look very similar to what you might use for translation. And you can use that same architecture, to say, feed in audio in lots of different languages and be able to do transcription for any of them within one model. It's much harder with the hybrid models to sort of put all the right pieces together to make that happen,” explains Drexler.

If you’re interested in learning more about the past, present, and future of artificial intelligence that can understand our spoken language and learn how to respond, check out the full episode. If you want to learn more about Rev or check out some of the positions they have open, you can find their careers page here.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Jaksot(903)

How the internet changed in 2024

How the internet changed in 2024

Check out Cloudflare’s 2024 Year in Review.Read John’s posts on the Cloudflare blog or connect with him on LinkedIn. Shoutout to user Timo Kähkönen for providing knowledge-seekers with a cheap algorithm to find measure of angle between vectors.See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

24 Tammi 202532min

WBIT#3: Can good team dynamics make Agile obsolete?

WBIT#3: Can good team dynamics make Agile obsolete?

ApartmentAdvisor helps renters find apartments and navigate more complicated markets. What were the people who wrote the Agile Manifesto thinking? Listen to our podcast with original signatory Jim Highsmith and find out. You think the tech is impressive? Wes played the first “perfect” game of Donkey Kong. Find Wes on GitHub, LinkedIn, Twitter, or his website. See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

22 Tammi 202537min

The developer skill you might be neglecting

The developer skill you might be neglecting

Find Geoffrey (Jef) Huck on LinkedIn or check out his website.Stack Overflow user Matt earned a Lifeboat badge by explaining What is the difference between Tomcat containers and Docker containers?. See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

17 Tammi 202527min

Robots building robots in a robotic factory

Robots building robots in a robotic factory

Postman is an API development platform that lets developers prototype, document, test, and demo all their APIs in one place.Postman’s cofounder and CEO recently wrote about the rise of agentic AI.Find Sterling on LinkedIn. Shoutout to Stack Overflow user Knossos, who earned a Lifeboat badge by answering What is the difference between TextView and TextViewCompat. APIs, AI, GraphQL, REST, gRPC, API-first, Sterling Chin, Postman, technology, software developmentSee Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

14 Tammi 202522min

“Data is the key”: Twilio’s Head of R&D on the need for good data

“Data is the key”: Twilio’s Head of R&D on the need for good data

Twilio, a communication platform as a service (CPaaS), allows developers to build voice, video, and messaging capabilities into their apps. Devs can get started with their docs.Find Inbal on LinkedIn.Kudos to Stack Overflow user Wesos de Queso for explaining how to Prevent a toggle group from not having a toggle selected - Java FX.AI, Twilio, Inbal Shani, machine learning, LLM, developer productivity, responsible AI, tech stack, customer engagement, conversational AISee Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

10 Tammi 202528min

Failing fast at scale: Rapid prototyping at Intuit

Failing fast at scale: Rapid prototyping at Intuit

To learn more about technology or careers at Intuit, visit intuit.com/technology. Want to read more about rapid prototyping at Intuit?To connect with Himanshu, find him on LinkedIn. See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

8 Tammi 202527min

WBIT #2: Memories of persistence and the state of state

WBIT #2: Memories of persistence and the state of state

Polly is an embedded insurance company, which means you buy the insurance for a car or house at the same time as you buy the car or house itself.  Redux is a state management library for JavaScript. We’ve talked about the productivity drains that meetings can have previously on the blog. Connect with Jon on LinkedIn. See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

7 Tammi 202530min

How AI apps are like Google Search

How AI apps are like Google Search

Jetify gives developers a cloud environment for building AI powered applications. Check out their blog or explore Jetify Cloud, a suite of managed services designed to make software development easier for teams.Daniel is on LinkedIn. Stack Overflow user Dhaval Simaria earned a Lifeboat badge by explaining the Difference between pushing a docker image and installing helm image. See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

3 Tammi 202523min

Suosittua kategoriassa Liike-elämä ja talous

sijotuskasti
mimmit-sijoittaa
psykopodiaa-podcast
rss-rahapodi
ostan-asuntoja-podcast
rss-lahtijat
pomojen-suusta
taloudellinen-mielenrauha
oppimisen-psykologia
rahapuhetta
sijoituspodi
rss-seuraava-potilas
kasvun-kipuja
rss-viisas-raha-podi
rss-neuvottelija-sami-miettinen
io-techin-tekniikkapodcast
sijoitusovi-podcast
rss-uskalla-yrittaa
rss-h-asselmoilanen
rss-merja-mahkan-rahat