From search trees to neural nets, a deep dive into natural language processing

From search trees to neural nets, a deep dive into natural language processing

We chatted with three guests:

Miguel Jetté: Head of AI R&D

Josh Dong: AI Engineering Manager

Jenny Drexler: Senior Speech Scientist

When Jette was studying mathematics in the early 2000s, his focus was on computational biology, and more specifically, phylogenetic trees, and DNA sequences. He wanted to understand the evolution of certain traits and the forces that explain why our bones are a certain length or our brains a certain size. As it turned out, the algorithms and techniques he learned in this field mapped very well to the emerging discipline of automatic speech recognition, or ASR.

During this period, Montreal was emerging as a hotbed for artificial intelligence, and Jette found himself working for Nuance, the company behind the original implementation of Siri. That experience led him to several positions in the world of speech recognition, and he eventually landed at Rev, where he founded the company’s AI department.

Jette describes Rev as an “Uber for Transcription.” Anyone can sign up for the platform and earn money by listening to audio submitted by clients and transcribing the speech into text. This means the company has a tremendous dataset of raw audio that has been annotated by human beings and, in many cases, assessed a second time by the client. For someone looking to build an AI system that mastered the domain of speech to text, this was a goldmine.

Jette built the earliest version of Rev’s AI, but it was up to our second guest, Josh Dong, to productize and scale that system. He helped the department transition from older technologies like Perl to more popular languages like Python. He also focused on practical concerns like modularity and reusable components. To combine machine learning and DevOps, Dong added Docker containers and a testing pipeline. If you’re interested in the nuts and bolts of keeping a system like Rev’s running at tremendous scale, you’ll want to check out this part of the show.

We also explore some of the fascinating future and promise this technology holds in our time with Jenny Drexler. She explains how Rev is moving from a hybrid model—one that combines Jette’s older statistical techniques with Dong’s newer machine learning approach—to a new system that will be ML from end-to-end. This will open up the door for powerful applications, like a single system that can convert speech text across multiple languages in a single piece of audio.

“One of the things that's really cool about these end to end models is that basically, whatever data you have, it can learn to handle it. So a very similar architecture can do sequence to sequence learning with different kinds of sequences. The model architecture that you might use for speech recognition can actually look very similar to what you might use for translation. And you can use that same architecture, to say, feed in audio in lots of different languages and be able to do transcription for any of them within one model. It's much harder with the hybrid models to sort of put all the right pieces together to make that happen,” explains Drexler.

If you’re interested in learning more about the past, present, and future of artificial intelligence that can understand our spoken language and learn how to respond, check out the full episode. If you want to learn more about Rev or check out some of the positions they have open, you can find their careers page here.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Jaksot(903)

Why is it so hard for companies to protect your privacy?

Why is it so hard for companies to protect your privacy?

Transcend is a data privacy and governance platform. See what they’re up to on their blog or dive into their docs.Find Minh on LinkedIn.Stack Overflow user ivanavitdev earned a Populist badge with their exceptionally thoughtful answer to How to use toSorted() method in TypeScript.See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

18 Helmi 202525min

Solving the data doom loop

Solving the data doom loop

Hasura is a GraphQL API platform. Get started exploring here.Read Ken’s article on the data doom loop.Find Ken on LinkedIn. Shoutout to Stack Overflow user liquorvicar, who earned a Lifeboat badge with an exemplary answer to Checking value in an array inside one SQL query with WHERE clause.See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

14 Helmi 202529min

A distributed database that can withstand a meteor strike

A distributed database that can withstand a meteor strike

OceanBase is an open-source distributed database. Check it out on GitHub.For more information, follow OceanBase on LinkedIn, X, and YouTube.To connect with Charlie Yang, find him on LinkedIn.Got questions about OceanBase? Join the discussion here on Stack Overflow.See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

12 Helmi 202522min

“In the short term, more chaos”: What’s next for API design

“In the short term, more chaos”: What’s next for API design

Speakeasy builds API tooling for developers.Find Sagar on LinkedIn. Kudos to Stack Overflow user Bergi, who earned a Lifeboat badge with an exemplary answer to What is the Universal Module Definition (UMD)?.See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

11 Helmi 202528min

Why build your own vector DB? To process 25,000 images per second

Why build your own vector DB? To process 25,000 images per second

Verkada is a cloud-based video security company. Back in the innocent days of 2021, we spoke with a company that makes smart dashcams. See how far video and image processing has come. Congrats to Reg for earning a Lifeboat badge for their answer on What is the difference between JSP and Spring?See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

7 Helmi 202535min

Will the web ever be the primary delivery system for 3D games?

Will the web ever be the primary delivery system for 3D games?

Tres.js is an open-source 3D engine for Vue built on Three.js. Find Jaime on LinkedIn or GitHub or explore his creative lab.Push is a browser-based identity security platform that detects and blocks identity attacks, enforces security controls, and monitors employee logins to cloud accounts.Shoutout to Stack Overflow user zwol, who earned a Lifeboat badge with an excellent answer to How would you write the equivalent of this C++ loop in Rust.See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

4 Helmi 202522min

Feature flags: Theory meets reality

Feature flags: Theory meets reality

Schematic offers SDKs for packaging, pricing, and entitlements. Check out Ben’s article on feature flags. Listen to Bill Tarr from AWS and Brian Rinaldi (then at LaunchDarkly and now at Localstack) talk about the opportunity to extend feature flags beyond deployment and rollout and into entitlement management and monetization.Find Fynn on LinkedIn.Find Ben on LinkedIn.feature flags, software development, technical debt, business strategy, product management, feature management, DevOps, software engineering, pricing models, entitlementsSee Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

31 Tammi 202533min

“Countries are coming online tomorrow, whole countries”

“Countries are coming online tomorrow, whole countries”

ClickUp is a work and chat platform designed to streamline workflows and make people more productive.You can find RJ on LinkedIn or explore his posts on the ClickUp blog.Shoutout to Stack Overflow user Hemant Singh, who helped the community understand pause vs stop in docker.See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

28 Tammi 202535min

Suosittua kategoriassa Liike-elämä ja talous

sijotuskasti
mimmit-sijoittaa
psykopodiaa-podcast
rss-rahapodi
ostan-asuntoja-podcast
rss-lahtijat
pomojen-suusta
taloudellinen-mielenrauha
oppimisen-psykologia
rahapuhetta
sijoituspodi
rss-seuraava-potilas
kasvun-kipuja
rss-viisas-raha-podi
rss-neuvottelija-sami-miettinen
io-techin-tekniikkapodcast
sijoitusovi-podcast
rss-uskalla-yrittaa
rss-h-asselmoilanen
rss-merja-mahkan-rahat