From search trees to neural nets, a deep dive into natural language processing

From search trees to neural nets, a deep dive into natural language processing

We chatted with three guests:

Miguel Jetté: Head of AI R&D

Josh Dong: AI Engineering Manager

Jenny Drexler: Senior Speech Scientist

When Jette was studying mathematics in the early 2000s, his focus was on computational biology, and more specifically, phylogenetic trees, and DNA sequences. He wanted to understand the evolution of certain traits and the forces that explain why our bones are a certain length or our brains a certain size. As it turned out, the algorithms and techniques he learned in this field mapped very well to the emerging discipline of automatic speech recognition, or ASR.

During this period, Montreal was emerging as a hotbed for artificial intelligence, and Jette found himself working for Nuance, the company behind the original implementation of Siri. That experience led him to several positions in the world of speech recognition, and he eventually landed at Rev, where he founded the company’s AI department.

Jette describes Rev as an “Uber for Transcription.” Anyone can sign up for the platform and earn money by listening to audio submitted by clients and transcribing the speech into text. This means the company has a tremendous dataset of raw audio that has been annotated by human beings and, in many cases, assessed a second time by the client. For someone looking to build an AI system that mastered the domain of speech to text, this was a goldmine.

Jette built the earliest version of Rev’s AI, but it was up to our second guest, Josh Dong, to productize and scale that system. He helped the department transition from older technologies like Perl to more popular languages like Python. He also focused on practical concerns like modularity and reusable components. To combine machine learning and DevOps, Dong added Docker containers and a testing pipeline. If you’re interested in the nuts and bolts of keeping a system like Rev’s running at tremendous scale, you’ll want to check out this part of the show.

We also explore some of the fascinating future and promise this technology holds in our time with Jenny Drexler. She explains how Rev is moving from a hybrid model—one that combines Jette’s older statistical techniques with Dong’s newer machine learning approach—to a new system that will be ML from end-to-end. This will open up the door for powerful applications, like a single system that can convert speech text across multiple languages in a single piece of audio.

“One of the things that's really cool about these end to end models is that basically, whatever data you have, it can learn to handle it. So a very similar architecture can do sequence to sequence learning with different kinds of sequences. The model architecture that you might use for speech recognition can actually look very similar to what you might use for translation. And you can use that same architecture, to say, feed in audio in lots of different languages and be able to do transcription for any of them within one model. It's much harder with the hybrid models to sort of put all the right pieces together to make that happen,” explains Drexler.

If you’re interested in learning more about the past, present, and future of artificial intelligence that can understand our spoken language and learn how to respond, check out the full episode. If you want to learn more about Rev or check out some of the positions they have open, you can find their careers page here.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Jaksot(911)

What an MCP implementation looks like at a CRM company

What an MCP implementation looks like at a CRM company

Ryan chats with  Karen Ng, EVP of Product at HubSpot, to chat about Model Context Protocol (MCP) and how they implemented it for their server for their CRM product. They chat the emergence of this as ...

16 Syys 202524min

Planning to Arm mobile devices with chips that handle AI

Planning to Arm mobile devices with chips that handle AI

Ryan welcomes Geraint North, AI and developer platforms fellow at Arm, to dive into the impact of GenAI on chip design, Arm’s approach to designing flexible CPU architectures, and the challenges of op...

12 Syys 202532min

How AI is reshaping developer teams and the future of software development

How AI is reshaping developer teams and the future of software development

In this two-part episode of Leaders of Code, Peter O’Connor, Director of Platform Engineering, welcomes Ryan J. Salva, Senior Director of Product at Google, Developer Experiences, for a deep dive into...

11 Syys 202531min

We built stackoverflow.ai with the community and for the community

We built stackoverflow.ai with the community and for the community

Ryan is joined by our very own Ash Zade, Product Manager, and Alex Warren, Staff Software Engineer, to discuss our newly released stackoverflow.ai, how it’s enhancing user experience by combining huma...

9 Syys 202533min

Kotlin is more than just the Android house language

Kotlin is more than just the Android house language

Ryan welcomes Jeffrey van Gogh, Director of Engineering, Android Developer Experience, at Google and board member of the Kotlin Foundation. They discuss the evolution of the Kotlin language from JVM t...

5 Syys 202530min

Building AI for consumer applications isn’t all fun and games

Building AI for consumer applications isn’t all fun and games

Kylan Gibbs, CEO of Inworld, joins the show to discuss the technical challenges of creating interactive AI for virtual worlds and games, the significance of user experience, and the importance of acce...

2 Syys 202529min

Open-source is for the people, by the people

Open-source is for the people, by the people

Travis Oliphant, creator of NumPy and SciPy, joins Ryan to explore the development of Python as a data science tool, the evolution of these foundational libraries, and the importance of community and ...

29 Elo 202538min

From punch cards to prompts: a history of how software got better

From punch cards to prompts: a history of how software got better

SPONSORED BY AWSRyan welcomes Darko Mesaroš, Principal Developer Advocate at AWS and all around computer history buff, to chat about history of software development improvements and how they made deve...

27 Elo 202534min

Suosittua kategoriassa Liike-elämä ja talous

sijotuskasti
psykopodiaa-podcast
mimmit-sijoittaa
rss-rahapodi
rss-draivi
rss-lahtijat
oppimisen-psykologia
rss-rahamania
rss-porssipuhetta
taloudellinen-mielenrauha
rss-seuraava-potilas
rahapuhetta
rss-h-asselmoilanen
rss-paatos-podcast-suomen-kovimmat-paatoksentekijat-2
rss-paasipodi
rss-inderes
io-techin-tekniikkapodcast
pomojen-suusta
rss-viisas-raha-podi
rss-40-ajatusta-aanesta