From search trees to neural nets, a deep dive into natural language processing

We chatted with three guests:

Miguel Jetté: Head of AI R&D

Josh Dong: AI Engineering Manager

Jenny Drexler: Senior Speech Scientist

When Jetté was studying mathematics in the early 2000s, his focus was on computational biology and, more specifically, phylogenetic trees and DNA sequences. He wanted to understand the evolution of certain traits and the forces that explain why our bones are a certain length or our brains a certain size. As it turned out, the algorithms and techniques he learned in this field mapped very well to the emerging discipline of automatic speech recognition, or ASR.

During this period, Montreal was emerging as a hotbed for artificial intelligence, and Jetté found himself working for Nuance, the company behind the original implementation of Siri. That experience led him to several positions in the world of speech recognition, and he eventually landed at Rev, where he founded the company’s AI department.

Jetté describes Rev as an “Uber for Transcription.” Anyone can sign up for the platform and earn money by listening to audio submitted by clients and transcribing the speech into text. This means the company has a tremendous dataset of raw audio that has been annotated by human beings and, in many cases, assessed a second time by the client. For someone looking to build an AI system that masters the domain of speech to text, this was a goldmine.

Jetté built the earliest version of Rev’s AI, but it was up to our second guest, Josh Dong, to productize and scale that system. He helped the department transition from older technologies like Perl to more popular languages like Python. He also focused on practical concerns like modularity and reusable components. To combine machine learning and DevOps, Dong added Docker containers and a testing pipeline. If you’re interested in the nuts and bolts of keeping a system like Rev’s running at tremendous scale, you’ll want to check out this part of the show.

In our time with Jenny Drexler, we also explore the future of this technology and the promise it holds. She explains how Rev is moving from a hybrid model, one that combines Jetté’s older statistical techniques with Dong’s newer machine learning approach, to a new system that will be ML from end to end. This will open the door to powerful applications, like a single system that can convert speech to text across multiple languages in a single piece of audio.

“One of the things that's really cool about these end-to-end models is that basically, whatever data you have, it can learn to handle it. So a very similar architecture can do sequence-to-sequence learning with different kinds of sequences. The model architecture that you might use for speech recognition can actually look very similar to what you might use for translation. And you can use that same architecture to, say, feed in audio in lots of different languages and be able to do transcription for any of them within one model. It's much harder with the hybrid models to sort of put all the right pieces together to make that happen,” explains Drexler.

If you’re interested in learning more about the past, present, and future of artificial intelligence that can understand our spoken language and learn how to respond, check out the full episode. If you want to learn more about Rev or check out some of the positions they have open, you can find their careers page here.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Episodes (904)

One of the world’s biggest web scrapers has some thoughts on data ownership

Or Lenchner is the CEO of Bright Data, a web data platform that offers ready-made datasets, proxy networks, and AI-powered web scrapers. Developers can get started with their docs here. ICYMI, read our blog post about the knowledge-as-a-service business model and how it will guide the future of our paid platform. AI answers alone aren’t knowledge. Connect with Or on LinkedIn. Stack Overflow user guizo earned a Populist badge by explaining How can I minify JSON in a shell script?

8 November 2024, 34 min

How Google is helping developers get better answers from AI

Logan previously worked at OpenAI, where he led developer relations. He’s now a senior product manager for Google AI Studio, the fastest way for devs to get started with the Gemini API. Logan’s team just rolled out Grounding with Google Search, a feature built to help developers get fresher, more accurate responses from the Gemini models, aided by Google Search. Learn more here. Connect with Logan on LinkedIn. Props to Stack Overflow user Jonik, who earned a Populist badge by explaining How to write an S3 object to a file?

5 November 2024, 25 min

How a creator of React is rethinking IDEs

Want to learn more about the early days of React? React.js: The Documentary gives you the full story from the perspective of the developers who created it. Vercel is a native Next.js platform. v0 aims to democratize software development for non-technical users. Check it out here. Listen to our recent conversation with Vercel’s VP of AI. Connect with Tom on LinkedIn or X. Kudos to Stack Overflow user Sodruldeen Mustapha, who earned a Lifeboat badge by answering How to remove the environment variables from Laravel Debug?

1 November 2024, 24 min

Life in the Fastlane: SDK tools built with developers in mind

Fastlane by PayPal is an accelerated guest checkout experience. Visit the Fastlane Resource Center for Developers to get started. You can find Sunny Patel on LinkedIn and on GitHub. Find Kyle Prinsloo on X and on LinkedIn. Congrats to Lifeboat badge winner M.M, who provided an answer to What does the "Expected '(' for function-style cast or type construction" error mean?

30 October 2024, 31 min

How can you get your kids into coding? We asked an 8-year-old app builder.

Watch Fay build a Harry Potter-themed chatbot with an assist from AI. Cursor is the AI code editor Fay’s using. Get started with their docs. Connect with Ricky on LinkedIn or X. Shoutout to Stack Overflow user Mahendra Kulkarni, who earned a Lifeboat badge by answering How do I get current rowindex of a table using JavaScript?

29 October 2024, 23 min

Tragedy of the (data) commons

The Data Provenance Initiative is a collective of volunteer AI researchers from around the world. They conduct large-scale audits of the massive datasets that power state-of-the-art AI models, with the goal of mapping the landscape of AI training data to improve transparency, documentation, and informed use of data. Their Explorer tool allows users to filter and analyze the training datasets typically used by large language models. Shayne and Robert are the authors of a new study called Consent in Crisis: The Rapid Decline of the AI Data Commons, the first large-scale, longitudinal audit of the consent protocols for the web domains underlying AI training sets. Connect with Shayne via his website. Connect with Robert via his website or on LinkedIn. Stack Overflow user George Hawkins earned a Populist badge by explaining How to get base url in angular 5?

25 October 2024, 30 min

The new pair programming: an AI agent that cleans your code as you write

Tariq Shaukat, the former president of Google Cloud and Bumble, is the CEO of Sonar. Follow him on LinkedIn. Sonar offers code quality and security solutions that help developers write clean code and remediate existing code organically. Their product SonarQube helps devs ensure the quality and security of AI-generated code. Watch Olivier Gaudin, founder of Sonar, explain why clean code is the foundation for well-functioning dev teams. Stack Overflow user Ogglas earned a Populist badge by explaining How to access the appsettings in Blazor WebAssembly.

22 October 2024, 29 min

How API security is evolving for the GenAI era

Solo.io provides API gateway, service mesh, and internal developer portal solutions. Follow Solo.io on X or LinkedIn or dig into the docs. Want to brush up on RAG? Our Guide to AI walks you through the concept and includes a practical example. Or check out one expert’s practical tips for RAG on our blog. Connect with Keith on LinkedIn. Shoutout to Stack Overflow user MrSimpleMind: their helpful answer to the question "How to run jq from gitbash in windows?" has been viewed by more than 213,000 people and won a Populist badge.

18 October 2024, 23 min
