Building LLM-Based Applications with Azure OpenAI with Jay Emery - #657

Building LLM-Based Applications with Azure OpenAI with Jay Emery - #657

Today we’re joined by Jay Emery, director of technical sales & architecture at Microsoft Azure. In our conversation with Jay, we discuss the challenges faced by organizations when building LLM-based applications, and we explore some of the techniques they are using to overcome them. We dive into the concerns around security, data privacy, cost management, and performance as well as the ability and effectiveness of prompting to achieve the desired results versus fine-tuning, and when each approach should be applied. We cover methods such as prompt tuning and prompt chaining, prompt variance, fine-tuning, and RAG to enhance LLM output along with ways to speed up inference performance such as choosing the right model, parallelization, and provisioned throughput units (PTUs). In addition to that, Jay also shared several intriguing use cases describing how businesses use tools like Azure Machine Learning prompt flow and Azure ML AI Studio to tailor LLMs to their unique needs and processes. The complete show notes for this episode can be found at twimlai.com/go/657.

Episoder(778)

Innovating Neural Machine Translation with Arul Menezes - #458

Innovating Neural Machine Translation with Arul Menezes - #458

Today we’re joined by Arul Menezes, a Distinguished Engineer at Microsoft.  Arul, a 30 year veteran of Microsoft, manages the machine translation research and products in the Azure Cognitive Services...

22 Feb 202144min

Building the Product Knowledge Graph at Amazon with Luna Dong - #457

Building the Product Knowledge Graph at Amazon with Luna Dong - #457

Today we’re joined by Luna Dong, Sr. Principal Scientist at Amazon. In our conversation with Luna, we explore Amazon’s expansive product knowledge graph, and the various roles that machine learning p...

18 Feb 202143min

Towards a Systems-Level Approach to Fair ML with Sarah M. Brown - #456

Towards a Systems-Level Approach to Fair ML with Sarah M. Brown - #456

Today we’re joined by Sarah Brown, an Assistant Professor of Computer Science at the University of Rhode Island. In our conversation with Sarah, whose research focuses on Fairness in AI, we discuss w...

15 Feb 202137min

AI for Digital Health Innovation with Andrew Trister - #455

AI for Digital Health Innovation with Andrew Trister - #455

Today we’re joined by Andrew Trister, Deputy Director for Digital Health Innovation at the Bill & Melinda Gates Foundation.  In our conversation with Andrew, we explore some of the AI use cases at th...

11 Feb 202141min

System Design for Autonomous Vehicles with Drago Anguelov - #454

System Design for Autonomous Vehicles with Drago Anguelov - #454

Today we’re joined by Drago Anguelov, Distinguished Scientist and Head of Research at Waymo.  In our conversation, we explore the state of the autonomous vehicles space broadly and at Waymo, includin...

8 Feb 202150min

Building, Adopting, and Maturing LinkedIn's Machine Learning Platform with Ya Xu - #453

Building, Adopting, and Maturing LinkedIn's Machine Learning Platform with Ya Xu - #453

Today we’re joined by Ya Xu, head of Data Science at LinkedIn, and TWIMLcon: AI Platforms 2021 Keynote Speaker. We cover a ton of ground with Ya, starting with her experiences prior to becoming Head ...

4 Feb 202149min

Expressive Deep Learning with Magenta DDSP w/ Jesse Engel - #452

Expressive Deep Learning with Magenta DDSP w/ Jesse Engel - #452

Today we’re joined by Jesse Engel, Staff Research Scientist at Google, working on the Magenta Project.  In our conversation with Jesse, we explore the current landscape of creativity AI, and the role...

1 Feb 202139min

Semantic Folding for Natural Language Understanding with Francisco Weber - #451

Semantic Folding for Natural Language Understanding with Francisco Weber - #451

Today we’re joined by return guest Francisco Webber, CEO & Co-founder of Cortical.io. Francisco was originally a guest over 4 years and 400 episodes ago, where we discussed his company Cortical.io, a...

29 Jan 202155min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
aftenpodden-usa
forklart
stopp-verden
popradet
det-store-bildet
fotballpodden-2
dine-penger-pengeradet
rss-gukild-johaug
bt-dokumentar-2
nokon-ma-ga
lydartikler-fra-aftenposten
aftenbla-bla
hanna-de-heldige
rss-dannet-uten-piano
e24-podden
frokostshowet-pa-p5
rss-ness
rss-penger-polser-og-politikk