Building LLM-Based Applications with Azure OpenAI with Jay Emery - #657

Building LLM-Based Applications with Azure OpenAI with Jay Emery - #657

Today we’re joined by Jay Emery, director of technical sales & architecture at Microsoft Azure. In our conversation with Jay, we discuss the challenges faced by organizations when building LLM-based applications, and we explore some of the techniques they are using to overcome them. We dive into the concerns around security, data privacy, cost management, and performance as well as the ability and effectiveness of prompting to achieve the desired results versus fine-tuning, and when each approach should be applied. We cover methods such as prompt tuning and prompt chaining, prompt variance, fine-tuning, and RAG to enhance LLM output along with ways to speed up inference performance such as choosing the right model, parallelization, and provisioned throughput units (PTUs). In addition to that, Jay also shared several intriguing use cases describing how businesses use tools like Azure Machine Learning prompt flow and Azure ML AI Studio to tailor LLMs to their unique needs and processes. The complete show notes for this episode can be found at twimlai.com/go/657.

Episoder(781)

Exploring Large Language Models with ChatGPT - #603

Exploring Large Language Models with ChatGPT - #603

Today we're joined by ChatGPT, the latest and coolest large language model developed by OpenAl. In our conversation with ChatGPT, we discuss the background and capabilities of large language models, t...

8 Des 202236min

Accelerating Intelligence with AI-Generating Algorithms with Jeff Clune - #602

Accelerating Intelligence with AI-Generating Algorithms with Jeff Clune - #602

Are AI-generating algorithms the path to artificial general intelligence(AGI)?  Today we’re joined by Jeff Clune, an associate professor of computer science at the University of British Columbia, and...

5 Des 202256min

Programmatic Labeling and Data Scaling for Autonomous Commercial Aviation with Cedric Cocaud - #601

Programmatic Labeling and Data Scaling for Autonomous Commercial Aviation with Cedric Cocaud - #601

Today we’re joined by Cedric Cocaud, the chief engineer of the Wayfinder Group at Acubed, the innovation center for aircraft manufacturer Airbus. In our conversation with Cedric, we explore some of th...

28 Nov 202254min

Engineering Production NLP Systems at T-Mobile with Heather Nolis - #600

Engineering Production NLP Systems at T-Mobile with Heather Nolis - #600

Today we’re joined by Heather Nolis, a principal machine learning engineer at T-Mobile. In our conversation with Heather, we explored her machine learning journey at T-Mobile, including their initial ...

21 Nov 202243min

Sim2Real and Optimus, the Humanoid Robot with Ken Goldberg - #599

Sim2Real and Optimus, the Humanoid Robot with Ken Goldberg - #599

Today we’re joined by return guest Ken Goldberg, a professor at UC Berkeley and the chief scientist at Ambi Robotics. It’s been a few years since our initial conversation with Ken, so we spent a bit o...

14 Nov 202247min

The Evolution of the NLP Landscape with Oren Etzioni - #598

The Evolution of the NLP Landscape with Oren Etzioni - #598

Today friend of the show and esteemed guest host John Bohannon is back with another great interview, this time around joined by Oren Etzioni, former CEO of the Allen Institute for AI, where he is curr...

7 Nov 202253min

Live from TWIMLcon! The Great MLOps Debate: End-to-End ML Platforms vs Specialized Tools - #597

Live from TWIMLcon! The Great MLOps Debate: End-to-End ML Platforms vs Specialized Tools - #597

Over the last few years, it’s been established that your ML team needs at least some basic tooling in order to be effective, providing support for various aspects of the machine learning workflow, fro...

31 Okt 202247min

Live from TWIMLcon! You're not Facebook. Architecting MLOps for B2B Use Cases with Jacopo Tagliabue - #596

Live from TWIMLcon! You're not Facebook. Architecting MLOps for B2B Use Cases with Jacopo Tagliabue - #596

Much of the way we talk and think about MLOps comes from the perspective of large consumer internet companies like Facebook or Google. If you work at a FAANG company, these approaches might work well ...

24 Okt 202249min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
forklart
aftenpodden-usa
popradet
stopp-verden
lydartikler-fra-aftenposten
i-retten
rss-gukild-johaug
fotballpodden-2
det-store-bildet
dine-penger-pengeradet
nokon-ma-ga
rss-ness
hanna-de-heldige
aftenbla-bla
bt-dokumentar-2
e24-podden
frokostshowet-pa-p5
rss-dannet-uten-piano