Building LLM-Based Applications with Azure OpenAI with Jay Emery - #657

Building LLM-Based Applications with Azure OpenAI with Jay Emery - #657

Today we’re joined by Jay Emery, director of technical sales & architecture at Microsoft Azure. In our conversation with Jay, we discuss the challenges faced by organizations when building LLM-based applications, and we explore some of the techniques they are using to overcome them. We dive into the concerns around security, data privacy, cost management, and performance as well as the ability and effectiveness of prompting to achieve the desired results versus fine-tuning, and when each approach should be applied. We cover methods such as prompt tuning and prompt chaining, prompt variance, fine-tuning, and RAG to enhance LLM output along with ways to speed up inference performance such as choosing the right model, parallelization, and provisioned throughput units (PTUs). In addition to that, Jay also shared several intriguing use cases describing how businesses use tools like Azure Machine Learning prompt flow and Azure ML AI Studio to tailor LLMs to their unique needs and processes. The complete show notes for this episode can be found at twimlai.com/go/657.

Episoder(779)

Algorithmic Injustices and Relational Ethics with Abeba Birhane - #348

Algorithmic Injustices and Relational Ethics with Abeba Birhane - #348

Today we’re joined by Abeba Birhane, PhD Student at University College Dublin and author of the recent paper Algorithmic Injustices: Towards a Relational Ethics, which was the recipient of the Best Pa...

13 Feb 202041min

AI for Agriculture and Global Food Security with Nemo Semret - #347

AI for Agriculture and Global Food Security with Nemo Semret - #347

Today we’re excited to kick off our annual Black in AI Series joined by Nemo Semret, CTO of Gro Intelligence. Gro provides an agricultural data platform dedicated to improving global food security, fo...

10 Feb 20201h 4min

Practical Differential Privacy at LinkedIn with Ryan Rogers - #346

Practical Differential Privacy at LinkedIn with Ryan Rogers - #346

Today we’re joined by Ryan Rogers, Senior Software Engineer at LinkedIn, to discuss his paper “Practical Differentially Private Top-k Selection with Pay-what-you-get Composition.” In our conversation,...

7 Feb 202033min

Networking Optimizations for Multi-Node Deep Learning on Kubernetes with Erez Cohen - #345

Networking Optimizations for Multi-Node Deep Learning on Kubernetes with Erez Cohen - #345

Today we conclude the KubeCon ‘19 series joined by Erez Cohen, VP of CloudX & AI at Mellanox, who we caught up with before his talk “Networking Optimizations for Multi-Node Deep Learning on Kubernetes...

5 Feb 202031min

Managing Research Needs at the University of Michigan using Kubernetes w/ Bob Killen - #344

Managing Research Needs at the University of Michigan using Kubernetes w/ Bob Killen - #344

Today we’re joined by Bob Killen, Research Cloud Administrator at the University of Michigan. In our conversation, we explore how Bob and his group at UM are deploying Kubernetes, the user experience,...

3 Feb 202025min

Scalable and Maintainable Workflows at Lyft with Flyte w/ Haytham AbuelFutuh and Ketan Umare - #343

Scalable and Maintainable Workflows at Lyft with Flyte w/ Haytham AbuelFutuh and Ketan Umare - #343

Today we kick off our KubeCon ‘19 series joined by Haytham AbuelFutuh and Ketan Umare, a pair of software engineers at Lyft. We caught up with Haytham and Ketan at KubeCo, where they were presenting t...

30 Jan 202045min

Causality 101 with Robert Osazuwa Ness - #342

Causality 101 with Robert Osazuwa Ness - #342

Today Robert Osazuwa Ness, ML Research Engineer at Gamalon and Instructor at Northeastern University joins us to discuss Causality, what it means, and how that meaning changes across domains and users...

27 Jan 202039min

PaccMann^RL: Designing Anticancer Drugs with Reinforcement Learning w/ Jannis Born - #341

PaccMann^RL: Designing Anticancer Drugs with Reinforcement Learning w/ Jannis Born - #341

Today we’re joined by Jannis Born, Ph.D. student at ETH & IBM Research Zurich, to discuss his “PaccMann^RL” research. Jannis details how his background in computational neuroscience applies to this re...

23 Jan 202042min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
aftenpodden-usa
forklart
popradet
stopp-verden
det-store-bildet
bt-dokumentar-2
rss-gukild-johaug
dine-penger-pengeradet
nokon-ma-ga
lydartikler-fra-aftenposten
fotballpodden-2
hanna-de-heldige
frokostshowet-pa-p5
rss-penger-polser-og-politikk
aftenbla-bla
e24-podden
rss-dannet-uten-piano
rss-ness