Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure and how Arize AI will start their go to market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explained how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he revealed a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through compelling case studies from Booking.com, Flipkart, and AT&T, Patrick explained how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. Patrick concluded by sharing his vision for success across Asia Pacific's diverse markets - from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam - emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shares what great would look like for Arize AI in the Asia Pacific

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] Open Telemetry and Open Inference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production AB testing
[14:36] Phoenix open source and Open Inference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Korea and Singapore MAS launches AI risk management framework
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AILinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The proper credits for the intro and end music are "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio format.

Episoder(515)

The State of China in 2022 with Shai Oster

The State of China in 2022 with Shai Oster

"If you look at over 30 years, China equities have returned zero, basically gone up and gone back down to where it was 30 years ago. It was shocking. So the other thing that is hard to believe, I thi...

14 Des 202250min

The 3AC Demise with Kyle Davies

The 3AC Demise with Kyle Davies

"What we did instead is we did not. The market rallied. We were just clouded in our judgment and we were not scaling the firm the way we should have. So that was probably one big one. The other one wa...

8 Des 202251min

Polygon with Sandeep Nailwal

Polygon with Sandeep Nailwal

"For me, it's only about daily active users. I feel in the next five years, especially for the whole ecosystem. I'm not talking about Polygon first - let's say for the whole blockchain ecosystem… In [...

5 Des 202226min

Roche Pharmaceuticals and Transforming Healthcare in Asia Pacific with Ahmed Elhusseiny

Roche Pharmaceuticals and Transforming Healthcare in Asia Pacific with Ahmed Elhusseiny

"Because of the pandemic, we all became patients. We all became aware of diagnostic tools [and] therapeutic interventions. The level of awareness that exists now around this specific topic is very hi...

29 Nov 202227min

ServiceNow & the Rise of Digital-native companies in the Asia Pacific with Mitch Young

ServiceNow & the Rise of Digital-native companies in the Asia Pacific with Mitch Young

"If you think about a modern digital business, what does that mean? Well, it translates to personalized, proactive service and support. At a minimum, it's also the ability to deliver and provide acces...

24 Nov 202233min

Autodesk in Asia Pacific & Growth by Choice with Haresh Khoobchandani

Autodesk in Asia Pacific & Growth by Choice with Haresh Khoobchandani

"Because you can learn the hard skills, but if you don't know what holds you back, you don't know who you are, you don't know what your triggers are, you don't know what motivates you, what upsets you...

21 Nov 202248min

Reality Platforms and Decentralized Mapping in the Metaverse with Alex Chung

Reality Platforms and Decentralized Mapping in the Metaverse with Alex Chung

"My version of success here is to build the most valuable data set in web3. Hopefully, if we do our jobs right, it'll probably be just one of the most valuable data sets in the world, period. When we ...

16 Nov 202255min

Analyse Asia & China with Carol Yin & Bernard Leong

Analyse Asia & China with Carol Yin & Bernard Leong

"So, because I think the way I've always been thinking about this podcast is (as) a brand, I think we talked about it just now, the quality of the guests we curate is now a challenge for everyone who ...

11 Nov 202254min

Populært innen Business og økonomi

stopp-verden
dine-penger-pengeradet
e24-podden
rss-penger-polser-og-politikk
rss-borsmorgen-okonominyhetene
pengepodden-2
utbytte
finansredaksjonen
morgenkaffen-med-finansavisen
tid-er-penger-en-podcast-med-peter-warren
pengesnakk
livet-pa-veien-med-jan-erik-larssen
rss-sunn-okonomi
okonomiamatorene
lederpodden
rss-markedspuls-2
rss-fa-makro
boligbobla
lederskap-nhhs-podkast-om-ledelse
rss-impressions-2