Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure, and how Arize AI will start its go-to-market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explains how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he reveals a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through case studies from Booking.com, Flipkart, and AT&T, Patrick explains how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. He then shares his vision for success across Asia Pacific's diverse markets, from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam, emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shares what great would look like for Arize AI in the Asia Pacific.

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] Open Telemetry and Open Inference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production AB testing
[14:36] Phoenix open source and Open Inference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Korea and Singapore MAS launches AI risk management framework
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AI
LinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The proper credits for the intro and end music are "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio format.

Episodes (515)

Episode 120: LINE IPO & SoftBank’s divestments with Serkan Toto - Analyse Asia with Bernard Leong

Serkan Toto from Kantan Games joined us for a conversation on the three recent events that impact the mobile gaming market from Japan to the rest of the world. We analysed the progress of Nintendo’s new m...

15 Jun 2016, 23 min

Episode 119: From Newsroom to Digital Media in Asia with Alan Soon - Analyse Asia with Bernard Leong

Alan Soon from the Splice Newsroom and Rockstart Accelerator joined us in a conversation about the media business in Asia. Drawing on his experience in the media business from Bloomberg, CNBC to ...

10 Jun 2016, 42 min

Episode 118: The State of IoT in Asia Pacific with Charles Reed Anderson - Analyse Asia with Bernard Leong

Charles Reed Anderson from IDC joined us to discuss the state of Internet of Things (IoT) in Asia Pacific. We started with the analysis of the IoT market maturity index across China, India and the res...

6 Jun 2016, 26 min

Episode 117: Why Apple Invest in Didi with Josh Horwitz - Analyse Asia with Bernard Leong

In this episode, Josh Horwitz from Quartz joined us in a conversation to dissect why Apple has invested in China’s largest ride-hailing app, Didi Chuxing, and the implications for Uber in their plans t...

2 Jun 2016, 32 min

Episode 116: Mashable in Asia with Michael Kriak and Gwendolyn Regina - Analyse Asia with Bernard Leong

Michael Kriak & Gwendolyn Regina from Mashable joined us for a conversation on Mashable.com and its recent strategy on video & expansion to Asia. We discussed how Mashable has built out a global media...

27 May 2016, 23 min

Episode 115: Singapore Startup Ecosystem & Asia Funding Trends with Arnaud Bonzom - Analyse Asia with Bernard Leong

Arnaud Bonzom from 500 Startups continued our conversation on his two interesting reports (prior to the 500 Corporations report which we discussed earlier) that focus on the Singapore startup ecosyste...

22 May 2016, 36 min

Episode 114: Will Apple’s Asia & Car strategy work? with Sameer Singh - Analyse Asia with Bernard Leong

Continuing our discussion from the last episode, Sameer Singh from Tech-thoughts.net analysed the recent Apple Q1 2016 earnings and challenged the notion whether Apple’s Asia (India and China) and their...

18 May 2016, 35 min

Episode 113: Facebook vs Asia Messaging Apps with Sameer Singh - Analyse Asia with Bernard Leong

Sameer Singh from Tech-thoughts.net joined us to reflect on the major themes that have been ongoing in the technology space, from messaging apps to self-driving cars. In the first part, we discussed ...

14 May 2016, 27 min
