Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure, and how Arize AI will start its go-to-market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explains how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he shares a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through compelling case studies from Booking.com, Flipkart, and AT&T, Patrick explains how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. Patrick then shares his vision for success across Asia Pacific's diverse markets, from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam, emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shares what great would look like for Arize AI in Asia Pacific.

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] OpenTelemetry and OpenInference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production A/B testing
[14:36] Phoenix open source and OpenInference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Regulatory frameworks in Korea and Singapore: MAS launches AI risk management framework
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AI
LinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The intro and end music is "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio formats.
