Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure, and how Arize AI will begin its go-to-market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explains how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he shares a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through case studies from Booking.com, Flipkart, and AT&T, Patrick explains how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. He then shares his vision for success across Asia Pacific's diverse markets, from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam, emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shares what great would look like for Arize AI in Asia Pacific.

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] OpenTelemetry and OpenInference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production A/B testing
[14:36] Phoenix open source and OpenInference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Korea and Singapore's MAS launch AI risk management frameworks
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AI
LinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The proper credits for the intro and end music are "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio format.
