Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure and how Arize AI will start their go to market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explained how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he revealed a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through compelling case studies from Booking.com, Flipkart, and AT&T, Patrick explained how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. Patrick concluded by sharing his vision for success across Asia Pacific's diverse markets - from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam - emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shares what great would look like for Arize AI in the Asia Pacific

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] Open Telemetry and Open Inference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production AB testing
[14:36] Phoenix open source and Open Inference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Korea and Singapore MAS launches AI risk management framework
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AILinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The proper credits for the intro and end music are "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio format.

Episoder(515)

The Evolution of Tech Giants in China with Rui Ma

The Evolution of Tech Giants in China with Rui Ma

"But if you look at Generative AI, that is fundamentally a different way that technology came about and it required a lot of investment without knowing what was going to transpire. So I've talked to a...

30 Apr 202351min

The Web3 Gaming Renaissance in the Asia Pacific with Serkan Toto

The Web3 Gaming Renaissance in the Asia Pacific with Serkan Toto

"Yes a lot of the blockchain companies don't know how to make games, and a lot of the game companies don't know how to incorporate blockchain into their games properly; there's really some truth to th...

13 Apr 202338min

Building a Tech-Savvy Workforce and Amazon Web Services Training & Certification in Asia Pacific with Emmanuel Pillai

Building a Tech-Savvy Workforce and Amazon Web Services Training & Certification in Asia Pacific with Emmanuel Pillai

"So at the Asia Pacific level, this study looked at several countries. But you know, we looked at a country level because that's the information that our customers and partners are interested in - whe...

9 Apr 202337min

TKX Capital and Investing into Crypto in Asia with Chris Lee

TKX Capital and Investing into Crypto in Asia with Chris Lee

"The first thing is that the crypto market is still quite small compared to the equity [market]. [The size of the crypto market to equity market is about] one to 120, less than 0.8%, not even 1%. So, ...

28 Mar 202347min

USDC Depegging and what it means for Web3 and Crypto with Cosmo Jiang

USDC Depegging and what it means for Web3 and Crypto with Cosmo Jiang

Fresh out of the studio and on the 2nd emergency podcast within the week, Cosmo Jiang from Nova River and host of the Global Coin Research Liquid podcast explained the implications of USDC depegging d...

16 Mar 202331min

Silicon Valley Bank's collapse & its impact on the Asia startup ecosystem with Shai Oster

Silicon Valley Bank's collapse & its impact on the Asia startup ecosystem with Shai Oster

Fresh out of the studio and this is an emergency podcast, Shai Oster from The Information discusses the fallout coming from the collapse of Silicon Valley Bank over the past weekend, and what it means...

14 Mar 202327min

Kleros 2.0 & Decentralized Arbitration in Web3 with Federico Ast

Kleros 2.0 & Decentralized Arbitration in Web3 with Federico Ast

"What if you could just outsource your disputes to Kleros? Whenever users have a dispute on your platform, Kleros will select a jury to analyze the dispute. They're going to see the evidence, see the ...

9 Mar 20231h 27min

Digital Report 2023 on Myth of Social Media Dying, Streaming Wars & Generative AI with Simon Kemp

Digital Report 2023 on Myth of Social Media Dying, Streaming Wars & Generative AI with Simon Kemp

"The media have been feeding us this fake news story about the death of social media. There is absolutely nothing in the data - regardless of what data points I look at, there's nothing in the data th...

1 Mar 20231h 12min

Populært innen Business og økonomi

stopp-verden
dine-penger-pengeradet
e24-podden
rss-penger-polser-og-politikk
rss-borsmorgen-okonominyhetene
pengepodden-2
utbytte
finansredaksjonen
morgenkaffen-med-finansavisen
tid-er-penger-en-podcast-med-peter-warren
pengesnakk
livet-pa-veien-med-jan-erik-larssen
rss-sunn-okonomi
okonomiamatorene
lederpodden
rss-markedspuls-2
rss-fa-makro
boligbobla
lederskap-nhhs-podkast-om-ledelse
rss-impressions-2