Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure and how Arize AI will start their go to market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explained how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he revealed a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through compelling case studies from Booking.com, Flipkart, and AT&T, Patrick explained how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. Patrick concluded by sharing his vision for success across Asia Pacific's diverse markets - from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam - emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shares what great would look like for Arize AI in the Asia Pacific

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] Open Telemetry and Open Inference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production AB testing
[14:36] Phoenix open source and Open Inference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Korea and Singapore MAS launches AI risk management framework
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AILinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The proper credits for the intro and end music are "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio format.

Avsnitt(515)

Why Southeast Asia Matters with Gullnaz Baig

Why Southeast Asia Matters with Gullnaz Baig

"People always ask us this question, what can I learn from that country? People ask us about this from the report especially when we're talking to policymakers, or what should I learn from Malaysia? W...

1 Jan 202550min

The Transition to Electric Buses: The KMB Experience & Philanthrophy with William Louey

The Transition to Electric Buses: The KMB Experience & Philanthrophy with William Louey

“ Yeah. We plan to replace the fleet by 2040. We can't scrap all of them now because each bus lasts 18 years. So we have the depreciation for 18 years. They're all from the U.K. All the buses are from...

17 Dec 202443min

The China Business Conundrum with Ken Wilcox

The China Business Conundrum with Ken Wilcox

"But what I didn't realize is that the main reason they wanted us in China was so that they could study our business model and figure out how to copy it over time. And that was something I wasn't expe...

3 Dec 202457min

ExtraOrdinary: From Stay-at-Home Mom to Global Entrepreneur with Yvon Bock

ExtraOrdinary: From Stay-at-Home Mom to Global Entrepreneur with Yvon Bock

"So, my definition of being fearless is not about no fear, but it's having that courage to face your fear, to conquer whatever adversities that are thrown your way. it is part of the entrepreneur jour...

26 Nov 202441min

Unlocking the Power of Generative AI in Dow Jones with Ingrid Verschuren

Unlocking the Power of Generative AI in Dow Jones with Ingrid Verschuren

"So, we are very conscious of the fact that we license the content from other publications. And as I mentioned previously, we do that through licensing agreements. We are transparent with the publishe...

19 Nov 202431min

From Research to Real-World Impact: AI in Action with Sun Sumei

From Research to Real-World Impact: AI in Action with Sun Sumei

"The efficiency on top of the efficacy because it is critical that we will achieve this balance of cost and performance. And in addition to that, I think one more aspect we are putting emphasis is act...

14 Nov 202425min

The e-Conomy Southeast Asia  2024 Report with Sapna Chadha, Fock Wai Hoong & Florian Hoppe

The e-Conomy Southeast Asia 2024 Report with Sapna Chadha, Fock Wai Hoong & Florian Hoppe

"The key message of the report is that the fundamentals of this region are critical, they’re clear, and businesses are doing exactly, I think, what they need to do for us to move ahead." - Sapna Chadh...

5 Nov 202440min

Data Centres in the Era of AI with Jay Park

Data Centres in the Era of AI with Jay Park

Fresh out of the studio, Jay Park, Chief Development Officer of Digital Edge, explores the rapidly transforming landscape of data centres in the Asia-Pacific region. Kicking off with the story of Jay’...

29 Okt 202447min

Populärt inom Business & ekonomi

badfluence
framgangspodden
rss-jossan-nina
varvet
rss-borsens-finest
uppgang-och-fall
avanzapodden
svd-tech-brief
fill-or-kill
bathina-en-podcast
lastbilspodden
borsmorgon
rss-inga-dumma-fragor-om-pengar
rss-kort-lang-analyspodden-fran-di
kapitalet-en-podd-om-ekonomi
rss-dagen-med-di
rss-den-nya-ekonomin
affarsvarlden
rss-borslunch
rikatillsammans-om-privatekonomi-rikedom-i-livet