Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure and how Arize AI will start their go to market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explained how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he revealed a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through compelling case studies from Booking.com, Flipkart, and AT&T, Patrick explained how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. Patrick concluded by sharing his vision for success across Asia Pacific's diverse markets - from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam - emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shares what great would look like for Arize AI in the Asia Pacific

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] Open Telemetry and Open Inference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production AB testing
[14:36] Phoenix open source and Open Inference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Korea and Singapore MAS launches AI risk management framework
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AILinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The proper credits for the intro and end music are "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio format.

Episoder(515)

Uber in Asia Pacific & The Future of Mobility with Pradeep Parameswaran

Uber in Asia Pacific & The Future of Mobility with Pradeep Parameswaran

Fresh out of the studio, Pradeep Parameswaran, President - Mobility from Uber APAC joined us in a conversation on Uber's continued focus on mobility in Asia Pacific and the future of mobility. We begi...

18 Jul 202139min

The Grab SPAC with Jon Russell & Nadine Freischlad

The Grab SPAC with Jon Russell & Nadine Freischlad

Fresh out of the studio, Jon Russell & Nadine Fresischlad from The Ken joined us to discuss the 40 billion Grab SPAC (and largest in the world till date) and its impact to the entire Southeast startup...

16 Mai 202136min

The Two Sessions in China 2021 with Zhou Xin

The Two Sessions in China 2021 with Zhou Xin

Fresh out of the studio, in episode 341, Zhou Xin, political economy editor from South China Morning Post, joined us in a conversation to discuss the Two Sessions in China for the year 2021, and what ...

26 Apr 202138min

The potential Gojek-Tokopedia merger with Rama Mamuaya

The potential Gojek-Tokopedia merger with Rama Mamuaya

Fresh out of the studio, in episode 340, Rama Mamuaya from DailySocial in Indonesia joined us to discuss a potential merger between Gojek, Indonesia's largest ride-hailing app and Tokopedia, the large...

5 Apr 202139min

China AI Deep Dive: Computer Vision Report 2020 with John Artman

China AI Deep Dive: Computer Vision Report 2020 with John Artman

Fresh out of the studio, in episode 339, John Artman, the technology editor of South China Morning Post (SCMP) joins us on a conversation with China AI Deep Dive: Computer Vision Report 2020 published...

31 Jan 202136min

Reflections and Predictions on China and SoftBank in 2020 with Shai Oster

Reflections and Predictions on China and SoftBank in 2020 with Shai Oster

Fresh out of the studio, in episode 338, Shai Oster, the Asia Bureau chief for The Information is back on his annual review with us again to discuss the state of China technology giants and SoftBank i...

1 Jan 20211h 6min

The Ant Group's Botched IPO with Rui Ma

The Ant Group's Botched IPO with Rui Ma

In episode 337, Rui Ma from TechBuzz China podcast joined us in a conversation to break down what just happened to Ant Group's IPO and why it was halted at the eleven hour by the Chinese government. A...

20 Des 202046min

The SCMP China Fintech Report 2020 with Eugene Tang

The SCMP China Fintech Report 2020 with Eugene Tang

In episode 336, Eugene Tang, business editor from the South China Morning Post (SCMP) joined us to discuss the China Fintech Report 2020 where he dissect the latest important trends in the fintech mar...

27 Nov 202027min

Populært innen Business og økonomi

stopp-verden
dine-penger-pengeradet
e24-podden
rss-penger-polser-og-politikk
rss-borsmorgen-okonominyhetene
lydartikler-fra-aftenposten
pengepodden-2
utbytte
finansredaksjonen
pengesnakk
livet-pa-veien-med-jan-erik-larssen
morgenkaffen-med-finansavisen
okonomiamatorene
tid-er-penger-en-podcast-med-peter-warren
rss-sunn-okonomi
stormkast-med-valebrokk-stordalen
lederpodden
rss-fa-makro
rss-markedspuls-2
boligbobla