Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure and how Arize AI will start their go to market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explained how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he revealed a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through compelling case studies from Booking.com, Flipkart, and AT&T, Patrick explained how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. Patrick concluded by sharing his vision for success across Asia Pacific's diverse markets - from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam - emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shares what great would look like for Arize AI in the Asia Pacific

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] Open Telemetry and Open Inference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production AB testing
[14:36] Phoenix open source and Open Inference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Korea and Singapore MAS launches AI risk management framework
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AILinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The proper credits for the intro and end music are "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio format.

Avsnitt(515)

Resetting Expectations on Southeast Asia with Arnaud Bonzom

Resetting Expectations on Southeast Asia with Arnaud Bonzom

"So that's why if you have 1 billion to invest, we're not expecting the same return as if you invest 10 million. At that time, when all this money flowed to Southeast Asia, people there thought, "Oh, ...

23 Okt 20241h 4min

Learnovate, AI and EduTech with Joon Nak Choi

Learnovate, AI and EduTech with Joon Nak Choi

"The humans are going to be empowered to become superheroes like Tony Stark, and because you have your loyal A.I. assistant, Jarvis, doing all this stuff in the background, that's the example I always...

7 Okt 202454min

AI, Creativity and the Human Element with Tan Siok Siok

AI, Creativity and the Human Element with Tan Siok Siok

"Well, I think AI makes us, makes me more human in terms of understanding that nirvana or the ultimate achievement is not to be perfect. The ultimate achievement is to be authentic, present, and yours...

23 Sep 202446min

Disrupting E-Commerce: How Shein and Temu are Challenging Amazon's Reign with Jing Yang

Disrupting E-Commerce: How Shein and Temu are Challenging Amazon's Reign with Jing Yang

"Temu was launched in the US, their first market in September 2022. That is when Shein just started to gain a lot of traction. There's a lot of attention being paid to Shein. In the beginning, many pe...

12 Aug 202446min

How Netflix bring Asian Content to the Global Audience with Minyoung Kim

How Netflix bring Asian Content to the Global Audience with Minyoung Kim

“The most important lesson that I have learned is really Know your audience. You need to really understand your audience, and what they want because oftentimes the content executives make the mistake ...

18 Juni 202435min

Google & Sustainability in Europe with Adam Elman

Google & Sustainability in Europe with Adam Elman

"We have these very ambitious goals and we've spoken about a few, but there are many others in Europe. In particular, the E.U. and countries like the U.K. have very ambitious climate goals. For Google...

13 Juni 202430min

Executive Coaching in the Asia Pacific with Parin Mehta

Executive Coaching in the Asia Pacific with Parin Mehta

"I don't like to work with someone unless they're intrinsically motivated to do it. And so what I mean by that is, I don't think this is a service that you should sell to people. They should demand it...

4 Juni 202452min

Product Management in a Scale-Up with Isaac Tay

Product Management in a Scale-Up with Isaac Tay

"If I were to summarize it, your job as a product manager is to deliver the right product to the right users, to solve the right problems at the right time. Now, your role is very contextual because y...

13 Maj 20241h 4min

Populärt inom Business & ekonomi

badfluence
framgangspodden
rss-jossan-nina
varvet
rss-borsens-finest
uppgang-och-fall
avanzapodden
svd-tech-brief
fill-or-kill
bathina-en-podcast
lastbilspodden
borsmorgon
rss-inga-dumma-fragor-om-pengar
rss-kort-lang-analyspodden-fran-di
kapitalet-en-podd-om-ekonomi
rss-dagen-med-di
rss-den-nya-ekonomin
affarsvarlden
rss-borslunch
rikatillsammans-om-privatekonomi-rikedom-i-livet