Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure, and how Arize AI will start its go-to-market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explained how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he revealed a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through compelling case studies from Booking.com, Flipkart, and AT&T, Patrick explained how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. Patrick concluded by sharing his vision for success across Asia Pacific's diverse markets - from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam - emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shared what great would look like for Arize AI in the Asia Pacific.

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] OpenTelemetry and OpenInference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production A/B testing
[14:36] Phoenix open source and OpenInference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Korea and Singapore MAS launches AI risk management framework
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AI
LinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The intro and end music is "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio format.
