Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure and how Arize AI will start their go to market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explained how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he revealed a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through compelling case studies from Booking.com, Flipkart, and AT&T, Patrick explained how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. Patrick concluded by sharing his vision for success across Asia Pacific's diverse markets - from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam - emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shares what great would look like for Arize AI in the Asia Pacific

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] Open Telemetry and Open Inference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production AB testing
[14:36] Phoenix open source and Open Inference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Korea and Singapore MAS launches AI risk management framework
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AILinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The proper credits for the intro and end music are "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio format.

Episoder(515)

Deploy and PropTech in 2024 with Jordan Kostelac

Deploy and PropTech in 2024 with Jordan Kostelac

"I think the biggest learning that I had, and, by extension, JLL [Jones Lang LaSalle] in my last capacity was that it's not sufficient to take these companies and then just let them in the door. You c...

1 Mai 202453min

Akamai Cloud Computing & The Age of Edge AI with Jay Jenkins

Akamai Cloud Computing & The Age of Edge AI with Jay Jenkins

"So already this year we've rolled out 10 of these regions. So two in APJ and Kuala Lumpur and Hong Kong. But 75 of these locations by the end of the year along with our core computing regions will gi...

17 Apr 202445min

Saison Capital and Real World Asset Monetization in Crypto with Qin En Looi

Saison Capital and Real World Asset Monetization in Crypto with Qin En Looi

"The one thing that has become quite clear in Asia, at least it's direct to retail - the government is not ready. The market is not ready. And there still needs to be a very high level of consumer pro...

9 Apr 202445min

Digital Report 2024 and why Generative AI did not show up this time with Simon Kemp

Digital Report 2024 and why Generative AI did not show up this time with Simon Kemp

"The key thing that I realize every time I look at the data is that the media is telling us a lot of nonsense. I think that the one thing I know is that the data tells a very different story to the me...

2 Apr 20241h 6min

Is TikTok going to be banned in the US? An Asian Perspective with Jing Yang

Is TikTok going to be banned in the US? An Asian Perspective with Jing Yang

"If we assume that happens, I'm sure TikTok, let alone ByteDance as a company as a whole, will survive - and maybe even continue to thrive after this. Because, let's put things into perspective, right...

26 Mar 202446min

Don't Ignore Asia Tech with Catherine Shu

Don't Ignore Asia Tech with Catherine Shu

"When I wrote about it, in addition to assuming that all Asia tech companies, particularly in China, were copycats of Western companies, I think there are also a lot of misperceptions about how easy i...

20 Mar 202449min

Will the stocks from China recover & Asian Century Stocks with Michael Fritzell

Will the stocks from China recover & Asian Century Stocks with Michael Fritzell

“Let's remember if the leadership wants to do something, they have all the tools at their disposal and I don't think that they will want to underwrite massive unemployment. I think the Chinese leaders...

27 Feb 202451min

How Arta Finance Democratizes Wealth Management with Caesar Sengupta

How Arta Finance Democratizes Wealth Management with Caesar Sengupta

"So in our view, great would be when everyone can find the best place to put their money to work. To whatever cause they want. It could be investing. It could be protecting their family. It could be l...

14 Feb 202437min

Populært innen Business og økonomi

stopp-verden
dine-penger-pengeradet
e24-podden
rss-penger-polser-og-politikk
rss-borsmorgen-okonominyhetene
pengepodden-2
utbytte
pengesnakk
finansredaksjonen
morgenkaffen-med-finansavisen
tid-er-penger-en-podcast-med-peter-warren
livet-pa-veien-med-jan-erik-larssen
rss-sunn-okonomi
okonomiamatorene
lederpodden
rss-markedspuls-2
rss-fa-makro
boligbobla
rss-andelige-tanker-med-camillo
lederskap-nhhs-podkast-om-ledelse