Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Arize AI in Asia Pacific: LLM Evaluation, Observability & Scale with Patrick Kelly

Fresh out of the studio, Patrick Kelly, Vice President for Asia Pacific at Arize AI, joins us to explore the critical world of AI observability, evaluation, and infrastructure and how Arize AI will start their go to market across the region. Beginning with his transition from Databricks to Arize AI, Patrick explained how the company's mission centers on making AI work for people by helping teams observe, evaluate, and continuously improve their AI agents in production. Emphasizing that evaluations are the most important requirement for AI systems in 2025-2026, he revealed a striking insight: approximately 50% of AI agents fail silently in production because organizations don't know what's happening. Through compelling case studies from Booking.com, Flipkart, and AT&T, Patrick explained how Arize AI enables real-time observability and online evaluations, achieving results like 40% accuracy improvements and 84% cost reductions. Patrick concluded by sharing his vision for success across Asia Pacific's diverse markets - from regulatory frameworks in Korea and Singapore to language localization challenges in Vietnam - emphasizing the three pillars that remain constant: helping customers make money, control costs, and manage risk in an era where AI governance has become paramount. Last but not least, he shares what great would look like for Arize AI in the Asia Pacific

"The mission is to make AI work for the people. It’s about getting AI working for everybody—consumers, customers, and businesses at large. Evals are the most important things that we’ve seen through 2025 and will see more of into 2026; they are the most important thing for systems to work. When I'm working with a customer, I ask: How are we going to help them make money? How are we going to help them control costs? And how are we going to help them manage risk? A lot of AI now is about managing risk."

Episode Highlights:
[00:00] Quote of the Day by Patrick Kelly
[01:10] Bernard introduces AI evaluation and infrastructure topic
[02:24] Patrick's journey from Databricks to Arize AI
[03:20] Arize AI's mission: making AI work for people
[04:00] Understanding agentic systems and their complexity
[05:18] Observability, evaluation, and development framework explained
[06:27] Creating continuous feedback loops for AI improvement
[07:00] On-premises and air-gapped deployment capabilities
[08:00] Open Telemetry and Open Inference standards
[09:08] Evaluations are critical for 2025-2026 success
[10:36] Booking.com case: real-time production AB testing
[14:36] Phoenix open source and Open Inference: entry to Arize ecosystem
[16:00] Travel industry use cases: Skyscanner and Flipkart
[17:53] AT&T case: 40% accuracy improvement, 84% cost reduction
[19:36] 50% of production agents fail silently
[20:26] Korea and Singapore MAS launches AI risk management framework
[22:08] Arize AI CEO's 10 predictions for AI 2026
[22:41] Cursor for X: AI engineering everywhere
[24:06] Context and session state matter critically
[26:27] Harness: new buzzword for agent orchestration
[34:13] Three pillars: make money, control costs, manage risk
[36:00] Asia Pacific diversity: India to Japan
[37:12] Language and cultural nuances in evaluations
[38:00] Closing

Profile: Patrick Kelly, Vice President, Asia Pacific, Arize AILinkedIn Profile: https://www.linkedin.com/in/patrick-kelly-aab6168/?ref=analyse.asia

Podcast Information: Bernard Leong hosts and produces the show. The proper credits for the intro and end music are "Energetic Sports Drive." G. Thomas Craig mixed and edited the episode in both video and audio format.

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(530)

We Never Left the Industrial Age: AI and the Future of Work with Aneesh Raman

We Never Left the Industrial Age: AI and the Future of Work with Aneesh Raman

Fresh out of the studio, Aneesh Raman, Chief Economic Opportunity Officer at LinkedIn and co-author of Open to Work: How to Get Ahead in the Age of AI, joins us to dismantle the flattened narrative th...

1 Jul 41min

If AI Models Have No Moat, What Are Investors Buying? with Benedict Evans

If AI Models Have No Moat, What Are Investors Buying? with Benedict Evans

Fresh out of the studio, Benedict Evans, independent technology analyst and author of AI Eats the World, returns to explore whether the AI model layer is becoming commodity infrastructure. Benedict ar...

24 Jun 57min

Inside "Defending Taiwan": How to prevent a war between China and the US with Eyck Freymann

Inside "Defending Taiwan": How to prevent a war between China and the US with Eyck Freymann

Fresh out of the studio, Eyck Freymann, Hoover fellow at Stanford and author of Defending Taiwan: A Strategy to Prevent War with China, joins us to explore why the Taiwan question will be decided by e...

16 Jun 1h 2min

Innovationism: A New Philosophy for the Age of AI with James Liang

Innovationism: A New Philosophy for the Age of AI with James Liang

Fresh out of the studio, James Liang — Co-founder and Executive Chairman of Trip.com Group, economist, and author of Innovationism: A New Philosophy for the Age of AI — joins us to explore what become...

10 Jun 1h 1min

Incorruptible: The Chapter The Lean Startup Missed with Eric Ries

Incorruptible: The Chapter The Lean Startup Missed with Eric Ries

Fresh out of the studio, Eric Ries — author of the new book Incorruptible, founder of the Long-Term Stock Exchange, co-founder of Answer.AI, and author of The Lean Startup — joins Bernard Leong to dis...

3 Jun 46min

Steve Jobs in Exile with Geoffrey Cain

Steve Jobs in Exile with Geoffrey Cain

Fresh out of the studio, Geoffrey Cain, author of Steve Jobs in Exile and Samsung Rising, returns to the Analyse Podcast to argue that the twelve years between Jobs's 1985 ouster and his 1997 return t...

27 Mai 1h 2min

Inside Singapore's AI Bet for 2030 with Kiren Kumar

Inside Singapore's AI Bet for 2030 with Kiren Kumar

Fresh out of the studio, Bernard Leong sits down with Kiren Kumar, Deputy Chief Executive of the Infocomm Media Development Authority (IMDA) Singapore, for a conversation on how Singapore is building ...

18 Mai 48min

Inside Pulse ID's Playbook for AI-Driven Banking with Alex Topaloski

Inside Pulse ID's Playbook for AI-Driven Banking with Alex Topaloski

Fresh out of the studio, Alex Topaloski, CEO and Co-founder of Pulse ID joined us in a conversation on his company's customer engagement infrastructure powering Visa's cardholder offers across Asia Pa...

14 Mai 42min

Populært innen Business og økonomi

stopp-verden
dine-penger-pengeradet
lydartikler-fra-aftenposten
rss-penger-polser-og-politikk
e24-podden
rss-borsmorgen-okonominyhetene
rss-skravla-gar
pengepodden-2
rss-pa-konto
finansredaksjonen
livet-pa-veien-med-jan-erik-larssen
aftenbladet-intervjuer
utbytte
tid-er-penger-en-podcast-med-peter-warren
morgenkaffen-med-finansavisen
lederpodden
liberal-halvtime
okonomiamatorene
pengesnakk
rss-politisk-preik