153. LLM Inference with Bedrock
AWS Bites6 Maalis

153. LLM Inference with Bedrock

If you’re curious about building with LLMs, but you want to skip the hype and learn what it takes to ship something reliable in production, this episode is for you.We share our real-world experience building AI-powered apps and the gotchas you hit after the demo: tokens and cost, quotas and throttling, IAM and access friction, marketplace subscriptions, and structured outputs that do not break your JSON parser.We focus on Amazon Bedrock as AWS’s managed inference layer: how to get started with the current access model, how to choose models, how pricing works, and what to watch for in production.We also go deep on structured outputs: constrained decoding, schema design that improves output quality, and how to avoid “grammar compilation timed out”.


In this episode, we mentioned the following resources:


Do you have any AWS questions you would like us to address?

Leave a comment here or connect with us on X/Twitter, BlueSky or LinkedIn:


- ⁠https://twitter.com/eoins⁠ | ⁠https://bsky.app/profile/eoin.sh⁠ | ⁠https://www.linkedin.com/in/eoins/⁠

- ⁠https://twitter.com/loige⁠ | ⁠https://bsky.app/profile/loige.co⁠ | ⁠https://www.linkedin.com/in/lucianomammino/

Jaksot(157)

152. Exploring Lambda Durable Functions

152. Exploring Lambda Durable Functions

AWS Lambda is fantastic for small, stateless code on demand. But when your “function” starts looking like a workflow (retries, backoff, long waits, human approvals, callbacks), classic Lambda patterns...

6 Helmi 48min

151. EC2 ❤️ Lambda - Lambda Managed Instances

151. EC2 ❤️ Lambda - Lambda Managed Instances

AWS just made Lambda… less serverless. Lambda Managed Instances (Lambda MI) brings managed EC2 capacity into Lambda, and it changes the rules: environments stay warm, a single environment can handle m...

16 Tammi 35min

150. Exploring All-New ECS Managed Instances (MI) Mode

150. Exploring All-New ECS Managed Instances (MI) Mode

Love AWS Fargate, but occasionally hit the “I need more control” wall (GPUs, storage, network bandwidth, instance sizing)? In this episode of AWS Bites, Eoin and Luciano put the brand-new Amazon ECS M...

28 Marras 202526min

149. Headless CMS on AWS

149. Headless CMS on AWS

We dive deep into Strapi, an open-source headless CMS that's changing how we approach content management and CRUD applications. In this episode, we explore how Strapi separates content from presentati...

18 Syys 202533min

148. Lambda and Java with Mark Sailes

148. Lambda and Java with Mark Sailes

What if “Java is too old for serverless” is the myth holding your team back in 2025?We put it to the test with Mark Sailes (ex-AWS, Java-on-Lambda expert). In this AWS Bites episode we get specific: w...

4 Syys 202546min

147. Spec coding with Kiro

147. Spec coding with Kiro

What if AWS built an IDE to rival your favorite editor? Turns out they did!In this episode of AWS Bites, we dive into Kiro, an AI centric fork of VS Code that tries to turn an empty repo and a loose i...

21 Elo 202538min

146. The AWS Product Lifecycle page

146. The AWS Product Lifecycle page

What if AWS had its own version of Killed by Google? Well… turns out it kind of does.In this episode of AWS Bites, we explore the AWS Product Lifecycle page, the official place where AWS quietly lists...

10 Heinä 202515min