#417: Test-Driven Prompt Engineering for LLMs with Promptimize

#417: Test-Driven Prompt Engineering for LLMs with Promptimize

Large language models and chat-based AIs are kind of mind blowing at the moment. Many of us are playing with them for working on code or just as a fun alternative to search. But others of us are building applications with AI at the core. And when doing that, the slightly unpredictable nature and probabilistic nature of LLMs make writing and testing Python code very tricky. Enter promptimize from Maxime Beauchemin and Preset. It's a framework for non- deterministic testing of LLMs inside our applications. Let's dive inside the AIs with Max.

Avsnitt(542)

#543: Deep Agents: LangChain's SDK for Agents That Plan and Delegate

#543: Deep Agents: LangChain's SDK for Agents That Plan and Delegate

When you type a question into ChatGPT, the model only has what you typed to work with. But tools like Claude Code can plan, iterate, test, and recover from mistakes. They work more like we do. The dif...

1 Apr 1h 3min

#542: Zensical - a modern static site generator

#542: Zensical - a modern static site generator

If you've built documentation in the Python ecosystem, chances are you've used Martin Donath's work. His Material for MKDocs powers docs for FastAPI, uv, AWS, OpenAI, and tens of thousands of other pr...

25 Mars 1h 4min

#541: Monty - Python in Rust for AI

#541: Monty - Python in Rust for AI

When LLMs write code to accomplish a task, that code has to actually run somewhere. And right now, the options aren't great. Spin up a sandboxed container and you're paying a full second of cold start...

19 Mars 1h 5min

#540: Modern Python monorepo with uv and prek

#540: Modern Python monorepo with uv and prek

Monorepos -- you've heard the talks, you've read the blog posts, maybe you've seen a few tantalizing glimpses into how Google or Meta organize their massive codebases. But it's often in the abstract a...

13 Mars 1h 2min

#539: Catching up with the Python Typing Council

#539: Catching up with the Python Typing Council

You're adding type hints to your Python code, your editor is happy, autocomplete is working great. But then you switch tools and suddenly there are red squiggles everywhere. Who decides what a float a...

6 Mars 1h 1min

#538: Python in Digital Humanities

#538: Python in Digital Humanities

Digital humanities sounds niche, until you realize it can mean a searchable archive of U.S. amendment proposals, Irish folklore, or pigment science in ancient art. Today I’m talking with David Flood f...

28 Feb 1h 12min

#537: Datastar: Modern web dev, simplified

#537: Datastar: Modern web dev, simplified

You love building web apps with Python, and HTMX got you excited about the hypermedia approach -- let the server drive the HTML, skip the JavaScript build step, keep things simple. But then you hit th...

21 Feb 1h 16min

#536: Fly inside FastAPI Cloud

#536: Fly inside FastAPI Cloud

You've built your FastAPI app, it's running great locally, and now you want to share it with the world. But then reality hits -- containers, load balancers, HTTPS certificates, cloud consoles with 200...

10 Feb 1h 7min

Populärt inom Teknik

uppgang-och-fall
elbilsveckan
bilar-med-sladd
market-makers
rss-elektrikerpodden
rss-technokratin
skogsforum-podcast
har-vi-akt-till-mars-an
rss-veckans-ai
rss-laddstationen-med-elbilen-i-sverige
developers-mer-an-bara-kod
gubbar-som-tjotar-om-bilar
bli-saker-podden
rss-powerboat-sverige-podcast
hej-bruksbil
rss-milpodden
natets-morka-sida
rss-en-ai-till-kaffet
rss-snacka-om-ai
rss-rapporterat