SE Radio 661: Sunil Mallya on Small Language Models

SE Radio 661: Sunil Mallya on Small Language Models

Sunil Mallya, co-founder and CTO of Flip AI, discusses small language models with host Brijesh Ammanath. They begin by considering the technical distinctions between SLMs and large language models.

LLMs excel in generating complex outputs across various natural language processing tasks, leveraging extensive training datasets on with massive GPU clusters. However, this capability comes with high computational costs and concerns about efficiency, particularly in applications that are specific to a given enterprise. To address this, many enterprises are turning to SLMs, fine-tuned on domain-specific datasets. The lower computational requirements and memory usage make SLMs suitable for real-time applications. By focusing on specific domains, SLMs can achieve greater accuracy and relevance aligned with specialized terminologies.

The selection of SLMs depends on specific application requirements. Additional influencing factors include the availability of training data, implementation complexity, and adaptability to changing information, allowing organizations to align their choices with operational needs and constraints.

This episode is sponsored by Codegate.

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(726)

SE Radio 705: Murat Erder and Eoin Woods on Continuous Architecture

SE Radio 705: Murat Erder and Eoin Woods on Continuous Architecture

Murat Erder, CTO for Financial Services at Valtech in Europe, and Eoin Woods, independent consultant in the field of software architecture, join host Giovanni Asproni to talk about Continuous Architec...

27 Tammi 57min

SE Radio 704: Sriram Panyam on System Design Interviews

SE Radio 704: Sriram Panyam on System Design Interviews

Sriram Panyam returns to the show to discuss the system design interview (SDI) with host Robert Blumen. This challenging part of the hiring process is included in the interview loop for many jobs acro...

21 Tammi 44min

SE Radio 703: Sahaj Garg on Low Latency AI

SE Radio 703: Sahaj Garg on Low Latency AI

In this episode, Sahaj Garg, CTO of wispr.ai, joins SE Radio host Robert Blumen to talk about the challenges of building low-latency AI applications. They discuss latency's effect on consumer behavior...

14 Tammi 54min

SE Radio 702: Derick Schaefer on Modern CLIs

SE Radio 702: Derick Schaefer on Modern CLIs

Derick Schaefer, author of CLI: A Practical Guide to Creating Modern Command-Line Interfaces, talks with host Robert Blumen about command-line interfaces old and new. Starting with a short review of t...

7 Tammi 48min

SE Radio 701: Max Guernsey, III and Luniel de Beer on Readiness in Software Engineering

SE Radio 701: Max Guernsey, III and Luniel de Beer on Readiness in Software Engineering

Max Geurnsey III and Luniel de Beer, co-authors of the book Ready: Why Most Software Projects Fail and How to Fix It, discuss the concept of readiness in software engineering with host Brijesh Ammanat...

30 Joulu 20251h 1min

SE Radio 700: Mojtaba Sarooghi on Waiting Rooms for High-Traffic Events

SE Radio 700: Mojtaba Sarooghi on Waiting Rooms for High-Traffic Events

Mojtaba Sarooghi, a Distinguished Product Architect at Queue-it, speaks with host Jeremy Jung about virtual waiting rooms for high-traffic events such as concerts and limited-quantity product releases...

23 Joulu 20251h 7min

SE Radio 699: Benjamin Brial on Internal Dev Platforms

SE Radio 699: Benjamin Brial on Internal Dev Platforms

In this episode, Benjamin Brial, CEO and co-founder of Cycloid, speaks with host Sriram Panyam about internal developer platforms (IDPs) and internal developer portals. The conversation explores how t...

17 Joulu 202555min

SE Radio 698: Srujana Merugu on How to build an LLM App

SE Radio 698: Srujana Merugu on How to build an LLM App

In this episode of Software Engineering Radio, Srujana Merugu, an AI researcher with decades of experience, speaks with host Priyanka Raghavan about building LLM-based applications. The discussion beg...

9 Joulu 20251h 18min

Suosittua kategoriassa Koulutus

rss-murhan-anatomia
psykopodiaa-podcast
voi-hyvin-meditaatiot-2
adhd-podi
rss-rahamania
rss-laadukasta-ensihoitoa
rss-vapaudu-voimaasi
kesken
psykologia
rss-liian-kuuma-peruna
rss-narsisti
rss-niinku-asia-on
rss-arkea-ja-aurinkoa-podcast-espanjasta
rss-hereilla
dreamtalk
rss-keskeneraiset-aidit
ihminen-tavattavissa-tommy-hellsten-instituutti
rss-duodecim-lehti
rss-valo-minussa-2
rss-tfa-8020-podcast