Scaling LLMs and Accelerating Adoption with Aidan Gomez at Cohere

On this episode, we’re joined by Aidan Gomez, Co-Founder and CEO at Cohere. Cohere develops and releases a range of innovative AI-powered tools and solutions for a variety of NLP use cases.

We discuss:

- What “attention” means in the context of ML.

- Aidan’s role in the “Attention Is All You Need” paper.

- What state-space models (SSMs) are, and how they could be an alternative to transformers.

- What it means for an ML architecture to saturate compute.

- The data constraints that emerge as LLMs scale.

- Challenges of measuring LLM performance.

- How Cohere is positioned within the LLM development space.

- Insights around scaling down an LLM into a more domain-specific one.

- Concerns around synthetic content and AI changing public discourse.

- The importance of raising money at healthy milestones for AI development.

Aidan Gomez - https://www.linkedin.com/in/aidangomez/

Cohere - https://www.linkedin.com/company/cohere-ai/

Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.

Resources:

- https://cohere.ai/

- “Attention Is All You Need”

#OCR #DeepLearning #AI #Modeling #ML

Episodes (136)

Uber, Nissan, and Mercedes Chose This Self-Driving Startup | Alex Kendall, Wayve

"Every vehicle is capable of driverless operation. That's clearly the steady state of where we're going." Wayve started in a rented house in Cambridge with $1.5M, a car in the garage, and an aim to int...

15 Apr 45min

Why Netflix, Uber, and Spotify Never Lag: The Database Nobody Talks About | Aaron Katz

"Companies designing for agents, not humans, are going to get a lot of lift." ClickHouse started as an internal tool at Yandex. Today it's the database Anthropic, OpenAI, Meta and Tesla all run on. In t...

31 Mar 43min

The $64M Bet on an AI That Has to Be Right | Carina Hong, CEO of Axiom

Formal verification already consumes years of human effort. In this episode, Lukas Biewald talks with Carina Hong, Founder & CEO of Axiom, about why verification is becoming the real bottleneck in high...

5 Feb 50min

What a $42B Software Co. Really Spends on AI Tools

“I don't worry about being replaced by AI. I worry about being replaced by someone who's really good at using AI.” Atlassian has 10,000+ engineers currently split-testing the world’s top AI coding tool...

20 Jan 1h 7min

Inside the $41B AI Cloud Challenging Big Tech | CoreWeave SVP

The future of AI training is shaped by one constraint: keeping GPUs fed. In this episode, Lukas Biewald talks with CoreWeave SVP Corey Sanders about why general-purpose clouds start to break down under...

6 Jan 53min

Why Physical AI Needed a Completely New Data Stack

The future of AI is physical. In this episode, Lukas Biewald talks to Nikolaus West, CEO of Rerun, about why the breakthrough required to get AI out of the lab and into the messy real world is blocked...

16 Dec 2025 1h

The Engineering Behind the World’s Most Advanced Video AI

Is video AI a viable path toward AGI? Runway ML founder Cristóbal Valenzuela joins Lukas Biewald just after Gen 4.5 reached the #1 position on the Video Arena Leaderboard, according to community votin...

1 Dec 2025 14min

The CEO Behind the Fastest-Growing AI Inference Company | Tuhin Srivastava

In this episode of Gradient Dissent, Lukas Biewald talks with Tuhin Srivastava, CEO and founder of Baseten, one of the fastest-growing companies in the AI inference ecosystem. Tuhin shares the real st...

18 Nov 2025 59min
