Search the Top VC & Business Podcasts

14 results for “cognitive performance”

O3 and the Next Leap in Reasoning with OpenAI’s Eric Mitchell and Brandon McKinzie

So I I don't think it sounds too far fetched to me. Yeah. I mean, I I think the the the thing that came up earlier of the also the, like, intelligence per cost thing, you know, the the real world is, like, an interesting litmus test because at the en

from transcript 23:20 – 24:23

This Week in Startups

TWiST 500 interviews with Cortical Labs, Turing, AND Mercor | E2159

...of domain performance, outside of just pure code generation.

from transcript 59:57 – 1:01:03

Latent Space

Scaling Test Time Compute to Multi-Agent Civilizations — Noam Brown, OpenAI

you know, the measurements to really define that clearly. But I think it's pretty clear. You know, people try chain of thought with GPT, like, really small models, and they saw that it just didn't really do anything. Then you go to bigger models, and

from transcript 9:50 – 10:56

Lex Fridman Podcast

#475 – Demis Hassabis: Future of AI, Simulating Reality, Physics and Video Games

...performance in other areas. Right? So that's the hard part because you you can of course, you could put more coding data in or you could put more,

from transcript 1:38:58 – 1:40:03

Latent Space

Scaling Test Time Compute to Multi-Agent Civilizations — Noam Brown, OpenAI

that is what we're doing here. Yeah. And you could argue that actually this is not that different from, like, I guess, the, the system one, system two paradigm because, you know, if you ask, like, a pigeon to think really hard about playing chess, yo

from transcript 10:34 – 12:21

Lex Fridman Podcast

#475 – Demis Hassabis: Future of AI, Simulating Reality, Physics and Video Games

...of cognitive tasks that, you know, we know that humans can do, and maybe also make the system available to a few hundred of the world's top experts, the Terence Taos of each subject area, and see if they can find you know, give them give them a month

from transcript 1:00:00 – 1:01:00

This Week in Startups

TWiST 500 interviews with Cortical Labs, Turing, AND Mercor | E2159

...out of domain performance, outside of just pure code generation. And coding and math is also interesting because sometimes when you ask complex questions, Alex, the sub steps involve being able to compute stuff or calculate stuff and pass the results

from transcript 1:00:41 – 1:01:44

No Priors

The Best of 2025 (So Far) with Sarah Guo and Elad Gil

especially in climate, especially in, kinda like, agriculture, food security, you can't think of this as, you know, like shots on goal and this and that. You've gotta kind of say, hey, we can get better at this. Reasoning is the biggest paradigm shif

from transcript 10:27 – 11:44

No Priors

O3 and the Next Leap in Reasoning with OpenAI’s Eric Mitchell and Brandon McKinzie

And so we're trying to, like, iteratively kind of, you know, deploy these things and, like, try them out and figure out, like, where are they reliable, you know, and where are they not. Because yeah. Like, if you did just let the model control your c

from transcript 18:00 – 19:09

This Week in Startups

Why data is the biggest AI bottleneck (feat. Arthur Mensch of Mistral AI) | E2212

...performance is usually how enterprises think about it when they run their evaluations themselves. So that's, that's why I wouldn't put too much money on the benchmarks. It's still useful. Certain of them are. The the the lower you are from the the fa

from transcript 35:13 – 36:18

Latent Space

Why Anthropic Thinks AI Should Have Its Own Computer — Felix Rieseberg of Claude Cowork & Claude Code Desktop

on the, that's only exclusive to Claude Cowork. We have some tricks for this sort of like change week over week, we eval Cowork maybe against different use cases than we would eval Claude Code, right? If you think about it this way. Okay. So like cloc

from transcript 19:35 – 20:38

Building Great Tech

Is AI making us Stupid?

“TikTok scrolling destroys your memory worse than random guessing”

...performance is barely better than random guessing. So, apparently, if you What does that mean? So, basically, if you opened TikTok and scroll through a lot of lot of, sort of clips Uh-huh. After that,

market insight 9:31 – 10:00

The Twenty Minute VC (20VC)

20VC: Cognition CEO Scott Wu on Acquiring Windsurf: The Process, The Deal, The Rationale | Did Google Overlook a Goldmine in the Core Asset and Did Founders Leave a Sinking Ship | How Cursor and Cognition Deal with Ever Increasing Reliance on Anthropic with Scott Wu

The thing dude, we were in between. Like, the lawyers all pulled an all nighter as well going and getting this because it was, like, yeah. I mean, we we need to get this ready to go, but, you know, and there's just all the various little things of th

from transcript 40:06 – 41:36

Latent Space

[AIEWF Preview] Multi-Turn RL for Multi-Hour Agents — with Will Brown, Prime Intellect

So I think they're also, yeah, downplaying whether or not it's reasoning or not. I think they're trying to merge everything together. And it's not I I mean, I didn't realize that, but extended thinking could not use tools before the way they worded i

from transcript 3:44 – 4:48

