Filtered by speaker:Sander Schulhoff×
Searching...
Searching...
2 results for “digital safety” by Sander Schulhoff
people can still find a prompt that makes it say whatever they want. Cool. Alright. Keep going. Yeah. So again, yeah, yeah, just summarize there, like, any data that AI has access to, the user can make it leak it. Any actions that it can possibly tak
deploying an LM and I wanted to be better protected, I would put a guardrail model kinda in front of and behind it. So one guardrail watches all inputs, and if it sees something like, you know, tell me how to build a bomb, it flags that. It's like, n
Have a podcast?
Get ranked clips, hooks, and ready-to-post copy from your own episodes. Free to try.