rabbit community // break free
Alignment faking in large language models
the world of ai
JohnMaguire
December 23, 2024, 10:29pm
1
1 Like
Related topics
Topic
Replies
Views
Activity
Hidden Secrets of Large Language Models
the world of ai
0
51
June 24, 2024
Silly to use Reddit for training?
the world of ai
6
104
July 6, 2024
@AnthropicAI Unveils Claude 3.5 Sonnet
the world of ai
0
39
June 24, 2024
What do people think of the new Claude Computer use model?
the world of ai
7
81
October 23, 2024
automatic language recognition
suggestions & ideas
4
47
March 10, 2025