This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
Mikita Balesni
Posts
Sorted by New
14
How I select alignment research projects
20d
4
24
A starter guide for evals
4mo
0
30
Understanding strategic deception and deceptive alignment
7mo
0
57
Paper: LLMs trained on “A is B” fail to learn “B is A”
7mo
0
44
Paper: On measuring situational awareness in LLMs
8mo
13
88
Announcing Apollo Research
1y
4
Wiki Contributions
Comments