Hi, I'm Jack! I lead the "model psychiatry" team at Anthropic. I'm interested in understanding the cognition of modern language models, so that we can make them more reliable and aligned with human values. Previously, I did my PhD in the Center for Theoretical Neuroscience at Columbia University, where I studied learning mechanisms in the brain and their relationship to machine learning algorithms. For a list of my publications, see my Google Scholar profile.