Hi, I'm Jack! I'm interested in understanding the cognition of modern language models, so that we can make them more reliable and aligned with human values. Currently, I lead the "model psychiatry" team at Anthropic. Previously, I did my PhD in the Center for Theoretical Neuroscience at Columbia University, where I studied learning mechanisms in the brain and their relationship to machine learning algorithms. For a list of my publications, see my Google Scholar profile.