We gratefully acknowledge support from
the Simons Foundation and member institutions.

Deep Ganguli and Nicholas Schiefer are qualified to endorse.

The Capacity for Moral Self-Correction in Large Language Models

Deep Ganguli: Is registered as an author of this paper.
Can endorse for cs.AI. (why?)
Nicholas Schiefer: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CL, cs.DB, cs.DC, cs.DS, cs.HC, cs.LG. (why?)

Amanda Askell, Thomas I. Liao, Kamilė Lukošiūtė, Anna Chen, Anna Goldie, Azalia Mirhoseini, Catherine Olsson, Danny Hernandez, Dawn Drain, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jackson Kernion, Jamie Kerr, Jared Mueller, Joshua Landau, Kamal Ndousse, Karina Nguyen, Liane Lovitt, Michael Sellitto, Nelson Elhage, Noemi Mercado, Nova DasSarma, Oliver Rausch, Robert Lasenby, Robin Larson, Sam Ringer, Sandipan Kundu, Saurav Kadavath, Scott Johnston, Shauna Kravec, Sheer El Showk, Tamera Lanham, Timothy Telleen-Lawton, Tom Henighan, Tristan Hume, Yuntao Bai, Zac Hatfield-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, Christopher Olah, Jack Clark, Samuel R. Bowman and Jared Kaplan are not registered as owners of this paper. (why?)