dou, Liam Fedus, Tarun Gogineni, Rapha Gontijo-Lopes, Jonathan
Gordon, Joost Huizinga, Shawn Jain, Roger Jiang, Łukasz Kaiser,
Christina Kim, Jan Leike, Chak Li, Stephanie Lin, Ryan Lowe, Jacob
Menick, Luke Metz, Pamela Mishkin, Tong Mu, Oleg Murk, Ashvin
Nair, Long Ouyang, Alex Passos, Michael (Rai) Pokorny, Vitchyr
Pong, Shibani Santurkar, Daniel Selsam, Sarah Shoker, Carroll Wain-
wright, Matt Wiethoff, Jeff Wu, Kai Xiao, Kevin Yu, Marvin Zhang,
Chong Zhang, William Zhuk, Barret Zoph
Data infrastructure
10
Irwan Bello, Lenny Bogdonoff, Juan Felipe Cerón Uribe, Joshua
Gross, Shawn Jain, Haozhun Jin, Christina Kim, Aris Konstantinidis,
Teddy Lee, David Medina, Jacob Menick, Luke Metz, Ashvin Nair,
Long Ouyang, Michael (Rai) Pokorny, Vitchyr Pong, John Schulman,
Jonathan Ward, Jiayi Weng, Matt Wiethoff, Sarah Yoo, Kevin Yu,
Wojciech Zaremba, William Zhuk, Barret Zoph
ChatML format
10
Ilge Akkaya, Christina Kim, Chak Li, Rachel Lim, Jacob Menick,
Luke Metz, Andrey Mishchenko, Vitchyr Pong, John Schulman,
Carroll Wainwright, Barret Zoph
Model safety
10
Josh Achiam, Steven Adler, Juan Felipe Cerón Uribe, Hyung Won
Chung, Tyna Eloundou, Rapha Gontijo-Lopes, Shixiang Shane Gu,
Johannes Heidecke, Joost Huizinga, Teddy Lee, Jan Leike, Stephanie
Lin, Ryan Lowe, Todor Markov, Luke Metz, Tong Mu, Shibani
Santurkar, John Schulman, Andrea Vallone, Carroll Wainwright, Jason
Wei, Lilian Weng, Kai Xiao, Chong Zhang, Marvin Zhang, Barret Zoph
Refusals
10
Juan Felipe Cerón Uribe, Tyna Eloundou, Johannes Heidecke, Joost
Huizinga, Jan Leike, Stephanie Lin, Ryan Lowe, Pamela Mishkin,
Tong Mu, Carroll Wainwright, Lilian Weng, Kai Xiao, Chong Zhang,
Barret Zoph
Foundational RLHF and InstructGPT work
10
Diogo Almeida, Joost Huizinga, Roger Jiang, Jan Leike, Stephanie Lin,
Ryan Lowe, Pamela Mishkin, Dan Mossing, Long Ouyang, Katarina
Slama, Carroll Wainwright, Jeff Wu, Kai Xiao, Marvin Zhang
Flagship training runs
10
Greg Brockman, Liam Fedus, Johannes Heidecke, Joost Huizinga,
Roger Jiang, Kyle Kosic, Luke Metz, Ashvin Nair, Jiayi Weng, Chong
Zhang, Shengjia Zhao, Barret Zoph
Code capability
10
Ilge Akkaya, Mo Bavarian, Jonathan Gordon, Shawn Jain, Haozhun
Jin, Teddy Lee, Chak Li, Oleg Murk, Ashvin Nair, Vitchyr Pong,
Benjamin Sokolowsky, Jerry Tworek, Matt Wiethoff, Sarah Yoo, Kevin
Yu, Wojciech Zaremba, William Zhuk
Evaluation & analysis
Core contributors
10
Sandhini Agarwal System card co-lead
Lama Ahmad Expert red teaming & adversarial testing program lead
Mo Bavarian Capability prediction co-lead
Tyna Eloundou Safety evaluations co-lead
Andrew Kondrich OpenAI Evals open-sourcing co-lead
Gretchen Krueger System card co-lead
Michael Lampe Privacy and PII evaluations lead
Pamela Mishkin Economic impact & overreliance evaluations lead
Benjamin Sokolowsky Capability prediction co-lead
Jack Rae Research benchmark execution lead
Chelsea Voss Eval execution lead
Alvin Wang OpenAI Evals lead
Kai Xiao Safety evaluations co-lead
Marvin Zhang OpenAI Evals open-sourcing co-lead
OpenAI Evals library
10
Shixiang Shane Gu, Angela Jiang, Logan Kilpatrick, Andrew Kon-
drich, Pamela Mishkin, Jakub Pachocki, Ted Sanders, Jessica Shieh,
Alvin Wang, Marvin Zhang
Model-graded evaluation infrastructure
10
Liam Fedus, Rapha Gontijo-Lopes, Shixiang Shane Gu, Andrew
Kondrich, Michael (Rai) Pokorny, Wojciech Zaremba, Chong Zhang,
Marvin Zhang, Shengjia Zhao, Barret Zoph
Acceleration forecasting
10
Alan Hickey, Daniel Kokotajlo, Cullen O’Keefe, Sarah Shoker
ChatGPT evaluations
10
Juan Felipe Cerón Uribe, Hyung Won Chung, Rapha Gontijo-Lopes,
Liam Fedus, Luke Metz, Michael Rai Pokorny, Jason Wei, Shengjia
Zhao, Barret Zoph
Capability evaluations
10
Tyna Eloundou, Shengli Hu, Roger Jiang, Jamie Kiros, Teddy Lee,
Scott Mayer McKinney, Jakub Pachocki, Alex Paino, Giambattista
Parascandolo, Boris Power, Raul Puri, Jack Rae, Nick Ryder, Ted
Sanders, Szymon Sidor, Benjamin Sokolowsky, Chelsea Voss, Alvin
Wang, Rowan Zellers, Juntang Zhuang
Coding evaluations
10
Ilge Akkaya, Mo Bavarian, Jonathan Gordon, Shawn Jain, Chak Li,
Oleg Murk, Vitchyr Pong, Benjamin Sokolowsky, Jerry Tworek, Kevin
Yu, Wojciech Zaremba
Real-world use case evaluations
10
Andrew Kondrich, Joe Palermo, Boris Power, Ted Sanders
Contamination investigations
10
Adrien Ecoffet, Roger Jiang, Ingmar Kanitscheider, Scott Mayer
McKinney, Alex Paino, Giambattista Parascandolo, Jack Rae, Qiming
Yuan
Instruction following and API evals
10
Diogo Almeida, Carroll Wainwright, Marvin Zhang
Novel capability discovery
10
Filipe de Avila Belbute Peres, Kevin Button, Fotis Chantzis, Mike
Heaton, Wade Hickey, Xin Hu, Andrew Kondrich, Matt Knight, An-
drew Mayne, Jake McNeil, Vinnie Monaco, Joe Palermo, Joel Parish,
Boris Power, Bob Rotsted, Ted Sanders
Vision evaluations
10
Shixiang Shane Gu, Shengli Hu, Jamie Kiros, Hyeonwoo Noh, Raul
Puri, Rowan Zellers
Economic impact evaluation
10
Tyna Eloundou, Sam Manning, Aalok Mehta, Pamela Mishkin
Non-proliferation, international humanitarian law & national
security red teaming
10
Sarah Shoker
Overreliance analysis
10
Miles Brundage, Michael Lampe, Pamela Mishkin
Privacy and PII evaluations
10
Michael Lampe, Vinnie Monaco, Ashley Pantuliano
Safety and policy evaluations
10
Josh Achiam, Sandhini Agarwal, Lama Ahmad, Jeff Belgum, Tyna
Eloundou, Johannes Heidecke, Shengli Hu, Joost Huizinga, Jamie
Kiros, Gretchen Krueger, Michael Lampe, Stephanie Lin, Ryan Lowe,
Todor Markov, Vinnie Monaco, Tong Mu, Raul Puri, Girish Sastry,
Andrea Vallone, Carroll Wainwright, CJ Weinmann, Lilian Weng, Kai
Xiao, Chong Zhang
OpenAI adversarial testers
10
Josh Achiam, Steven Adler, Lama Ahmad, Shyamal Anadkat, Red
Avila, Gabriel Bernadett-Shapiro, Anna-Luisa Brakman, Tim Brooks,
Miles Brundage, Chelsea Carlson, Derek Chen, Hyung Won Chung,
Jeremiah Currier, Daniel Kokotajlo, David Dohan, Adrien Ecoffet,
Juston Forte, Vik Goel, Ryan Greene, Johannes Heidecke, Alan Hickey,
Shengli Hu, Joost Huizinga, Janko, Tomer Kaftan, Ali Kamali, Nitish
Shirish Keskar, Tabarak Khan, Hendrik Kirchner, Daniel Kokotajlo,
Gretchen Krueger, Michael Lampe, Teddy Lee, Molly Lin, Ryan
Lowe, Todor Markov, Jake McNeil, Pamela Mishkin, Vinnie Monaco,
Daniel Mossing, Tong Mu, Oleg Murk, Cullen O’Keefe, Joe Palermo,
Giambattista Parascandolo, Joel Parish, Boris Power, Alethea Power,
Cameron Raymond, Francis Real, Bob Rotsted, Mario Salterelli, Sam
Wolrich, Ted Sanders, Girish Sastry, Sarah Shoker, Shyamal Anadkat,
Yang Song, Natalie Staudacher, Madeleine Thompson, Elizabeth
Tseng, Chelsea Voss, Jason Wei, Chong Zhang
System card & broader impacts analysis
10
Steven Adler, Sandhini Agarwal, Lama Ahmad, Janko Altenschmidt,
Jeff Belgum, Gabriel Bernadett-Shapiro, Miles Brundage, Derek Chen,
16