Dataset contributions
11
Diogo Almeida, Mo Bavarian, Juan Felipe Cerón Uribe, Tyna Eloun-
dou, Liam Fedus, Tarun Gogineni, Rapha Gontijo-Lopes, Jonathan
Gordon, Joost Huizinga, Shawn Jain, Roger Jiang, Łukasz Kaiser,
Christina Kim, Jan Leike, Chak Ming Li, Stephanie Lin, Ryan Lowe,
Jacob Menick, Luke Metz, Pamela Mishkin, Tong Mu, Oleg Murk,
Ashvin Nair, Long Ouyang, Alex Passos, Michael (Rai) Pokorny,
Vitchyr Pong, Shibani Santurkar, Daniel Selsam, Sarah Shoker, Car-
roll Wainwright, Matt Wiethoff, Jeff Wu, Kai Xiao, Kevin Yu, Marvin
Zhang, Chong Zhang, William Zhuk, Barret Zoph
Data infrastructure
11
Irwan Bello, Lenny Bogdonoff, Juan Felipe Cerón Uribe, Joshua
Gross, Shawn Jain, Haozhun Jin, Christina Kim, Aris Konstantinidis,
Teddy Lee, David Medina, Jacob Menick, Luke Metz, Ashvin Nair,
Long Ouyang, Michael (Rai) Pokorny, Vitchyr Pong, John Schulman,
Jonathan Ward, Jiayi Weng, Matt Wiethoff, Sarah Yoo, Kevin Yu,
Wojciech Zaremba, William Zhuk, Barret Zoph
ChatML format
11
Ilge Akkaya, Christina Kim, Chak Ming Li, Rachel Lim, Jacob
Menick, Luke Metz, Andrey Mishchenko, Vitchyr Pong, John Schul-
man, Carroll Wainwright, Barret Zoph
Model safety
11
Josh Achiam, Steven Adler, Juan Felipe Cerón Uribe, Hyung Won
Chung, Tyna Eloundou, Rapha Gontijo-Lopes, Shixiang Shane Gu,
Johannes Heidecke, Joost Huizinga, Teddy Lee, Jan Leike, Stephanie
Lin, Ryan Lowe, Todor Markov, Luke Metz, Tong Mu, Shibani San-
turkar, John Schulman, Andrea Vallone, Carroll Wainwright, Jason
Wei, Lilian Weng, Kai Xiao, Chong Zhang, Marvin Zhang, Barret
Zoph
Refusals
11
Juan Felipe Cerón Uribe, Tyna Eloundou, Johannes Heidecke, Joost
Huizinga, Jan Leike, Stephanie Lin, Ryan Lowe, Pamela Mishkin,
Tong Mu, Carroll Wainwright, Lilian Weng, Kai Xiao, Chong Zhang,
Barret Zoph
Foundational RLHF and InstructGPT work
11
Diogo Almeida, Joost Huizinga, Roger Jiang, Jan Leike, Stephanie
Lin, Ryan Lowe, Pamela Mishkin, Dan Mossing, Long Ouyang, Kata-
rina Slama, Carroll Wainwright, Jeff Wu, Kai Xiao, Marvin Zhang
Flagship training runs
11
Greg Brockman, Liam Fedus, Johannes Heidecke, Joost Huizinga,
Roger Jiang, Kyle Kosic, Luke Metz, Ashvin Nair, Jiayi Weng,
Chong Zhang, Shengjia Zhao, Barret Zoph
Code capability
11
Ilge Akkaya, Mo Bavarian, Jonathan Gordon, Shawn Jain, Haozhun
Jin, Teddy Lee, Chak Ming Li, Oleg Murk, Ashvin Nair, Vitchyr
Pong, Benjamin Sokolowsky, Jerry Tworek, Matt Wiethoff, Sarah
Yoo, Kevin Yu, Wojciech Zaremba, William Zhuk
Evaluation & analysis
Core contributors
11
Sandhini Agarwal System card co-lead
Lama Ahmad Expert red teaming & adversarial testing program lead
Mo Bavarian Capability prediction co-lead
Tyna Eloundou Safety evaluations co-lead
Andrew Kondrich OpenAI Evals open-sourcing co-lead
Gretchen Krueger System card co-lead
Michael Lampe Privacy and PII evaluations lead
Pamela Mishkin Economic impact & overreliance evaluations lead
Benjamin Sokolowsky Capability prediction co-lead
Jack Rae Research benchmark execution lead
Chelsea Voss Eval execution lead
Alvin Wang OpenAI Evals lead
Kai Xiao Safety evaluations co-lead
Marvin Zhang OpenAI Evals open-sourcing co-lead
OpenAI Evals library
11
Shixiang Shane Gu, Angela Jiang, Logan Kilpatrick, Andrew Kon-
drich, Pamela Mishkin, Jakub Pachocki, Ted Sanders, Jessica Shieh,
Alvin Wang, Marvin Zhang
Model-graded evaluation infrastructure
11
Liam Fedus, Rapha Gontijo-Lopes, Shixiang Shane Gu, Andrew
Kondrich, Michael (Rai) Pokorny, Wojciech Zaremba, Chong Zhang,
Marvin Zhang, Shengjia Zhao, Barret Zoph
Acceleration forecasting
11
Alan Hickey, Daniel Kokotajlo, Cullen O’Keefe, Sarah Shoker
ChatGPT evaluations
11
Juan Felipe Cerón Uribe, Hyung Won Chung, Rapha Gontijo-Lopes,
Liam Fedus, Luke Metz, Michael Rai Pokorny, Jason Wei, Shengjia
Zhao, Barret Zoph
Capability evaluations
11
Tyna Eloundou, Shengli Hu, Roger Jiang, Jamie Kiros, Teddy Lee,
Scott Mayer McKinney, Jakub Pachocki, Alex Paino, Giambattista
Parascandolo, Boris Power, Raul Puri, Jack Rae, Nick Ryder, Ted
Sanders, Szymon Sidor, Benjamin Sokolowsky, Chelsea Voss, Alvin
Wang, Rowan Zellers, Juntang Zhuang
Coding evaluations
11
Ilge Akkaya, Mo Bavarian, Jonathan Gordon, Shawn Jain, Chak Ming
Li, Oleg Murk, Vitchyr Pong, Benjamin Sokolowsky, Jerry Tworek,
Kevin Yu, Wojciech Zaremba
Real-world use case evaluations
11
Andrew Kondrich, Joe Palermo, Boris Power, Ted Sanders
Contamination investigations
11
Adrien Ecoffet, Roger Jiang, Ingmar Kanitscheider, Scott Mayer
McKinney, Alex Paino, Giambattista Parascandolo, Jack Rae, Qim-
ing Yuan
Instruction following and API evals
11
Diogo Almeida, Carroll Wainwright, Marvin Zhang
Novel capability discovery
11
Filipe de Avila Belbute Peres, Kevin Button, Fotis Chantzis, Mike
Heaton, Wade Hickey, Xin Hu, Andrew Kondrich, Matt Knight, An-
drew Mayne, Jake McNeil, Vinnie Monaco, Joe Palermo, Joel Parish,
Boris Power, Bob Rotsted, Ted Sanders
Vision evaluations
11
Shixiang Shane Gu, Shengli Hu, Jamie Kiros, Hyeonwoo Noh, Raul
Puri, Rowan Zellers
Economic impact evaluation
11
Tyna Eloundou, Sam Manning, Aalok Mehta, Pamela Mishkin
Non-proliferation, international humanitarian law & national
security red teaming
11
Sarah Shoker
Overreliance analysis
11
Miles Brundage, Michael Lampe, Pamela Mishkin
Privacy and PII evaluations
11
Michael Lampe, Vinnie Monaco, Ashley Pantuliano
Safety and policy evaluations
11
Josh Achiam, Sandhini Agarwal, Lama Ahmad, Jeff Belgum, Tyna
Eloundou, Johannes Heidecke, Shengli Hu, Joost Huizinga, Jamie
Kiros, Gretchen Krueger, Michael Lampe, Stephanie Lin, Ryan
Lowe, Todor Markov, Vinnie Monaco, Tong Mu, Raul Puri, Girish
Sastry, Andrea Vallone, Carroll Wainwright, CJ Weinmann, Lilian
Weng, Kai Xiao, Chong Zhang
OpenAI adversarial testers
11
Josh Achiam, Steven Adler, Lama Ahmad, Shyamal Anadkat, Red
Avila, Gabriel Bernadett-Shapiro, Anna-Luisa Brakman, Tim Brooks,
Miles Brundage, Chelsea Carlson, Derek Chen, Hyung Won Chung,
Jeremiah Currier, Daniel Kokotajlo, David Dohan, Adrien Ecoffet,
Juston Forte, Vik Goel, Ryan Greene, Johannes Heidecke, Alan
Hickey, Shengli Hu, Joost Huizinga, Janko, Tomer Kaftan, Ali Ka-
mali, Nitish Shirish Keskar, Tabarak Khan, Hendrik Kirchner, Daniel
Kokotajlo, Gretchen Krueger, Michael Lampe, Teddy Lee, Molly
Lin, Ryan Lowe, Todor Markov, Jake McNeil, Pamela Mishkin,
Vinnie Monaco, Daniel Mossing, Tong Mu, Oleg Murk, Cullen
O’Keefe, Joe Palermo, Giambattista Parascandolo, Joel Parish, Boris
Power, Alethea Power, Cameron Raymond, Francis Real, Bob Rot-
sted, Mario Salterelli, Sam Wolrich, Ted Sanders, Girish Sastry,
Sarah Shoker, Shyamal Anadkat, Yang Song, Natalie Staudacher,
Madeleine Thompson, Elizabeth Tseng, Chelsea Voss, Jason Wei,
Chong Zhang
16