The Llama 3 Herd of Models
Article Status
Published
Authors/contributors
- Dubey, Abhimanyu (Author)
- Jauhri, Abhinav (Author)
- Pandey, Abhinav (Author)
- Kadian, Abhishek (Author)
- Al-Dahle, Ahmad (Author)
- Letman, Aiesha (Author)
- Mathur, Akhil (Author)
- Schelten, Alan (Author)
- Yang, Amy (Author)
- Fan, Angela (Author)
- Goyal, Anirudh (Author)
- Hartshorn, Anthony (Author)
- Yang, Aobo (Author)
- Mitra, Archi (Author)
- Sravankumar, Archie (Author)
- Korenev, Artem (Author)
- Hinsvark, Arthur (Author)
- Rao, Arun (Author)
- Zhang, Aston (Author)
- Rodriguez, Aurelien (Author)
- Gregerson, Austen (Author)
- Spataru, Ava (Author)
- Roziere, Baptiste (Author)
- Biron, Bethany (Author)
- Tang, Binh (Author)
- Chern, Bobbie (Author)
- Caucheteux, Charlotte (Author)
- Nayak, Chaya (Author)
- Bi, Chloe (Author)
- Marra, Chris (Author)
- McConnell, Chris (Author)
- Keller, Christian (Author)
- Touret, Christophe (Author)
- Wu, Chunyang (Author)
- Wong, Corinne (Author)
- Ferrer, Cristian Canton (Author)
- Nikolaidis, Cyrus (Author)
- Allonsius, Damien (Author)
- Song, Daniel (Author)
- Pintz, Danielle (Author)
- Livshits, Danny (Author)
- Esiobu, David (Author)
- Choudhary, Dhruv (Author)
- Mahajan, Dhruv (Author)
- Garcia-Olano, Diego (Author)
- Perino, Diego (Author)
- Hupkes, Dieuwke (Author)
- Lakomkin, Egor (Author)
- AlBadawy, Ehab (Author)
- Lobanova, Elina (Author)
- Dinan, Emily (Author)
- Smith, Eric Michael (Author)
- Radenovic, Filip (Author)
- Zhang, Frank (Author)
- Synnaeve, Gabriel (Author)
- Lee, Gabrielle (Author)
- Anderson, Georgia Lewis (Author)
- Nail, Graeme (Author)
- Mialon, Gregoire (Author)
- Pang, Guan (Author)
- Cucurell, Guillem (Author)
- Nguyen, Hailey (Author)
- Korevaar, Hannah (Author)
- Xu, Hu (Author)
- Touvron, Hugo (Author)
- Zarov, Iliyan (Author)
- Ibarra, Imanol Arrieta (Author)
- Kloumann, Isabel (Author)
- Misra, Ishan (Author)
- Evtimov, Ivan (Author)
- Copet, Jade (Author)
- Lee, Jaewon (Author)
- Geffert, Jan (Author)
- Vranes, Jana (Author)
- Park, Jason (Author)
- Mahadeokar, Jay (Author)
- Shah, Jeet (Author)
- van der Linde, Jelmer (Author)
- Billock, Jennifer (Author)
- Hong, Jenny (Author)
- Lee, Jenya (Author)
- Fu, Jeremy (Author)
- Chi, Jianfeng (Author)
- Huang, Jianyu (Author)
- Liu, Jiawen (Author)
- Wang, Jie (Author)
- Yu, Jiecao (Author)
- Bitton, Joanna (Author)
- Spisak, Joe (Author)
- Park, Jongsoo (Author)
- Rocca, Joseph (Author)
- Johnstun, Joshua (Author)
- Saxe, Joshua (Author)
- Jia, Junteng (Author)
- Alwala, Kalyan Vasuden (Author)
- Upasani, Kartikeya (Author)
- Plawiak, Kate (Author)
- Li, Ke (Author)
- Heafield, Kenneth (Author)
- Stone, Kevin (Author)
- El-Arini, Khalid (Author)
- Iyer, Krithika (Author)
- Malik, Kshitiz (Author)
- Chiu, Kuenley (Author)
- Bhalla, Kunal (Author)
- Rantala-Yeary, Lauren (Author)
- van der Maaten, Laurens (Author)
- Chen, Lawrence (Author)
- Tan, Liang (Author)
- Jenkins, Liz (Author)
- Martin, Louis (Author)
- Madaan, Lovish (Author)
- Malo, Lubo (Author)
- Blecher, Lukas (Author)
- Landzaat, Lukas (Author)
- de Oliveira, Luke (Author)
- Muzzi, Madeline (Author)
- Pasupuleti, Mahesh (Author)
- Singh, Mannat (Author)
- Paluri, Manohar (Author)
- Kardas, Marcin (Author)
- Oldham, Mathew (Author)
- Rita, Mathieu (Author)
- Pavlova, Maya (Author)
- Kambadur, Melanie (Author)
- Lewis, Mike (Author)
- Si, Min (Author)
- Singh, Mitesh Kumar (Author)
- Hassan, Mona (Author)
- Goyal, Naman (Author)
- Torabi, Narjes (Author)
- Bashlykov, Nikolay (Author)
- Bogoychev, Nikolay (Author)
- Chatterji, Niladri (Author)
- Duchenne, Olivier (Author)
- Çelebi, Onur (Author)
- Alrassy, Patrick (Author)
- Zhang, Pengchuan (Author)
- Li, Pengwei (Author)
- Vasic, Petar (Author)
- Weng, Peter (Author)
- Bhargava, Prajjwal (Author)
- Dubal, Pratik (Author)
- Krishnan, Praveen (Author)
- Koura, Punit Singh (Author)
- Xu, Puxin (Author)
- He, Qing (Author)
- Dong, Qingxiao (Author)
- Srinivasan, Ragavan (Author)
- Ganapathy, Raj (Author)
- Calderer, Ramon (Author)
- Cabral, Ricardo Silveira (Author)
- Stojnic, Robert (Author)
- Raileanu, Roberta (Author)
- Girdhar, Rohit (Author)
- Patel, Rohit (Author)
- Sauvestre, Romain (Author)
- Polidoro, Ronnie (Author)
- Sumbaly, Roshan (Author)
- Taylor, Ross (Author)
- Silva, Ruan (Author)
- Hou, Rui (Author)
- Wang, Rui (Author)
- Hosseini, Saghar (Author)
- Chennabasappa, Sahana (Author)
- Singh, Sanjay (Author)
- Bell, Sean (Author)
- Kim, Seohyun Sonia (Author)
- Edunov, Sergey (Author)
- Nie, Shaoliang (Author)
- Narang, Sharan (Author)
- Raparthy, Sharath (Author)
- Shen, Sheng (Author)
- Wan, Shengye (Author)
- Bhosale, Shruti (Author)
- Zhang, Shun (Author)
- Vandenhende, Simon (Author)
- Batra, Soumya (Author)
- Whitman, Spencer (Author)
- Sootla, Sten (Author)
- Collot, Stephane (Author)
- Gururangan, Suchin (Author)
- Borodinsky, Sydney (Author)
- Herman, Tamar (Author)
- Fowler, Tara (Author)
- Sheasha, Tarek (Author)
- Georgiou, Thomas (Author)
- Scialom, Thomas (Author)
- Speckbacher, Tobias (Author)
- Mihaylov, Todor (Author)
- Xiao, Tong (Author)
- Karn, Ujjwal (Author)
- Goswami, Vedanuj (Author)
- Gupta, Vibhor (Author)
- Ramanathan, Vignesh (Author)
- Kerkez, Viktor (Author)
- Gonguet, Vincent (Author)
- Do, Virginie (Author)
- Vogeti, Vish (Author)
- Petrovic, Vladan (Author)
- Chu, Weiwei (Author)
- Xiong, Wenhan (Author)
- Fu, Wenyin (Author)
- Meers, Whitney (Author)
- Martinet, Xavier (Author)
- Wang, Xiaodong (Author)
- Tan, Xiaoqing Ellen (Author)
- Xie, Xinfeng (Author)
- Jia, Xuchao (Author)
- Wang, Xuewei (Author)
- Goldschlag, Yaelle (Author)
- Gaur, Yashesh (Author)
- Babaei, Yasmine (Author)
- Wen, Yi (Author)
- Song, Yiwen (Author)
- Zhang, Yuchen (Author)
- Li, Yue (Author)
- Mao, Yuning (Author)
- Coudert, Zacharie Delpierre (Author)
- Yan, Zheng (Author)
- Chen, Zhengxing (Author)
- Papakipos, Zoe (Author)
- Singh, Aaditya (Author)
- Grattafiori, Aaron (Author)
- Jain, Abha (Author)
- Kelsey, Adam (Author)
- Shajnfeld, Adam (Author)
- Gangidi, Adithya (Author)
- Victoria, Adolfo (Author)
- Goldstand, Ahuva (Author)
- Menon, Ajay (Author)
- Sharma, Ajay (Author)
- Boesenberg, Alex (Author)
- Vaughan, Alex (Author)
- Baevski, Alexei (Author)
- Feinstein, Allie (Author)
- Kallet, Amanda (Author)
- Sangani, Amit (Author)
- Yunus, Anam (Author)
- Lupu, Andrei (Author)
- Alvarado, Andres (Author)
- Caples, Andrew (Author)
- Gu, Andrew (Author)
- Ho, Andrew (Author)
- Poulton, Andrew (Author)
- Ryan, Andrew (Author)
- Ramchandani, Ankit (Author)
- Franco, Annie (Author)
- Saraf, Aparajita (Author)
- Chowdhury, Arkabandhu (Author)
- Gabriel, Ashley (Author)
- Bharambe, Ashwin (Author)
- Eisenman, Assaf (Author)
- Yazdan, Azadeh (Author)
- James, Beau (Author)
- Maurer, Ben (Author)
- Leonhardi, Benjamin (Author)
- Huang, Bernie (Author)
- Loyd, Beth (Author)
- De Paola, Beto (Author)
- Paranjape, Bhargavi (Author)
- Liu, Bing (Author)
- Wu, Bo (Author)
- Ni, Boyu (Author)
- Hancock, Braden (Author)
- Wasti, Bram (Author)
- Spence, Brandon (Author)
- Stojkovic, Brani (Author)
- Gamido, Brian (Author)
- Montalvo, Britt (Author)
- Parker, Carl (Author)
- Burton, Carly (Author)
- Mejia, Catalina (Author)
- Wang, Changhan (Author)
- Kim, Changkyu (Author)
- Zhou, Chao (Author)
- Hu, Chester (Author)
- Chu, Ching-Hsiang (Author)
- Cai, Chris (Author)
- Tindal, Chris (Author)
- Feichtenhofer, Christoph (Author)
- Civin, Damon (Author)
- Beaty, Dana (Author)
- Kreymer, Daniel (Author)
- Li, Daniel (Author)
- Wyatt, Danny (Author)
- Adkins, David (Author)
- Xu, David (Author)
- Testuggine, Davide (Author)
- David, Delia (Author)
- Parikh, Devi (Author)
- Liskovich, Diana (Author)
- Foss, Didem (Author)
- Wang, Dingkang (Author)
- Le, Duc (Author)
- Holland, Dustin (Author)
- Dowling, Edward (Author)
- Jamil, Eissa (Author)
- Montgomery, Elaine (Author)
- Presani, Eleonora (Author)
- Hahn, Emily (Author)
- Wood, Emily (Author)
- Brinkman, Erik (Author)
- Arcaute, Esteban (Author)
- Dunbar, Evan (Author)
- Smothers, Evan (Author)
- Sun, Fei (Author)
- Kreuk, Felix (Author)
- Tian, Feng (Author)
- Ozgenel, Firat (Author)
- Caggioni, Francesco (Author)
- Guzmán, Francisco (Author)
- Kanayet, Frank (Author)
- Seide, Frank (Author)
- Florez, Gabriela Medina (Author)
- Schwarz, Gabriella (Author)
- Badeer, Gada (Author)
- Swee, Georgia (Author)
- Halpern, Gil (Author)
- Thattai, Govind (Author)
- Herman, Grant (Author)
- Sizov, Grigory (Author)
- Guangyi (Author)
- Zhang (Author)
- Lakshminarayanan, Guna (Author)
- Shojanazeri, Hamid (Author)
- Zou, Han (Author)
- Wang, Hannah (Author)
- Zha, Hanwen (Author)
- Habeeb, Haroun (Author)
- Rudolph, Harrison (Author)
- Suk, Helen (Author)
- Aspegren, Henry (Author)
- Goldman, Hunter (Author)
- Damlaj, Ibrahim (Author)
- Molybog, Igor (Author)
- Tufanov, Igor (Author)
- Veliche, Irina-Elena (Author)
- Gat, Itai (Author)
- Weissman, Jake (Author)
- Geboski, James (Author)
- Kohli, James (Author)
- Asher, Japhet (Author)
- Gaya, Jean-Baptiste (Author)
- Marcus, Jeff (Author)
- Tang, Jeff (Author)
- Chan, Jennifer (Author)
- Zhen, Jenny (Author)
- Reizenstein, Jeremy (Author)
- Teboul, Jeremy (Author)
- Zhong, Jessica (Author)
- Jin, Jian (Author)
- Yang, Jingyi (Author)
- Cummings, Joe (Author)
- Carvill, Jon (Author)
- Shepard, Jon (Author)
- McPhie, Jonathan (Author)
- Torres, Jonathan (Author)
- Ginsburg, Josh (Author)
- Wang, Junjie (Author)
- Wu, Kai (Author)
- U, Kam Hou (Author)
- Saxena, Karan (Author)
- Prasad, Karthik (Author)
- Khandelwal, Kartikay (Author)
- Zand, Katayoun (Author)
- Matosich, Kathy (Author)
- Veeraraghavan, Kaushik (Author)
- Michelena, Kelly (Author)
- Li, Keqian (Author)
- Huang, Kun (Author)
- Chawla, Kunal (Author)
- Lakhotia, Kushal (Author)
- Huang, Kyle (Author)
- Chen, Lailin (Author)
- Garg, Lakshya (Author)
- A, Lavender (Author)
- Silva, Leandro (Author)
- Bell, Lee (Author)
- Zhang, Lei (Author)
- Guo, Liangpeng (Author)
- Yu, Licheng (Author)
- Moshkovich, Liron (Author)
- Wehrstedt, Luca (Author)
- Khabsa, Madian (Author)
- Avalani, Manav (Author)
- Bhatt, Manish (Author)
- Tsimpoukelli, Maria (Author)
- Mankus, Martynas (Author)
- Hasson, Matan (Author)
- Lennie, Matthew (Author)
- Reso, Matthias (Author)
- Groshev, Maxim (Author)
- Naumov, Maxim (Author)
- Lathi, Maya (Author)
- Keneally, Meghan (Author)
- Seltzer, Michael L. (Author)
- Valko, Michal (Author)
- Restrepo, Michelle (Author)
- Patel, Mihir (Author)
- Vyatskov, Mik (Author)
- Samvelyan, Mikayel (Author)
- Clark, Mike (Author)
- Macey, Mike (Author)
- Wang, Mike (Author)
- Hermoso, Miquel Jubert (Author)
- Metanat, Mo (Author)
- Rastegari, Mohammad (Author)
- Bansal, Munish (Author)
- Santhanam, Nandhini (Author)
- Parks, Natascha (Author)
- White, Natasha (Author)
- Bawa, Navyata (Author)
- Singhal, Nayan (Author)
- Egebo, Nick (Author)
- Usunier, Nicolas (Author)
- Laptev, Nikolay Pavlovich (Author)
- Dong, Ning (Author)
- Zhang, Ning (Author)
- Cheng, Norman (Author)
- Chernoguz, Oleg (Author)
- Hart, Olivia (Author)
- Salpekar, Omkar (Author)
- Kalinli, Ozlem (Author)
- Kent, Parkin (Author)
- Parekh, Parth (Author)
- Saab, Paul (Author)
- Balaji, Pavan (Author)
- Rittner, Pedro (Author)
- Bontrager, Philip (Author)
- Roux, Pierre (Author)
- Dollar, Piotr (Author)
- Zvyagina, Polina (Author)
- Ratanchandani, Prashant (Author)
- Yuvraj, Pritish (Author)
- Liang, Qian (Author)
- Alao, Rachad (Author)
- Rodriguez, Rachel (Author)
- Ayub, Rafi (Author)
- Murthy, Raghotham (Author)
- Nayani, Raghu (Author)
- Mitra, Rahul (Author)
- Li, Raymond (Author)
- Hogan, Rebekkah (Author)
- Battey, Robin (Author)
- Wang, Rocky (Author)
- Maheswari, Rohan (Author)
- Howes, Russ (Author)
- Rinott, Ruty (Author)
- Bondu, Sai Jayesh (Author)
- Datta, Samyak (Author)
- Chugh, Sara (Author)
- Hunt, Sara (Author)
- Dhillon, Sargun (Author)
- Sidorov, Sasha (Author)
- Pan, Satadru (Author)
- Verma, Saurabh (Author)
- Yamamoto, Seiji (Author)
- Ramaswamy, Sharadh (Author)
- Lindsay, Shaun (Author)
- Lindsay, Shaun (Author)
- Feng, Sheng (Author)
- Lin, Shenghao (Author)
- Zha, Shengxin Cindy (Author)
- Shankar, Shiva (Author)
- Zhang, Shuqiang (Author)
- Zhang, Shuqiang (Author)
- Wang, Sinong (Author)
- Agarwal, Sneha (Author)
- Sajuyigbe, Soji (Author)
- Chintala, Soumith (Author)
- Max, Stephanie (Author)
- Chen, Stephen (Author)
- Kehoe, Steve (Author)
- Satterfield, Steve (Author)
- Govindaprasad, Sudarshan (Author)
- Gupta, Sumit (Author)
- Cho, Sungmin (Author)
- Virk, Sunny (Author)
- Subramanian, Suraj (Author)
- Choudhury, Sy (Author)
- Goldman, Sydney (Author)
- Remez, Tal (Author)
- Glaser, Tamar (Author)
- Best, Tamara (Author)
- Kohler, Thilo (Author)
- Robinson, Thomas (Author)
- Li, Tianhe (Author)
- Zhang, Tianjun (Author)
- Matthews, Tim (Author)
- Chou, Timothy (Author)
- Shaked, Tzook (Author)
- Vontimitta, Varun (Author)
- Ajayi, Victoria (Author)
- Montanez, Victoria (Author)
- Mohan, Vijai (Author)
- Kumar, Vinay Satish (Author)
- Mangla, Vishal (Author)
- Albiero, Vítor (Author)
- Ionescu, Vlad (Author)
- Poenaru, Vlad (Author)
- Mihailescu, Vlad Tiberiu (Author)
- Ivanov, Vladimir (Author)
- Li, Wei (Author)
- Wang, Wenchen (Author)
- Jiang, Wenwen (Author)
- Bouaziz, Wes (Author)
- Constable, Will (Author)
- Tang, Xiaocheng (Author)
- Wang, Xiaofang (Author)
- Wu, Xiaojian (Author)
- Wang, Xiaolan (Author)
- Xia, Xide (Author)
- Wu, Xilun (Author)
- Gao, Xinbo (Author)
- Chen, Yanjun (Author)
- Hu, Ye (Author)
- Jia, Ye (Author)
- Qi, Ye (Author)
- Li, Yenda (Author)
- Zhang, Yilin (Author)
- Zhang, Ying (Author)
- Adi, Yossi (Author)
- Nam, Youngjin (Author)
- Yu (Author)
- Wang (Author)
- Hao, Yuchen (Author)
- Qian, Yundi (Author)
- He, Yuzi (Author)
- Rait, Zach (Author)
- DeVito, Zachary (Author)
- Rosnbrick, Zef (Author)
- Wen, Zhaoduo (Author)
- Yang, Zhenyu (Author)
- Zhao, Zhiwei (Author)
Title
The Llama 3 Herd of Models
Abstract
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development.
Repository
arXiv
Archive ID
arXiv:2407.21783
Date
2024-08-15
Accessed
01/10/2024, 14:01
Library Catalogue
Extra
arXiv:2407.21783 [cs]
<AI Smry>: It is found that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks, and performs competitively with the state-of-the-art on image, video, and speech recognition tasks.
Citation
Dubey, A., Jauhri, A., Pandey, A., Kadian, A., Al-Dahle, A., Letman, A., Mathur, A., Schelten, A., Yang, A., Fan, A., Goyal, A., Hartshorn, A., Yang, A., Mitra, A., Sravankumar, A., Korenev, A., Hinsvark, A., Rao, A., Zhang, A., … Zhao, Z. (2024). The Llama 3 Herd of Models (arXiv:2407.21783). arXiv. https://doi.org/10.48550/arXiv.2407.21783
Technical methods
Link to this record