Meta's Llama 3.1: Driving Open-Source AI Innovation

We share our excitement at Meta’s release of its latest open-source AI model, Llama 3.1.

Driving Open-Source Innovation

Meta has unveiled Llama 3.1 405B, a large open-source language model engineered to rival closed models such as GPT-4o and Claude 3.5 Sonnet. Although Llama 3.1 is an incremental update to Llama 3, its headline addition is this 405-billion-parameter model, which is the world’s largest open-source LLM to date and surpasses NVIDIA's Nemotron-4-340B-Instruct in size.

It's exciting to witness open-source AI gaining prominence. This new suite of models marks a substantial advancement in AI development, offering improved performance, cost efficiency, and increased transparency.

With its 405 billion parameters, Llama 3.1 is poised to claim a high ranking on the LMSys Chatbot Arena Leaderboard, which evaluates performance based on blind user votes. In recent months, the top spot has alternated between versions of OpenAI's GPT-4, Anthropic's Claude 3, and Google's Gemini.

As with Meta’s previous models, Llama 3.1 405B is available for download and use on cloud platforms such as AWS, Azure, and Google Cloud. It is also integrated into WhatsApp and Meta.ai, where it powers a chatbot experience for U.S.-based users.

Meta researchers noted that Llama 3.1 405B was trained on a more diverse mix of non-English data than earlier Llama models to improve its performance across languages. The training data also included more mathematical data and code to improve the model’s mathematical reasoning skills, and recent web data to strengthen its knowledge of current events.

All Llama 3.1 models are capable of utilizing third-party tools, apps, and APIs to perform tasks, similar to models from Anthropic and OpenAI. Straight out of the box, they are trained to use Brave Search to answer questions about recent events, the Wolfram Alpha API for math and science-related queries, and a Python interpreter for validating code.
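To make the tool-use idea concrete, here is a minimal sketch of how an application might route a model's tool request to the right handler. Everything in it is an assumption for illustration: the JSON shape, the `dispatch` helper, and the stub handlers are hypothetical and do not reproduce Llama 3.1's actual tool-call format or the real Brave Search and Wolfram Alpha APIs.

```python
import json
from typing import Callable, Dict


# Stub handlers (hypothetical). A real integration would call the Brave Search
# API, the Wolfram Alpha API, or a sandboxed Python interpreter instead.
def brave_search(query: str) -> str:
    return f"[stub search results for: {query}]"


def wolfram_alpha(query: str) -> str:
    return f"[stub computed answer for: {query}]"


def code_interpreter(code: str) -> str:
    return f"[stub execution of {len(code)} characters of Python]"


# Registry mapping tool names to handlers.
TOOLS: Dict[str, Callable[[str], str]] = {
    "brave_search": brave_search,
    "wolfram_alpha": wolfram_alpha,
    "code_interpreter": code_interpreter,
}


def dispatch(tool_call_json: str) -> str:
    """Route a tool request (assumed shape: {"tool": ..., "input": ...})."""
    call = json.loads(tool_call_json)
    handler = TOOLS.get(call["tool"])
    if handler is None:
        return f"unknown tool: {call['tool']}"
    return handler(call["input"])


# Example: the model asks for a math computation.
print(dispatch('{"tool": "wolfram_alpha", "input": "integrate x^2 dx"}'))
# prints "[stub computed answer for: integrate x^2 dx]"
```

In a real application, the handler's result would be passed back to the model so it can compose its final answer.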

What Makes Llama 3.1 Special?

Let’s go through its new features one by one.

Multilingual Capabilities

The primary update from Llama 3 to Llama 3.1 is the enhanced support for non-English languages. The training data for Llama 3 was roughly 95% English, leading to subpar performance in other languages. The 3.1 update adds support for German, French, Italian, Portuguese, Hindi, Spanish, and Thai, which is likely to drive wider adoption in regions where these languages are spoken.

Longer Context Window

Llama 3 models had a context window (the amount of text the model can reason about at once) of 8K tokens, or around 6K words. Llama 3.1 extends this to 128K tokens, which at the same ratio is roughly 96K words, or a few hundred pages of text, making it competitive with other state-of-the-art LLMs. A long context window is essential for enterprise use cases such as summarizing long documents, generating code that draws on context from a large codebase, or sustaining extended support-chatbot conversations. Note that Llama 3.1 currently processes text only, not images.
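The token-to-text conversion can be checked with some quick arithmetic. The sketch below uses the ratio implied by the figures above (8K tokens to about 6K words, or 0.75 words per token); the 400-words-per-page figure is an assumption, and real page lengths vary widely with layout and language.

```python
# Rough context-window arithmetic.
# WORDS_PER_TOKEN follows from "8K tokens (around 6K words)".
# WORDS_PER_PAGE is an assumed figure, not something Meta has published.
WORDS_PER_TOKEN = 0.75
WORDS_PER_PAGE = 400


def approx_words(tokens: int) -> int:
    """Estimate how many English words fit in a given number of tokens."""
    return round(tokens * WORDS_PER_TOKEN)


def approx_pages(tokens: int, words_per_page: int = WORDS_PER_PAGE) -> int:
    """Estimate how many pages of text fit in a given number of tokens."""
    return round(approx_words(tokens) / words_per_page)


for name, tokens in [("Llama 3", 8_000), ("Llama 3.1", 128_000)]:
    print(f"{name}: {tokens:,} tokens ~ {approx_words(tokens):,} words "
          f"~ {approx_pages(tokens)} pages")
```

Under these assumptions, the 16-fold increase in tokens takes the window from about 15 pages to about 240 pages.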

Llama 3.1 Community License Agreement

The Llama 3.1 models are available under Meta's custom Llama 3.1 Community License Agreement. This license grants researchers, developers, and businesses the freedom to use the models for both research and commercial applications, subject to the conditions set out in the license text. Meta has also expanded the license to allow developers to use the outputs of Llama models, including the 405B model, to improve other models.

The Future of Llama 3.1

As Meta continues to refine Llama 3.1, the future looks promising. Although the released models are text-only, future multimodal versions could benefit fields like virtual reality, where understanding and generating text, images, and speech together is crucial. Advancements in safety tools such as Llama Guard 3 and Prompt Guard should improve the reliability and trustworthiness of AI applications.

Meta AI, Meta’s AI-powered assistant available on Facebook, Instagram, Messenger, and the web, now supports more languages and can create stylized selfies. Starting now, Meta AI users can direct questions to Meta’s latest AI model, Llama 3.1 405B, which can handle more complex queries than the previous model used by Meta AI.

Alongside the Llama 3.1 family, Meta is introducing a “reference system” and new safety tools designed to block prompts that might lead Llama models to behave unpredictably or undesirably, with the aim of encouraging developers to use Llama in a wider range of applications. Additionally, the company is previewing and seeking feedback on the Llama Stack, an upcoming API for tools that can fine-tune Llama models, generate synthetic data with Llama, and build “agentic” applications, meaning apps powered by Llama that can act on a user’s behalf.

Overall, the future looks promising and we’re excited to see how it unfolds.

References

Meta. (2024, April 1). Meta Llama: Community stories. Meta Llama. Retrieved July 24, 2024, from https://llama.meta.com/community-stories/

Meta. (2024, July 23). Introducing Llama 3.1: Our most capable models to date. Meta AI. Retrieved July 24, 2024, from https://ai.meta.com/blog/meta-llama-3-1/

Meta. (2024, April 1). Meta AI assistant built with Llama 3. Meta Newsroom. Retrieved July 24, 2024, from https://about.fb.com/news/2024/04/meta-ai-assistant-built-with-llama-3/