Meta Launches New AI Model LLaMA: Decoding Large Language Models

By Preeti Rana On Mar 9, 2023

Following the success of OpenAI’s ChatGPT, Google unveiled Bard, and several other companies followed suit. Meta Platforms, Inc. now appears to be preparing to gain a competitive edge of its own.

The California-based tech giant has introduced a new research tool that will soon aid in the creation of chatbots based on artificial intelligence.

The company has publicly released LLaMA (Large Language Model Meta AI). According to the official release, LLaMA is a state-of-the-art foundational language model designed to help researchers advance their work in this subfield of AI.

Intriguingly, this would be Meta’s third LLM, following Galactica and BlenderBot 3, both of which were immediately shut down due to inaccurate results.

Today we’re publicly releasing LLaMA, a state-of-the-art foundational LLM, as part of our ongoing commitment to open science, transparency and democratized access to new research.

Learn more & request access ➡️ https://t.co/8AeLVhMWkq pic.twitter.com/1BEkTngtnM

— Meta AI (@MetaAI) February 24, 2023

LLaMA

LLaMA is a collection of language models ranging from 7B to 65B parameters. The company says it trained the models on trillions of tokens, and claims this shows that cutting-edge models can be trained using public datasets rather than proprietary and inaccessible ones.

Meta argues that it’s preferable to train smaller foundational models like LLaMA because it takes much less processing power and resources to test, validate, and explore new use cases.

It is well known that foundational language models are trained on large, unlabeled datasets, which makes them ideal for task-specific customization. Meta has stated that it will provide LLaMA in four sizes: 7B, 13B, 33B, and 65B parameters.
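To put those sizes in perspective, here is a rough back-of-the-envelope sketch of how much memory the weights alone might need. It assumes 2 bytes per parameter (16-bit floats), which is an assumption and not something Meta's release states. The function name and the table of sizes are illustrative only.

```python
# Rough estimate of the memory needed to hold model weights.
# ASSUMPTION: each parameter is stored in 2 bytes (16-bit floats).
# The article does not say how the weights are stored.
BYTES_PER_PARAM = 2

# Parameter counts for the four sizes named in the article.
MODEL_SIZES = {"7B": 7e9, "13B": 13e9, "33B": 33e9, "65B": 65e9}


def weight_memory_gb(num_params, bytes_per_param=BYTES_PER_PARAM):
    """Approximate size of the weights in gigabytes (10**9 bytes)."""
    return num_params * bytes_per_param / 1e9


for name, num_params in MODEL_SIZES.items():
    print(f"LLaMA-{name}: about {weight_memory_gb(num_params):.0f} GB of weights")
```

Under that assumption, the largest model needs roughly nine times the weight memory of the smallest. The estimate ignores everything else a running model needs, so real requirements would be higher.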

Meta noted in its research paper that LLaMA-13B outperformed OpenAI’s GPT-3 (175B) on the majority of benchmarks and that LLaMA-65B is comparable to the best models, DeepMind’s Chinchilla-70B and Google’s PaLM-540B.

Once trained, LLaMA-13B could be a boon for small businesses eager to experiment with these systems; however, it may still be out of reach for researchers working on their own.

LLaMA is not currently implemented in any of Meta’s products, but the company intends to make it accessible to researchers.

Previously, the company had introduced the LLM OPT-175B, but LLaMA is a more advanced system. Meta has also made available the LLaMA model source code so that outsiders can observe how the system operates.

This will allow them to collaborate and customize related projects.


Decoding Large Language Models

Large language models (LLMs) are artificial intelligence (AI) systems that consume vast quantities of digital text from internet sources such as articles, news reports, and social media posts.

These digital texts are used to train software that predicts and generates content based on queries and prompts. These models can assist with tasks such as writing essays, composing social media posts, suggesting code, and generating chatbot conversations.
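The idea of training on text and then predicting what comes next can be illustrated with a deliberately tiny sketch. This is not how LLaMA or any real LLM works internally; it is a simple word-pair counter, and the function names and sample text are made up for illustration. It only shows the general principle of learning from text and then extending a prompt.

```python
from collections import Counter, defaultdict


def train_bigram(text):
    """Count, for each word, which words follow it in the training text."""
    words = text.lower().split()
    follows = defaultdict(Counter)
    for current, nxt in zip(words, words[1:]):
        follows[current][nxt] += 1
    return follows


def generate(follows, prompt, max_words=5):
    """Extend a non-empty prompt by repeatedly adding the most frequent next word."""
    out = prompt.lower().split()
    for _ in range(max_words):
        candidates = follows.get(out[-1])
        if not candidates:
            break  # the last word was never followed by anything in training
        out.append(candidates.most_common(1)[0][0])
    return " ".join(out)


# Made-up training text, purely for illustration.
corpus = "the model reads text and the model writes text and the model reads code"
model = train_bigram(corpus)
print(generate(model, "the", max_words=2))  # prints "the model reads"
```

Real LLMs replace the word-pair counts with neural networks that have billions of parameters and are trained on far more text, but the basic loop of predicting the next piece of text from what came before is similar.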

The most recent release from Meta arrives at a time when the company has been largely absent from the conversation surrounding the revolutionary AI chatbots.

Meta was among the first to launch its own chatbots. However, due to inaccurate results and a lacklustre response, those plans failed. With LLaMA, Meta appears to have returned to the game.