https://en.wikipedia.org/wiki/LLaMA
LLaMA
LLaMA (Large Language Model Meta AI) is a large language model (LLM) released by Meta AI in February 2023. Models were trained at several sizes, ranging from 7 billion to 65 billion parameters. LLaMA's developers reported that the 13-billion-parameter model outperformed the much larger GPT-3 (175 billion parameters) on most NLP benchmarks, and that the largest model was competitive with state-of-the-art models such as PaLM and Chinchilla. Whereas the most powerful LLMs have generally been accessible only through limited APIs (if at all), Meta released LLaMA's model weights to the research community under a noncommercial license. Within a week of LLaMA's release, its weights were leaked to the public on 4chan via BitTorrent. On July 18, 2023, Meta announced the next generation of the LLaMA model, named Llama 2.
Architecture and training
Architecture
LLaMA uses the transformer architecture, the standard architecture for language modelling since 2018.
...