https://huggingface.co/TencentARC/LLaMA-Pro-8B

Hugging Face's logo Hugging Face
[                    ]

  * Models
  * Datasets
  * Spaces
  * Docs
  * Solutions
  * Pricing
  * 
  * -----------------------------------------------------------------
  * Log In
  * Sign Up

[1625552871]
TencentARC
/
LLaMA-Pro-8B
like 56

 
Text Generation Transformers PyTorch Safetensors llama Inference
Endpoints text-generation-inference
License: llama2
Model card Files Files and versions Community
2
Train
Deploy
Use in Transformers
Edit model card

  * LLaMA-Pro-8B Model Card
      + Model Description
      + Development and Training
      + Intended Use
      + Performance
      + Limitations
      + Ethical Considerations

 LLaMA-Pro-8B Model Card

 Model Description

LLaMA-Pro is a progressive version of the original LLaMA model,
enhanced by the addition of Transformer blocks. It specializes in
integrating both general language understanding and domain-specific
knowledge, particularly in programming and mathematics.

 Development and Training

Developed by Tencent's ARC Lab, LLaMA-Pro is an 8.3 billion parameter
model. It's an expansion of LLaMA2-7B, further trained on code and
math corpora totaling 80 billion tokens.

 Intended Use

This model is designed for a wide range of NLP tasks, with a focus on
programming, mathematics, and general language tasks. It suits
scenarios requiring integration of natural and programming languages.

 Performance

LLaMA-Pro demonstrates advanced performance across various
benchmarks. It outperforms existing models in the LLaMA series in
handling diverse tasks, showcasing its capability as an intelligent
language agent.

 Limitations

While LLaMA-Pro addresses some limitations of previous models in the
series, it may still encounter challenges specific to highly
specialized domains or tasks.

 Ethical Considerations

Users should be aware of potential biases in the model and use it
responsibly, considering its impact on various applications.

Downloads last month
    70

Safetensors 
Model size
8.36B params
Tensor type
BF16
*

Space using TencentARC/LLaMA-Pro-8B 1

 

RobotDall/TencentARC-LLaMA-Pro-8B
Company
(c) Hugging Face
TOS Privacy About Jobs  
Website
Models Datasets Spaces Pricing Docs