3 vuotta sitten · 9c1933721d
--- a/README.md
+++ b/README.md
@@ -1,10 +1,10 @@
 
				 ## 🦙🌲🤏 Alpaca-LoRA: Low-Rank LLaMA Instruct-Tuning
			
 
				 
			
 
				-**Try the pretrained model out on Colab [here](https://colab.research.google.com/drive/1eWAmesrW99p7e1nah5bipn0zikMb8XYC)!** The pretrained weights fail to generate past 256 tokens due to a training bug, but I'm retraining the model as we speak. If your model's output doesn't terminate, please pull the latest version of the code.
			
 
				+**Try the pretrained model out on Colab [here](https://colab.research.google.com/drive/1eWAmesrW99p7e1nah5bipn0zikMb8XYC)!** _If you have problems with short outputs or very long outputs, please redownload the weights (`force_download=True`) and pull the latest version of the code._
			
 
				 
			
 
				 This repository contains code for reproducing the [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca) results using [low-rank adaptation (LoRA)](https://arxiv.org/pdf/2106.09685.pdf).
			
 
				-We aim to provide an Instruct model of similar quality to `text-davinci-003` that can run [on a Raspberry Pi](https://twitter.com/miolini/status/1634982361757790209) (for research),
			
 
				-but extensions to the `13b`, `30b`, and `65b` models should be feasible with simple changes to the code.
			
 
				+We provide an Instruct model of similar quality to `text-davinci-003` that can run [on a Raspberry Pi](https://twitter.com/miolini/status/1634982361757790209) (for research),
			
 
				+and the code can be easily extended to the `13b`, `30b`, and `65b` models.
			
 
				 
			
 
				 In addition to the training code, which runs within five hours on a single RTX 4090,
			
 
				 we publish a script for downloading and inference on the foundation model and LoRA,
			
@@ -22,7 +22,7 @@ is merged, users will need to replace their local `transformers` package.
 
				 1. Install dependencies (**install zphang's transformers fork**)
			
 
				 
			
 
				 ```
			
 
				-pip install -q datasets loralib sentencepiece
			
 
				+pip install -q datasets loralib sentencepiece accelerate
			
 
				 
			
 
				 pip uninstall transformers
			
 
				 pip install -q git+https://github.com/zphang/transformers@c3dc391