dongzenan/alpaca-lora @ 67077755175f72ad639ca2eb87918679c79f7190

No Description

Eric Wang 6707775517 README formatting		3 years ago
.gitignore	26f64780ad initial commit	3 years ago
DATA_LICENSE	63121244c8 Licenses and whatnot	3 years ago
LICENSE	63121244c8 Licenses and whatnot	3 years ago
README.md	6707775517 README formatting	3 years ago
alpaca_data.json	26f64780ad initial commit	3 years ago
conversion.py	26f64780ad initial commit	3 years ago
finetune.py	df2a5dc4be cleanup notebooks	3 years ago
generate.py	357ec81a17 decapoda	3 years ago
lengths.ipynb	26f64780ad initial commit	3 years ago
loss.ipynb	357ec81a17 decapoda	3 years ago

alpaca-lora (WIP)

This repository contains code for reproducing the Stanford Alpaca results. Users will need to be ready to fork transformers.

Setup

Install dependencies (install zphang's transformers fork)

pip install -q datasets accelerate loralib sentencepiece

pip install -q git+https://github.com/zphang/transformers@llama_push
pip install -q git+https://github.com/huggingface/peft.git

Install bitsandbytes from source

Inference

See generate.py. This file reads the decapoda-research/llama-7b-hf model from the Huggingface model hub and the LoRA weights from tloen/alpaca-lora-7b, and runs inference on a specified input. Users should treat this as example code for the use of the model, and modify it as needed.

Training

Under construction.

README.md

alpaca-lora (WIP)

Setup

Inference

Training