浏览代码

Add LLaMA-GPT4 dataset

Eric Wang 3 年之前
父节点
当前提交
fb9d9832e7
共有 3 个文件被更改,包括 110 次插入0 次删除
  1. 1 0
      README.md
  2. 0 0
      alpaca_data_cleaned_archive.json
  3. 109 0
      alpaca_data_gpt4.json

+ 1 - 0
README.md

@@ -2,6 +2,7 @@
 
 - 🤗 **Try the pretrained model out [here](https://huggingface.co/spaces/tloen/alpaca-lora), courtesy of a GPU grant from Huggingface!**
 - Users have created a Discord server for discussion and support [here](https://discord.gg/prbq284xX5)
+- 4/6: Repo has been updated with Microsoft Research's [LLaMA-GPT4 dataset](https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM).
 
 This repository contains code for reproducing the [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca) results using [low-rank adaptation (LoRA)](https://arxiv.org/pdf/2106.09685.pdf).
 We provide an Instruct model of similar quality to `text-davinci-003` that can run [on a Raspberry Pi](https://twitter.com/miolini/status/1634982361757790209) (for research),

+ 0 - 0
alpaca_data_cleaned.json → alpaca_data_cleaned_archive.json


文件差异内容过多而无法显示
+ 109 - 0
alpaca_data_gpt4.json


部分文件因为文件数量过多而无法显示