Skip to content

Instantly share code, notes, and snippets.

@SohelAhmedJoni
Last active November 1, 2023 21:06
Show Gist options
  • Save SohelAhmedJoni/7d905ee4b843e460ba1b19b4c4a6f52f to your computer and use it in GitHub Desktop.
Save SohelAhmedJoni/7d905ee4b843e460ba1b19b4c4a6f52f to your computer and use it in GitHub Desktop.
AI dataset
############################ dataset ############################
link: https://huggingface.co/datasets/liuhaotian/LLaVA-Instruct-150K
benchmark: https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM#how-good-is-the-data
############################ dataset sample ############################
count: 36441
sample:
[
{
"instruction": "What are the three primary colors?",
"input": "",
"output": [
"The three primary colors are red, blue, and yellow. These colors are called primary because they cannot be created by mixing other colors and all other colors can be made by combining them in various proportions. In the additive color system, used for light, the primary colors are red, green, and blue (RGB).",
"Red, Yellow, and Green."
]
},
{
"instruction": "Explain why the following fraction is equivalent to 1/4",
"input": "4/16",
"output": [
"The fraction 4/16 is equivalent to 1/4 because both numerators and denominators are divisible by 4. Dividing both the top and bottom numbers by 4 yields the fraction 1/4.",
"1/4 is the same as 1/4."
]
}
]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment