@talaugust
talaugust / README.md
Last active November 30, 2023 08:32
StoriesInTheWild dataset

This is the data from the StoriesInTheWild paper (citation below), split into two CSV files: stories.csv and ratings.csv.

Columns in stories.csv:

  • data_type: Image prompt the story is about
  • story_id: Unique story identifier
  • story: Story text (merged if written in chunks)
  • story_chunks: Only for stories written in chunks. Each chunk is separated by a newline containing ''
  • authorAge: Age of author
  • authorGender: Gender of author
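As a rough illustration of how the columns above might be read, here is a minimal sketch using only Python's standard library. The helper names `load_stories` and `split_chunks` are hypothetical, the two sample rows are made up for the demonstration, and `[SEP]` is a placeholder: the actual chunk separator token is not legible in this description, so check the real file and pass whatever it uses.

```python
import csv
import io

# Column names as listed in the description of stories.csv above.
STORY_COLUMNS = ["data_type", "story_id", "story",
                 "story_chunks", "authorAge", "authorGender"]


def load_stories(fileobj):
    """Read CSV rows into a list of dicts keyed by column name."""
    return list(csv.DictReader(fileobj))


def split_chunks(row, separator):
    """Split the story_chunks field on a caller-supplied separator.

    Returns an empty list for stories that were not written in chunks.
    """
    chunks = row.get("story_chunks") or ""
    if not chunks:
        return []
    return [c for c in chunks.split(separator) if c.strip()]


# Placeholder separator (an assumption, not the dataset's real token).
SEP = "\n[SEP]\n"

# Made-up sample rows, written to an in-memory CSV so the sketch runs
# without the real stories.csv.
buf = io.StringIO()
writer = csv.DictWriter(buf, fieldnames=STORY_COLUMNS)
writer.writeheader()
writer.writerow({"data_type": "prompt_a", "story_id": "s1",
                 "story": "Once there was a dog. It ran home.",
                 "story_chunks": "Once there was a dog." + SEP + "It ran home.",
                 "authorAge": "34", "authorGender": "female"})
writer.writerow({"data_type": "prompt_b", "story_id": "s2",
                 "story": "A short story written in one go.",
                 "story_chunks": "",
                 "authorAge": "27", "authorGender": "male"})

rows = load_stories(io.StringIO(buf.getvalue()))
for row in rows:
    print(row["story_id"], len(split_chunks(row, SEP)))
```

To run this on the real data, replace the in-memory buffer with something like `open("stories.csv", newline="", encoding="utf-8")`. The encoding is an assumption. Note that `csv.DictReader` returns every value as a string, so `authorAge` needs an explicit conversion before any numeric use.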
{"word":{"0":"id","1":"ii","2":"ind","3":"ib","4":"iv","5":"iin","6":"ia","7":"ih","8":"ill","9":"ide","10":"il","11":"iq","12":"ik","13":"ic","14":"iii","15":"ij","16":"ix","17":"int","18":"inc","19":"india","20":"ireland","21":"israel","22":"italy","23":"io","24":"ith","25":"iff","26":"ig","27":"im","28":"ideas","29":"identity","30":"image","31":"importance","32":"inches","33":"income","34":"increased","35":"independence","36":"index","37":"individual","38":"individuals","39":"industries","40":"industry","41":"information","42":"injury","43":"interest","44":"interests","45":"interpretation","46":"intervention","47":"interview","48":"investigation","49":"investment","50":"involved","51":"ip","52":"ir","53":"island","54":"islands","55":"issue","56":"issues","57":"item","58":"ite","59":"idea","60":"important","61":"impossible","62":"infection","63":"influence","64":"institutions","65":"ives","66":"iz","67":"ie","68":"iva","69":"ins","70":"ift","71":"increase","72":"institution","73":"ize","74":"ibid","75":"iu"