@hadyelsahar
Created January 30, 2018 12:35
wget https://www.dropbox.com/s/tohrsllcfy7rch4/SimpleQuestions_v2.tgz
tar -xvzf ./SimpleQuestions_v2.tgz
# number of unique predicates in the annotated SimpleQuestions dataset
# (the files are tab-separated: subject, predicate, object, question; cut splits on TAB by default)
cut -f2 ./SimpleQuestions_v2/annotated_fb_data_* | sort -u | wc -l
# 1837
# number of unique predicates in FB5M (a subset of Freebase)
# (tab-separated: subject, predicate, objects; cut splits on TAB by default)
cut -f2 ./SimpleQuestions_v2/freebase-subsets/freebase-FB5M.txt | sort -u | wc -l
# 7523
# SimpleQuestions uses only 1837 distinct predicates, versus 7523 in FB5M (about 24%)
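
The two counts above do not by themselves show that every SimpleQuestions predicate also occurs in FB5M. A minimal sketch of a direct check follows, assuming both files keep the predicate in tab-separated column 2; `shared_predicates` is a hypothetical helper name, not part of the dataset or the original script.

```shell
# Hypothetical helper: print the predicates (tab-separated column 2) that occur
# both in the question data read from stdin and in the knowledge-base file "$1".
# usage: cat QUESTION_FILES | shared_predicates KB_FILE
shared_predicates() {
  kb_preds=$(mktemp)
  # unique, sorted predicates of the knowledge-base file
  cut -f2 "$1" | sort -u > "$kb_preds"
  # unique, sorted predicates from stdin; comm -12 keeps lines common to both inputs
  cut -f2 | sort -u | comm -12 - "$kb_preds"
  rm -f "$kb_preds"
}

# Example call (assumes the archive was extracted as above):
# cat ./SimpleQuestions_v2/annotated_fb_data_* \
#   | shared_predicates ./SimpleQuestions_v2/freebase-subsets/freebase-FB5M.txt | wc -l
```

If the resulting count equals the number of unique SimpleQuestions predicates, then all of them are present in FB5M; a smaller count would mean some question predicates are missing from that subset.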