@sampottinger
Last active August 10, 2020 16:55
Data note for everyday health colab interview.

Exploring Conversations about Health at Scale during COVID-19

Hey there! Recently I was interviewed for a post on IDEO's Colab Blog which uses everyday health as an example to discuss how data science shows up on IDEO projects. Because some results were discussed to situate that example, this gist provides supplemental data and additional methodological details for the data discussed in that post. It is shared publicly to add important context to the visualizations and the discussion of preliminary results. It is not meant to be a complete paper. Other data sources are being considered by the project.


Additional Methodological Details

The Benjamini–Hochberg procedure controls the false discovery rate across multiple comparisons. Overlaps between topics are calculated with the Jaccard index. Health content is found by filtering for "health" in the attributes listed below for each dataset. The tokens (including hashtags) used in groups are found by identifying those that appear more often in health content (both before and during the pandemic). Specifically, the analysis finds tags and tokens unique to health conversations by comparing health-related content to content in general (the "reference group") on each platform, before clustering those into topical groups. Some tokens and tags are ambiguous or do not cluster into a larger group; these are placed in the "other" group.
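
As a rough illustration of the Jaccard index mentioned above, the sketch below computes the overlap between two sets. It assumes each hashtag group is represented by the set of posts it appears in; that representation, the function name `jaccard_index`, and the example post IDs are illustrative assumptions, not details taken from the analysis itself.

```python
def jaccard_index(a, b):
    """Return |A intersect B| / |A union B|, or 0.0 if both inputs are empty."""
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    if not union:
        return 0.0
    return len(set_a & set_b) / len(union)


# Hypothetical post IDs tagged with each hashtag group (made-up data).
diet_posts = {1, 2, 3, 4, 5}
exercise_posts = {4, 5, 6, 7}

# Two shared posts out of seven distinct posts overall.
overlap = jaccard_index(diet_posts, exercise_posts)
```

A value near 1 means the two groups almost always appear together, while a value near 0 means they rarely co-occur.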


Additional Sample Details

This post uses data from Tumblr (hashtags from 2008-11-10 to 2020-05-19), various news sources (titles from 2020-04-15 to 2020-05-15), and Crunchbase (company descriptions from 2020-01-01 to 2020-05-15). Other datasets are being considered in internal analysis to try to reach a broader audience and broader demographics.

News Sample

The news outlet sample includes the following (powered by News API):

Note that there are different numbers of articles for each outlet, but the frequencies are normalized such that each outlet has the same weight. These sources were chosen to cover a broad range of news topics and ideological audiences.
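
One way to give each outlet the same weight is to compute a token's share of articles within each outlet first and then average those shares across outlets. The sketch below shows that idea only; the exact normalization used in the analysis is not specified here, and the function name, data layout, and sample outlets are hypothetical.

```python
from collections import Counter


def outlet_weighted_frequency(articles_by_outlet):
    """Average per-outlet token shares so that each outlet counts equally.

    articles_by_outlet maps an outlet name to a list of articles, where each
    article is a set of tokens. Returns token -> mean share across outlets.
    """
    # Ignore outlets without articles to avoid dividing by zero.
    outlets = [name for name, articles in articles_by_outlet.items() if articles]
    totals = Counter()
    for name in outlets:
        articles = articles_by_outlet[name]
        # Count each token at most once per article.
        counts = Counter(token for article in articles for token in set(article))
        for token, count in counts.items():
            totals[token] += count / len(articles)
    return {token: share / len(outlets) for token, share in totals.items()}


# Made-up sample: one outlet with four articles, another with one.
sample = {
    "outlet_a": [{"virus"}, {"virus", "economy"}, {"economy"}, {"vote"}],
    "outlet_b": [{"virus"}],
}
frequencies = outlet_weighted_frequency(sample)
```

With this approach an outlet that publishes many articles does not dominate the combined frequency simply because of its volume.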

Tumblr

Tumblr allows for a sample of public conversations over a longer period of time. Furthermore, its ability to support long-form communication enabled the team to investigate rich stories. That said, as with any social media, the sample carries some bias and research must recognize the demographic skew of the platform. The sample size is also somewhat small compared to some other sources reviewed.

Crunchbase

Not all companies are documented within the platform and the potential bias in that sampling is unknown. That said, it still provides a broad perspective on startups / new companies and an important window into industry activity.


Statistical Test Results

In response to the question "Going back to what you said earlier, what has changed since COVID-19?" the results are significant (P < (i / 124) × 0.05, where i is the rank of each p-value among the 124 comparisons under the Benjamini–Hochberg procedure). Additionally, results in response to the question "What themes are emerging that we should all pay more attention to?" are significant (P < (i / 6) × 0.05, across 6 comparisons). Note that the response to "What did you learn about how the public thinks about health?" is simply the observation that diet and exercise appear at the top of the frequency distribution.
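
The thresholds of the form (i / m) 0.05 come from the Benjamini–Hochberg procedure: p-values are sorted, the i-th smallest is compared to (i / m) times the target false discovery rate, and every hypothesis up to the largest passing rank is rejected. The following is a minimal sketch of that step-up rule, not the code used for the analysis; the function name and the example p-values are assumptions, and it uses the conventional "less than or equal" comparison.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans marking which hypotheses are rejected."""
    m = len(p_values)
    # Indices of the p-values, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda idx: p_values[idx])

    # Find the largest 1-based rank i with p_(i) <= (i / m) * alpha.
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= (rank / m) * alpha:
            max_rank = rank

    # Reject every hypothesis ranked at or below that cutoff.
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_rank:
            rejected[idx] = True
    return rejected


# Made-up p-values: only the smallest passes its threshold of (1 / 4) * 0.05.
decisions = benjamini_hochberg([0.001, 0.04, 0.03, 0.5])
```

Because the rule is step-up, a p-value that misses its own threshold is still rejected when a larger p-value passes a later one.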


Supplemental Data

Summary data are made available under the Open Data Commons Attribution License: http://opendatacommons.org/licenses/by/1.0/. The following tables are available:

  • 2_sample_size.csv: Size of sample used across different dimensions.
  • 3_cross_group_topic_frequency_comparison.csv: Frequency (z-scores) of different tag groups across the different datasets examined.
  • 4_tumblr_jaccards.csv: Jaccard index for overlaps between different hashtag groups in the Tumblr health posts dataset.
  • 5_tumblr_health_tags_with_increase.csv: Tumblr hashtags with significant (P < (i/m) 0.05) increase from before pandemic to during (pre 2020 vs 2020).
  • 6_token_mapping.csv: Mapping from token to token group.
  • 7_other_tokens.csv: The list of tokens in the "other" group.
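
For readers unfamiliar with z-scores, the sketch below standardizes a set of group frequencies within one dataset so that datasets of very different sizes can be compared on a common scale. It assumes the population standard deviation; whether the published table used the population or sample form is not stated here, and the function name and counts are made up for illustration.

```python
from statistics import mean, pstdev


def z_scores(frequencies):
    """Standardize group frequencies to mean 0 and unit (population) variance."""
    values = list(frequencies.values())
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # All groups are equally frequent, so no group stands out.
        return {group: 0.0 for group in frequencies}
    return {group: (value - mu) / sigma for group, value in frequencies.items()}


# Hypothetical counts of posts per tag group in a single dataset.
scores = z_scores({"diet": 30, "exercise": 20, "pain": 10})
```

A positive score means the group is more frequent than the average group in that dataset, and a negative score means it is less frequent.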

Future Work

Future work could investigate additional platforms and databases which may have different bias / demographic skew. Additionally, work in languages other than English is important to broadening these perspectives. Furthermore, this investigation focuses on passive observation, but direct engagement through tools like surveys could further complement these methods. Next, even though the data yield significant results, the data could be expanded to achieve larger sample sizes. Finally, further investigation into the "other" group could reveal additional insights, and other forms of topic modeling (other than clustering terms of higher frequency compared to a "reference sample") could be explored, though the brevity of the observations means that many topic modeling techniques like LDA performed poorly. Of course, a study over a longer time frame could help with addressing seasonality, which was not considered in the results presented in the blog post.

2_sample_size.csv

Source,Total,Health,Health During Pandemic,Units
Crunchbase,752,565,156,Companies
News Articles,14818,9422,5020,Articles
Tumblr,19088,5104,3067,Posts
Total,34658,15091,8243,

3_cross_group_topic_frequency_comparison.csv

,public,group,company,news
0,-0.5168681032691294,time,-0.7367064943094048,-0.13735985481178722
1,-0.4304490732639405,pain,-0.3779624622978687,-0.4473464765682405
2,-0.4823004912670538,death,-0.6470204863065209,-0.13279300765418492
3,-0.4823004912670538,cancer,-0.3779624622978687,-0.42062812875199296
4,-0.493823028601079,testing,-0.7367064943094048,-0.25376513777352977
5,-0.5226293719361419,opioid,-0.7367064943094048,-0.44456436312590675
6,-0.40164272992887756,chronic illness,-0.7367064943094048,-0.4459534232575277
7,0.9753004814871313,inspiration,-0.28827645429498466,-0.40607527008984523
8,-0.34979131192576435,politics,-0.6470204863065209,-0.16638091851200842
9,-0.5053455659351042,economy,-0.6470204863065209,0.07641226763759607
10,-0.07325041590916002,medicine,1.9538737457771167,-0.2348513428819704
11,2.9859832462745244,diet,0.24983959372231965,-0.373320249221212
12,-0.395881461261865,alternative medicine,-0.7367064943094048,-0.4497871413371824
13,-0.36707511792680203,loneliness,-0.7367064943094048,-0.4213782176533231
14,-0.5226293719361419,vaccine,-0.5573344783036367,-0.31457964359477336
15,-0.4765392226000413,blood,-0.7367064943094048,-0.382331847894333
16,-0.5053455659351042,research,-0.7367064943094048,-0.24151538663892846
17,-0.3382687745917391,technology,2.491989793794421,-0.18186776920032685
18,-0.27489481925460063,health care system,1.6848157217684645,0.47324744304696853
19,-0.4707779539330286,government,0.787955641739624,1.1442256745162132
20,2.738248693592983,exercise,0.5188976177309718,-0.4043217475057806
21,2.17364436422575,disease,0.6982696337367398,4.695047108731936
22,-0.10205675924422296,care,2.312617777788653,0.5710607393668473
23,-0.5226293719361419,law,-0.6470204863065209,-0.4308623284443466
24,-0.5168681032691294,abortion,-0.7367064943094048,-0.4247912246934133
25,0.7333271974726024,mental health,0.0704675777165516,-0.3225185366507591
26,-0.5226293719361419,wearable,-0.19859044629210057,-0.4497871413371824
27,-0.3325075059247265,science,0.24983959372231965,0.5267859242989942

4_tumblr_jaccards.csv

index,hashtagGroup1,hashtagGroup2,jaccard
0,exercise,inspiration,0.16666666666666666
1,diet,exercise,0.42424242424242425
2,diet,inspiration,0.10559796437659033
3,exercise,mental health,0.041168658698539175
4,diet,mental health,0.07124352331606218
5,mental health,politics,0.008130081300813009
6,care,government,0.030303030303030304
7,care,diet,0.004511278195488722
8,exercise,technology,0.003355704697986577
9,death,disease,0.006355932203389831
10,care,medicine,0.17346938775510204
11,diet,medicine,0.0030165912518853697
12,disease,mental health,0.03003003003003003
13,chronic illness,diet,0.04272151898734177
14,chronic illness,mental health,0.1452991452991453
15,diet,loneliness,0.04262295081967213
16,chronic illness,loneliness,0.4807692307692308
17,loneliness,mental health,0.12385321100917432
18,diet,disease,0.01892147587511826
19,inspiration,mental health,0.02795698924731183
20,chronic illness,pain,0.17857142857142858
21,mental health,pain,0.035398230088495575
22,medicine,politics,0.011764705882352941
23,disease,medicine,0.03353057199211045
24,health care system,mental health,0.00904977375565611
25,mental health,time,0.0045871559633027525
26,disease,politics,0.0163265306122449
27,loneliness,pain,0.04878048780487805
28,diet,pain,0.00482315112540193
29,chronic illness,medicine,0.0392156862745098
30,chronic illness,politics,0.02564102564102564
31,diet,politics,0.001567398119122257
32,loneliness,politics,0.017857142857142856
33,chronic illness,disease,0.0136986301369863
34,care,chronic illness,0.02830188679245283
35,care,disease,0.03536345776031434
36,care,pain,0.013513513513513514
37,mental health,science,0.024489795918367346
38,politics,technology,0.03333333333333333
39,cancer,diet,0.0016260162601626016
40,disease,exercise,0.01671583087512291
41,care,exercise,0.00644122383252818
42,disease,inspiration,0.0027548209366391185
43,mental health,research,0.0091324200913242
44,disease,government,0.008456659619450317
45,disease,science,0.024539877300613498
46,inspiration,science,0.006872852233676976
47,diet,science,0.006269592476489028
48,exercise,science,0.0050335570469798654
49,science,testing,0.08571428571428572
50,disease,economy,0.0042643923240938165
51,exercise,medicine,0.001610305958132045
52,medicine,mental health,0.01107011070110701
53,disease,testing,0.006382978723404255
54,care,economy,0.01639344262295082
55,mental health,technology,0.008064516129032258
56,death,diet,0.004893964110929853
57,death,mental health,0.013513513513513514
58,chronic illness,death,0.05555555555555555
59,death,loneliness,0.0625
60,inspiration,research,0.003816793893129771
61,exercise,research,0.0017605633802816902
62,disease,health care system,0.004246284501061571
63,politics,science,0.05
64,care,mental health,0.007272727272727273
65,blood,care,0.015151515151515152
66,disease,technology,0.002004008016032064
67,cancer,medicine,0.016129032258064516
68,death,pain,0.045454545454545456
69,loneliness,science,0.01694915254237288
70,chronic illness,science,0.012195121951219513
71,exercise,pain,0.0017211703958691911
72,care,science,0.01098901098901099
73,care,testing,0.015873015873015872
74,care,health care system,0.03225806451612903
75,health care system,medicine,0.016666666666666666
76,science,technology,0.015625
77,cancer,exercise,0.0017482517482517483

5_tumblr_health_tags_with_increase.csv

token beforePandemicCount duringPandemicCount beforePandemicPercent duringPandemicPercent pValue isNovel
coronavirus 0 276 0.0 0.08999021845451580 1.14385796423166E-43 1
covid19 0 188 0.0 0.06129768503423540 1.17070807351947E-29 1
healthyeating 18 104 0.008836524300441830 0.033909357678513200 1.61090665344988E-08 0
tophealthnewssciencedaily 5 81 0.002454590083456060 0.026410172807303600 1.54619935162584E-10 0
mental 7 47 0.003436426116838490 0.015324421258558900 8.65707374243805E-05 0
virus 0 44 0.0 0.014346266710140200 1.32955465168414E-07 1
anxiety 12 51 0.005891016200294550 0.016628627323117100 0.0010642185014306100 0
pandemic 0 33 0.0 0.010759700032605200 6.22761712808061E-06 1
anxious 0 31 0.0 0.010107597000326100 1.25714238526998E-05 1
healthbenefits 2 32 0.0009818360333824250 0.010433648516465600 0.00010045545169074900 0
eatingdisorder 0 29 0.0 0.009455493968046950 2.54128134928934E-05 1
anorexia 2 30 0.0009818360333824250 0.0097815454841865 0.00019972647028843200 0
depressed 2 29 0.0009818360333824250 0.009455493968046950 0.00028157686148914000 0
bulimia 2 29 0.0009818360333824250 0.009455493968046950 0.00028157686148914000 0
quote 1 27 0.0004909180166912130 0.008803390935767850 0.00018124828493768400 0
healthyliving 16 57 0.007854688267059400 0.018584936419954400 0.0023548348661840300 0
healthtipsandtricks 0 25 0.0 0.008151287903488750 0.0001043984809947880 1
suicidal 0 24 0.0 0.0078252363873492 0.00014883212974236400 1
udemy 0 23 0.0 0.007499184871209650 0.00021231651687565400 1
selfharm 1 25 0.0004909180166912130 0.008151287903488750 0.00036538384019529700 0
suicide 1 25 0.0004909180166912130 0.008151287903488750 0.00036538384019529700 0
memes 0 22 0.0 0.007173133355070100 0.0003030990727186660 1
alone 1 24 0.0004909180166912130 0.0078252363873492 0.0005190479627536010 0
meme 2 25 0.0009818360333824250 0.008151287903488750 0.0011107798247610200 0
pakistan 2 24 0.0009818360333824250 0.0078252363873492 0.001564714698153730 0
corona 0 18 0.0 0.0058689272905119000 0.0012701721141334300 1
quarantine 0 18 0.0 0.0058689272905119000 0.0012701721141334300 1
lonely 1 21 0.0004909180166912130 0.006847081838930550 0.0014914751762104800 0
quotes 5 31 0.002454590083456060 0.010107597000326100 0.0024565567910147100 0
covid 0 17 0.0 0.005542875774372350 0.0018223706146240200 1
eatingdisorders 3 24 0.0014727540500736400 0.0078252363873492 0.004146146995348200 0
mindfulness 5 29 0.002454590083456060 0.009455493968046950 0.004577323146608540 0
sadquote 0 15 0.0 0.00489077274209325 0.003767726618655310 1
sadquotes 0 15 0.0 0.00489077274209325 0.003767726618655310 1
anorexic 0 14 0.0 0.004564721225953700 0.005431575306979790 1
infectiousdiseases 0 14 0.0 0.004564721225953700 0.005431575306979790 1
fitblr 50 115 0.024545900834560600 0.03749592435604830 0.013107672410715800 0
media 21 61 0.010309278350515500 0.019889142484512600 0.010707681873183800 0
bulimic 0 13 0.0 0.004238669709814150 0.007846427171112870 1
immunesystem 1 16 0.0004909180166912130 0.005216824258232800 0.008750192578603450 0
sad 6 29 0.0029455081001472800 0.009455493968046950 0.00968945605338446 0
covidー19 0 12 0.0 0.0039126181936746 0.011362185260620100 1
gna 0 12 0.0 0.0039126181936746 0.011362185260620100 1
coronavirusoutbreak 0 12 0.0 0.0039126181936746 0.011362185260620100 1
cafs 0 12 0.0 0.0039126181936746 0.011362185260620100 1
lockdown 0 12 0.0 0.0039126181936746 0.011362185260620100 1
ebola 21 59 0.010309278350515500 0.019237039452233500 0.016412555483508600 0
selfhate 0 11 0.0 0.0035865666775350500 0.016499664971537900 1
blackandwhite 0 11 0.0 0.0035865666775350500 0.016499664971537900 1
staysafe 0 11 0.0 0.0035865666775350500 0.016499664971537900 1
propaganda 21 58 0.010309278350515500 0.018910987936093900 0.02022751999767050 0
healthblr 15 46 0.007363770250368190 0.014998369742419300 0.019991855569415200 0
healing 7 29 0.003436426116838490 0.009455493968046950 0.01899862146585070 0
runblr 3 19 0.0014727540500736400 0.006194978806651450 0.021237372299259100 0

6_token_mapping.csv

token group
coronavirus disease
economy economy
economic economy
unemployment economy
health general health
covid-19 disease
us government
virus disease
pandemic disease
death death
may time
case disease
care care
hospital care, health care system
mental mental health
measles disease
study research
doctor care
outbreak disease
risk science
official government
abortion abortion
medical medicine
cancer cancer
u.s government
report science
bill government
un government
california government
democrats politics
chief government
testing testing, health care system, care
cdc science
york government
federal government
nursing care, health care system
sanders politics
biden politics
vaccine vaccine, medicine
app technology
workers economy
suicide mental health
korea government
election politics
brazil government
tracing technology
georgia government
contact technology
opioid opioid
scientist science
worker economy
ebola disease
mayor government
dr care, health care system
patient care, health care system
spread disease
killed death
journal science
doctors care, health care system
alabama government
border government
medicare government, health care system
vote politics
covid disease
lawsuit law
senate government
democrat politics
eu government
administration government
weight diet
blood blood, care
mexico government
clinic health care system, care
fitness exercise
healthcare care, health care system
nutrition diet
healthy general wellness
weightloss diet
coronavirus disease
wellness general wellness
life general wellness
motivation inspiration
lifestyle general wellness
mentalhealth mental health
covid19 disease
fitblr inspiration
exercise exercise
food diet
healthyeating diet
workout exercise
diet diet
ebola disease
healthyliving general wellness
medical medicine
selfcare general wellness
healthfitness exercise
depression mental health
loseweight diet
anxiety mental health
fitspo inspiration
weight diet
healthblr inspiration
mental mental health
technology technology
running exercise
virus disease
gym exercise
medicine medicine
politics politics
fitspiration inspiration
doctor care, health care system
alternativemedicine alternative medicine, medicine
yoga exercise
sad mental health
mindfulness mental health
anorexia diet
bulimia diet
depressed mental health
anxious mental health
chronicillness chronic illness
eatingdisorder mental health, diet
stress mental health
selflove general wellness
eatingdisorders mental health, diet
pain pain
selfharm mental health
suicide mental health
fitfam inspiration
wellbeing general wellness
hiv disease, chronic illness
alone loneliness
fit exercise
healing general wellness
healthandwellness general wellness
disease disease
selfhealing general wellness
suicidal mental health
thoughts mental health
vegetarian diet
supplements diet
healthyfood diet
lonely loneliness
losingweight diet
runblr inspiration
sports exercise
aids disease, chronic illness
pandemic disease
weightlossjourney diet
healthandfitness general wellness
care general wellness
meditation mental health
science science
psychology mental health
supplement diet
goalweight diet
vitamin diet
weightlog diet
corona disease
chronicpain chronic illness, pain
health general wellness
covid19 disease
healthcare care
coronavirus disease
fitness exercise
wellness general wellness
ai technology
data technology
technology technology
wearable wearable
community inspiration

7_other_tokens.csv

bbc
could
fox
help
know
latest
2020
spears
britney
rise
ship
americans
heart
ban
today
cruise
warns
linked
rate
likely
early
threat
questions
release
concern
need
city
fighting
poll
healthtips
gesundheit
deutschesärzteblattaktuelles
cdnhealth
tips
breakingnews
tophealthnewssciencedaily
media
nyt
propaganda
personal
body
dailymail
healthbenefits
pakistan
healthtipsandtricks
women
udemy
infraredsauna
quotes
recovery
drpounders
healthcaribbean
Open Data Commons Attribution License (ODC-By) v1.0
Disclaimer
Open Data Commons is not a law firm and does not provide legal services of any kind.
Open Data Commons has no formal relationship with you. Your receipt of this document does not create any kind of agent-client relationship. Please seek the advice of a suitably qualified legal professional licensed to practice in your jurisdiction before using this document.
No warranties and disclaimer of any damages. This information is provided ‘as is‘, and this site makes no warranties on the information provided. Any damages resulting from its use are disclaimed.
A plain language summary of the ODC Attribution License (ODC-BY) is available as well as a plain text version.
Attribution License (ODC-By)
Preamble
The Open Data Commons Attribution License is a license agreement intended to allow users to freely share, modify, and use this Database subject only to the attribution requirements set out in Section 4.
Databases can contain a wide variety of types of content (images, audiovisual material, and sounds all in the same database, for example), and so this license only governs the rights over the Database, and not the contents of the Database individually. Licensors may therefore wish to use this license together with another license for the contents.
Sometimes the contents of a database, or the database itself, can be covered by other rights not addressed here (such as private contracts, trademark over the name, or privacy rights / data protection rights over information in the contents), and so you are advised that you may have to consult other documents or clear other rights before doing activities not covered by this License.
The Licensor (as defined below)
and
You (as defined below)
agree as follows:
1.0 Definitions of Capitalised Words
“Collective Database” – Means this Database in unmodified form as part of a collection of independent databases in themselves that together are assembled into a collective whole. A work that constitutes a Collective Database will not be considered a Derivative Database.
“Convey” – As a verb, means Using the Database, a Derivative Database, or the Database as part of a Collective Database in any way that enables a Person to make or receive copies of the Database or a Derivative Database. Conveying does not include interaction with a user through a computer network, or creating and Using a Produced Work, where no transfer of a copy of the Database or a Derivative Database occurs.
“Contents” – The contents of this Database, which includes the information, independent works, or other material collected into the Database. For example, the contents of the Database could be factual data or works such as images, audiovisual material, text, or sounds.
“Database” – A collection of material (the Contents) arranged in a systematic or methodical way and individually accessible by electronic or other means offered under the terms of this License.
“Database Directive” – Means Directive 96/9/EC of the European Parliament and of the Council of 11 March 1996 on the legal protection of databases, as amended or succeeded.
“Database Right” – Means rights resulting from the Chapter III (“sui generis”) rights in the Database Directive (as amended and as transposed by member states), which includes the Extraction and Re-utilisation of the whole or a Substantial part of the Contents, as well as any similar rights available in the relevant jurisdiction under Section 10.4.
“Derivative Database” – Means a database based upon the Database, and includes any translation, adaptation, arrangement, modification, or any other alteration of the Database or of a Substantial part of the Contents. This includes, but is not limited to, Extracting or Re-utilising the whole or a Substantial part of the Contents in a new Database.
“Extraction” – Means the permanent or temporary transfer of all or a Substantial part of the Contents to another medium by any means or in any form.
“License” – Means this license agreement and is both a license of rights such as copyright and Database Rights and an agreement in contract.
“Licensor” – Means the Person that offers the Database under the terms of this License.
“Person” – Means a natural or legal person or a body of persons corporate or incorporate.
“Produced Work” – a work (such as an image, audiovisual material, text, or sounds) resulting from using the whole or a Substantial part of the Contents (via a search or other query) from this Database, a Derivative Database, or this Database as part of a Collective Database.
“Publicly” – means to Persons other than You or under Your control by either more than 50% ownership or by the power to direct their activities (such as contracting with an independent consultant).
“Re-utilisation” – means any form of making available to the public all or a Substantial part of the Contents by the distribution of copies, by renting, by online or other forms of transmission.
“Substantial” – Means substantial in terms of quantity or quality or a combination of both. The repeated and systematic Extraction or Re-utilisation of insubstantial parts of the Contents may amount to the Extraction or Re-utilisation of a Substantial part of the Contents.
“Use” – As a verb, means doing any act that is restricted by copyright or Database Rights whether in the original medium or any other; and includes without limitation distributing, copying, publicly performing, publicly displaying, and preparing derivative works of the Database, as well as modifying the Database as may be technically necessary to use it in a different mode or format.
“You” – Means a Person exercising rights under this License who has not previously violated the terms of this License with respect to the Database, or who has received express permission from the Licensor to exercise rights under this License despite a previous violation.
Words in the singular include the plural and vice versa.
2.0 What this License covers
2.1. Legal effect of this document. This License is:
a. A license of applicable copyright and neighbouring rights;
b. A license of the Database Right; and
c. An agreement in contract between You and the Licensor.
2.2 Legal rights covered. This License covers the legal rights in the Database, including:
a. Copyright. Any copyright or neighbouring rights in the Database. The copyright licensed includes any individual elements of the Database, but does not cover the copyright over the Contents independent of this Database. See Section 2.4 for details. Copyright law varies between jurisdictions, but is likely to cover: the Database model or schema, which is the structure, arrangement, and organisation of the Database, and can also include the Database tables and table indexes; the data entry and output sheets; and the Field names of Contents stored in the Database;
b. Database Rights. Database Rights only extend to the Extraction and Re-utilisation of the whole or a Substantial part of the Contents. Database Rights can apply even when there is no copyright over the Database. Database Rights can also apply when the Contents are removed from the Database and are selected and arranged in a way that would not infringe any applicable copyright; and
c. Contract. This is an agreement between You and the Licensor for access to the Database. In return you agree to certain conditions of use on this access as outlined in this License.
2.3 Rights not covered.
a. This License does not apply to computer programs used in the making or operation of the Database;
b. This License does not cover any patents over the Contents or the Database; and
c. This License does not cover any trademarks associated with the Database.
2.4 Relationship to Contents in the Database. The individual items of the Contents contained in this Database may be covered by other rights, including copyright, patent, data protection, privacy, or personality rights, and this License does not cover any rights (other than Database Rights or in contract) in individual Contents contained in the Database.
For example, if used on a Database of images (the Contents), this License would not apply to copyright over individual images, which could have their own separate licenses, or one single license covering all of the rights over the images.
3.0 Rights granted
3.1 Subject to the terms and conditions of this License, the Licensor grants to You a worldwide, royalty-free, non-exclusive, terminable (but only under Section 9) license to Use the Database for the duration of any applicable copyright and Database Rights. These rights explicitly include commercial use, and do not exclude any field of endeavour. To the extent possible in the relevant jurisdiction, these rights may be exercised in all media and formats whether now known or created in the future.
The rights granted cover, for example:
a. Extraction and Re-utilisation of the whole or a Substantial part of the Contents;
b. Creation of Derivative Databases;
c. Creation of Collective Databases;
d. Creation of temporary or permanent reproductions by any means and in any form, in whole or in part, including of any Derivative Databases or as a part of Collective Databases; and
e. Distribution, communication, display, lending, making available, or performance to the public by any means and in any form, in whole or in part, including of any Derivative Database or as a part of Collective Databases.
3.2 Compulsory license schemes. For the avoidance of doubt:
a. Non-waivable compulsory license schemes. In those jurisdictions in which the right to collect royalties through any statutory or compulsory licensing scheme cannot be waived, the Licensor reserves the exclusive right to collect such royalties for any exercise by You of the rights granted under this License;
b. Waivable compulsory license schemes. In those jurisdictions in which the right to collect royalties through any statutory or compulsory licensing scheme can be waived, the Licensor waives the exclusive right to collect such royalties for any exercise by You of the rights granted under this License; and,
c. Voluntary license schemes. The Licensor waives the right to collect royalties, whether individually or, in the event that the Licensor is a member of a collecting society that administers voluntary licensing schemes, via that society, from any exercise by You of the rights granted under this License.
3.3 The right to release the Database under different terms, or to stop distributing or making available the Database, is reserved. Note that this Database may be multiple-licensed, and so You may have the choice of using alternative licenses for this Database. Subject to Section 10.4, all other rights not expressly granted by Licensor are reserved.
4.0 Conditions of Use
4.1 The rights granted in Section 3 above are expressly made subject to Your complying with the following conditions of use. These are important conditions of this License, and if You fail to follow them, You will be in material breach of its terms.
4.2 Notices. If You Publicly Convey this Database, any Derivative Database, or the Database as part of a Collective Database, then You must:
a. Do so only under the terms of this License;
b. Include a copy of this License or its Uniform Resource Identifier (URI) with the Database or Derivative Database, including both in the Database or Derivative Database and in any relevant documentation;
c. Keep intact any copyright or Database Right notices and notices that refer to this License; and
d. If it is not possible to put the required notices in a particular file due to its structure, then You must include the notices in a location (such as a relevant directory) where users would be likely to look for it.
4.3 Notice for using output (Contents). Creating and Using a Produced Work does not require the notice in Section 4.2. However, if you Publicly Use a Produced Work, You must include a notice associated with the Produced Work reasonably calculated to make any Person that uses, views, accesses, interacts with, or is otherwise exposed to the Produced Work aware that Content was obtained from the Database, Derivative Database, or the Database as part of a Collective Database, and that it is available under this License.
a. Example notice. The following text will satisfy notice under Section 4.3:
Contains information from DATABASE NAME which is made available
under the ODC Attribution License.
DATABASE NAME should be replaced with the name of the Database and a hyperlink to the location of the Database. “ODC Attribution License” should contain a hyperlink to the URI of the text of this License. If hyperlinks are not possible, You should include the plain text of the required URI’s with the above notice.
4.4 Licensing of others. You may not sublicense the Database. Each time You communicate the Database, the whole or Substantial part of the Contents, or any Derivative Database to anyone else in any way, the Licensor offers to the recipient a license to the Database on the same terms and conditions as this License. You are not responsible for enforcing compliance by third parties with this License, but You may enforce any rights that You have over a Derivative Database. You are solely responsible for any modifications of a Derivative Database made by You or another Person at Your direction. You may not impose any further restrictions on the exercise of the rights granted or affirmed under this License.
5.0 Moral rights
5.1 Moral rights. This section covers moral rights, including any rights to be identified as the author of the Database or to object to treatment that would otherwise prejudice the author’s honour and reputation, or any other derogatory treatment:
a. For jurisdictions allowing waiver of moral rights, Licensor waives all moral rights that Licensor may have in the Database to the fullest extent possible by the law of the relevant jurisdiction under Section 10.4;
b. If waiver of moral rights under Section 5.1 a in the relevant jurisdiction is not possible, Licensor agrees not to assert any moral rights over the Database and waives all claims in moral rights to the fullest extent possible by the law of the relevant jurisdiction under Section 10.4; and
c. For jurisdictions not allowing waiver or an agreement not to assert moral rights under Section 5.1 a and b, the author may retain their moral rights over certain aspects of the Database.
Please note that some jurisdictions do not allow for the waiver of moral rights, and so moral rights may still subsist over the Database in some jurisdictions.
6.0 Fair dealing, Database exceptions, and other rights not affected
6.1 This License does not affect any rights that You or anyone else may independently have under any applicable law to make any use of this Database, including without limitation:
a. Exceptions to the Database Right including: Extraction of Contents from non-electronic Databases for private purposes, Extraction for purposes of illustration for teaching or scientific research, and Extraction or Re-utilisation for public security or an administrative or judicial procedure.
b. Fair dealing, fair use, or any other legally recognised limitation or exception to infringement of copyright or other applicable laws.
6.2 This License does not affect any rights of lawful users to Extract and Re-utilise insubstantial parts of the Contents, evaluated quantitatively or qualitatively, for any purposes whatsoever, including creating a Derivative Database (subject to other rights over the Contents, see Section 2.4). The repeated and systematic Extraction or Re-utilisation of insubstantial parts of the Contents may however amount to the Extraction or Re-utilisation of a Substantial part of the Contents.
7.0 Warranties and Disclaimer
7.1 The Database is licensed by the Licensor “as is” and without any warranty of any kind, either express, implied, or arising by statute, custom, course of dealing, or trade usage. Licensor specifically disclaims any and all implied warranties or conditions of title, non-infringement, accuracy or completeness, the presence or absence of errors, fitness for a particular purpose, merchantability, or otherwise. Some jurisdictions do not allow the exclusion of implied warranties, so this exclusion may not apply to You.
8.0 Limitation of liability
8.1 Subject to any liability that may not be excluded or limited by law, the Licensor is not liable for, and expressly excludes, all liability for loss or damage however and whenever caused to anyone by any use under this License, whether by You or by anyone else, and whether caused by any fault on the part of the Licensor or not. This exclusion of liability includes, but is not limited to, any special, incidental, consequential, punitive, or exemplary damages such as loss of revenue, data, anticipated profits, and lost business. This exclusion applies even if the Licensor has been advised of the possibility of such damages.
8.2 If liability may not be excluded by law, it is limited to actual and direct financial loss to the extent it is caused by proved negligence on the part of the Licensor.
9.0 Termination of Your rights under this License
9.1 Any breach by You of the terms and conditions of this License automatically terminates this License with immediate effect and without notice to You. For the avoidance of doubt, Persons who have received the Database, the whole or a Substantial part of the Contents, Derivative Databases, or the Database as part of a Collective Database from You under this License will not have their licenses terminated provided their use is in full compliance with this License or a license granted under Section 4.8 of this License. Sections 1, 2, 7, 8, 9 and 10 will survive any termination of this License.
9.2 If You are not in breach of the terms of this License, the Licensor will not terminate Your rights under it.
9.3 Unless terminated under Section 9.1, this License is granted to You for the duration of applicable rights in the Database.
9.4 Reinstatement of rights. If you cease any breach of the terms and conditions of this License, then your full rights under this License will be reinstated:
a. Provisionally and subject to permanent termination until the 60th day after cessation of breach;
b. Permanently on the 60th day after cessation of breach unless otherwise reasonably notified by the Licensor; or
c. Permanently if reasonably notified by the Licensor of the violation, this is the first time You have received notice of violation of this License from the Licensor, and You cure the violation prior to 30 days after your receipt of the notice.
9.5 Notwithstanding the above, Licensor reserves the right to release the Database under different license terms or to stop distributing or making available the Database. Releasing the Database under different license terms or stopping the distribution of the Database will not withdraw this License (or any other license that has been, or is required to be, granted under the terms of this License), and this License will continue in full force and effect unless terminated as stated above.
10.0 General
10.1 If any provision of this License is held to be invalid or unenforceable, that must not affect the validity or enforceability of the remainder of the terms and conditions of this License and each remaining provision of this License shall be valid and enforced to the fullest extent permitted by law.
10.2 This License is the entire agreement between the parties with respect to the rights granted here over the Database. It replaces any earlier understandings, agreements or representations with respect to the Database.
10.3 If You are in breach of the terms of this License, You will not be entitled to rely on the terms of this License or to complain of any breach by the Licensor.
10.4 Choice of law. This License takes effect in and will be governed by the laws of the relevant jurisdiction in which the License terms are sought to be enforced. If the standard suite of rights granted under applicable copyright law and Database Rights in the relevant jurisdiction includes additional rights not granted under this License, these additional rights are granted in this License in order to meet the terms of this License.