Skip to content

Instantly share code, notes, and snippets.

@ricardoaat
Created July 24, 2018 12:54
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save ricardoaat/7964d80e8ba506c59a4742723fc58789 to your computer and use it in GitHub Desktop.
Save ricardoaat/7964d80e8ba506c59a4742723fc58789 to your computer and use it in GitHub Desktop.
Crawl metadata fields
{
"schema": {
"@context": "http://schema.org",
"@type": "Dataset",
"description": "BioSamples stores and supplies descriptions and metadata about biological samples used in research and development by academia and industry. Samples are either 'reference' samples (e.g. from 1000 Genomes, HipSci, FAANG) or have been used in an assay database such as the European Nucleotide Archive (ENA) or ArrayExpress.",
"includedInDataCatalog": {
"@id": "https://www.ebi.ac.uk/biosamples",
"@type": "DataCatalog"
},
"name": "Sample collection",
"page": "https://www.ebi.ac.uk/biosamples/samples/?start=19380"
},
"url": "http://wp-p1m-41.ebi.ac.uk:8081/biosamples/samples",
"datetime": "2018-06-07T10:39:12.445037038+01:00",
"schemaSourceType": "Microdata",
"IP-address": "xxx.xxx.xxx.xxx",
"contentType": "html",
"sourceLinkType": "sitemap.xml",
"crawler": {
"crawler-id": 21,
"crawler-name": "leCrawler"
}
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment