Skip to content

Instantly share code, notes, and snippets.

@VladimirAlexiev
Created March 14, 2019 10:42
Show Gist options
  • Star 3 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save VladimirAlexiev/fafdcb258d067cfed272664d6ae5d982 to your computer and use it in GitHub Desktop.
Save VladimirAlexiev/fafdcb258d067cfed272664d6ae5d982 to your computer and use it in GitHub Desktop.
Science Classification

Science Classifications

This describes datasets for scientific topic classification. The list was developed as part of work on the project Tracking of Research Results for EC DG RTD.

Microsoft Academic Graph

MAG fields of study (FOS): developed using artificial intelligence and semantic understanding of content.

  • 229k FOS, organized in a non-mutually exclusive hierarchy with 19 top-level FOS.
  • View
  • Downloadable as CSV tables from Azure. Relational schema
    • FieldsOfStudy
    • FieldsOfStudyChildren
    • RelatedFieldOfStudy
    • includes Rank (just like papers, authors, orgs)
  • RDF dump from ma-graph.org. RDF schema
  • MAG applies FOS to papers, journals, authors, orgs

Australian and New Zealand Standard Research Classification (ANZSRC)

Description. Released on 31 March 2008.

  • The prev version (1998) was called Research Fields, Courses and Disciplines (RFCD)
  • Digital-Science Dimensions: "The leading categorization system with broad coverage of subject areas and a large general corpus of training material is ANZSRC FOR. This classification “lens” has been made available as part of the free Dimensions version"

Includes these classifications:

  • Fields of Research (FOR). Appendix 1 FOR Fields by Code Number
    • 1419 nodes at 3 levels: 22 Divisions, 157 Groups, 1340 Fields.
    • Eg 01 MATHEMATICAL SCIENCES> 0105 MATHEMATICAL PHYSICS> 010501 Algebraic Structures in Mathematical Physics
  • Socio-economic Objective (SEO). Appendix 2 SEO Objectives by Code Number
    • 991 nodes at 4 levels: 5 Sectors, 17 Divisions, 119 Groups, 850 Objectives
    • eg SECTOR B ECONOMIC DEVELOPMENT > DIVISION 82 PLANT PRODUCTION AND PLANT PRIMARY PRODUCTS > GROUP 8201 FORESTRY> 820102 Harvesting and Transport of Forest Products
  • Type of Activity (TOA): where is it?

Scopus Subject Areas

  • 335 nodes at 2 levels: 4 subjects, 331 codes.
  • The top level is very flat: only Life Sciences, Health Sciences, Social Sciences & Humanities, Physical Sciences (eg both Software and Earth and Planetary Sciences are in Physical Sciences)
  • All Science Journal Classification Codes (ASJC) are similar
  • Table, CSV
  • Used in CrossRef "Category labels come from this list published by Scopus"
  • Discussion: CrossRef/rest-api-doc#422

Science-Metrix Classification

  • Description, CSV
  • 3 levels and 180 values
  • Includes a list of journals with classification

EU Project Subjects

cordisref-sicCode.csv. About 20-30 topic areas.

Eg WAS is "Waste Management" and includes the following keywords (sub-categories?)

  • recycling
  • recovery and reclamation
  • incineration and pyrolysis
  • land and sea disposal
  • bioconversion
  • landfill
  • industrial waste
  • energy from waste

Computer Science Ontology

Developed by OpenU KMI and Springer-Nature.

  • Machine learned
  • Project, Homepage, Downloads, press release
  • The ontology is over-complicated, hopefully they'll convert it to simpe SKOS
  • Touted to have 14k CompSci classes. But more realistically it's 2-3k (the number of clusters)

Relations

  • hierarchical (narrower)
  • lateral (contributesTo)
  • external to DBpedia (sameAs)
  • equivalence cluster (relatedEquivalent)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment