This describes datasets for scientific topic classification. The list was developed as part of work on the project Tracking of Research Results for EC DG RTD.
MAG fields of study (FOS): developed using artificial intelligence and semantic understanding of content.
- 229k FOS, organized in a non-mutually exclusive hierarchy with 19 top-level FOS.
- View
- Downloadable as CSV tables from Azure. Relational schema
- FieldsOfStudy
- FieldsOfStudyChildren
- RelatedFieldOfStudy
- includes Rank (just like papers, authors, orgs)
- RDF dump from ma-graph.org. RDF schema
- MAG applies FOS to papers, journals, authors, orgs
Description. Released on 31 March 2008.
- The prev version (1998) was called Research Fields, Courses and Disciplines (RFCD)
- Digital-Science Dimensions: "The leading categorization system with broad coverage of subject areas and a large general corpus of training material is ANZSRC FOR. This classification “lens” has been made available as part of the free Dimensions version"
Includes these classifications:
- Fields of Research (FOR). Appendix 1 FOR Fields by Code Number
- 1419 nodes at 3 levels: 22 Divisions, 157 Groups, 1340 Fields.
- Eg 01 MATHEMATICAL SCIENCES> 0105 MATHEMATICAL PHYSICS> 010501 Algebraic Structures in Mathematical Physics
- Socio-economic Objective (SEO). Appendix 2 SEO Objectives by Code Number
- 991 nodes at 4 levels: 5 Sectors, 17 Divisions, 119 Groups, 850 Objectives
- eg SECTOR B ECONOMIC DEVELOPMENT > DIVISION 82 PLANT PRODUCTION AND PLANT PRIMARY PRODUCTS > GROUP 8201 FORESTRY> 820102 Harvesting and Transport of Forest Products
- Type of Activity (TOA): where is it?
- 335 nodes at 2 levels: 4 subjects, 331 codes.
- The top level is very flat: only Life Sciences, Health Sciences, Social Sciences & Humanities, Physical Sciences (eg both Software and Earth and Planetary Sciences are in Physical Sciences)
- All Science Journal Classification Codes (ASJC) are similar
- Table, CSV
- Used in CrossRef "Category labels come from this list published by Scopus"
- Discussion: CrossRef/rest-api-doc#422
- Description, CSV
- 3 levels and 180 values
- Includes a list of journals with classification
cordisref-sicCode.csv. About 20-30 topic areas.
Eg WAS is "Waste Management" and includes the following keywords (sub-categories?)
- recycling
- recovery and reclamation
- incineration and pyrolysis
- land and sea disposal
- bioconversion
- landfill
- industrial waste
- energy from waste
Developed by OpenU KMI and Springer-Nature.
- Machine learned
- Project, Homepage, Downloads, press release
- The ontology is over-complicated, hopefully they'll convert it to simpe SKOS
- Touted to have 14k CompSci classes. But more realistically it's 2-3k (the number of clusters)
Relations
- hierarchical (narrower)
- lateral (contributesTo)
- external to DBpedia (sameAs)
- equivalence cluster (relatedEquivalent)