Any newly generated metadata file or JSON Lines (JSON-L) file containing JSTOR or Portico journal content will now include a docSubType field in the metadata of most items (whenever we have it, which we usually do). You can also filter on this field by using the keyword field of the advanced search in the dataset builder.

For example, if you want to research what has been going on in frontmatter over the past couple of decades, you could run this search to build a dataset made up entirely of frontmatter articles:


This field is not normalized, but there is a limited set of frequently used types. Below is a list of the docSubType values in JSTOR that currently have over 500 articles:

docSubType Number of JSTOR articles
research-article 7,070,992
book-review 3,802,198
misc 2,169,329
news 139,885
editorial 18,465
frontmatter 6,409
other 5,996
review-article 3,129
backmatter 3,009
brief-report 1,280
correction 1,227
front-matter 769
Letter 723
discussion 561
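
If you would like a similar tally for your own dataset, here is a minimal Python sketch. It assumes each line of the JSON-L file is one JSON object with a top-level "docSubType" key; the sample records, the "id" key, and the "(missing)" label are made up for illustration, so check them against your actual file.

```python
import json
from collections import Counter
from typing import Iterable


def count_doc_subtypes(lines: Iterable[str]) -> Counter:
    """Tally docSubType values across JSON Lines records."""
    counts: Counter = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        # Assumption: the field is a top-level key named "docSubType".
        # Items without it (or with a null value) share a placeholder label.
        counts[record.get("docSubType") or "(missing)"] += 1
    return counts


# Tiny in-memory sample standing in for a real dataset file (hypothetical records).
sample = [
    '{"id": "a1", "docSubType": "research-article"}',
    '{"id": "a2", "docSubType": "book-review"}',
    '{"id": "a3", "docSubType": "research-article"}',
    '{"id": "a4"}',
]

subtype_counts = count_doc_subtypes(sample)
for subtype, n in subtype_counts.most_common():
    print(f"{subtype}\t{n}")
```

To run it on a real file, pass an open file handle instead of the sample list, for example `with open(path, encoding="utf-8") as f: count_doc_subtypes(f)`. If your download is gzip-compressed, `gzip.open(path, "rt", encoding="utf-8")` should work the same way.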

The Portico content is a little more raucous. Below is the list of the docSubType values in Portico that currently have over 500 articles:

docSubType Number of Portico articles
article 5,571,511
research-article 3,264,221
null 949,701
miscellaneous 625,101
book-review 579,233
other 562,580
scientific 322,344
bookreview 317,160
review-article 293,482
review 265,138
abstract 244,638
letter 240,344
shortcommunication 219,252
editorial 187,547
reviewarticle 181,988
frontmatter 153,160
brief-report 69,254
casestudy 68,429
news 65,165
magazine 59,551
case-report 55,036
commentary 51,812
e-non-article 51,469
erratum 44,216
announcement 37,689
meeting-report 33,453
evaluation 32,157
congress-abstract 31,235
book review 25,329
obituary 24,479
article-commentary 22,351
meetingreport 22,034
research paper 21,601
cover 21,345
correction 18,965
introduction 18,678
general review 18,520
e-review 18,334
journal article 18,230
technicalnote 16,452
product-review 12,363
rapidpublication 11,286
e-conceptual-paper 10,915
index 10,674
cpunit 10,649
books-received 10,600
rapid-communication 10,341
contents 9,989
synfact 8,243
ra 7,525
e-viewpoint 7,191
back-matter 7,009
backmatter 5,793
reviewArticle 5,742
reviews 5,331
e-literature-review 5,160
discussion 5,076
e-technical-paper 4,471
paper 4,383
events 4,300
misc 4,066
case study 3,419
conceptual paper 3,198
authorinstructions 3,032
shortCommunication 2,990
calendar 2,932
cme 2,928
research article 2,853
reply 2,738
viewpoint 2,247
chapter 2,114
in-brief 1,992
bookReview 1,964
front-matter 1,892
note 1,805
oration 1,583
biographicalarticle 1,572
promotional 1,564
original 1,507
caseStudy 1,439
technical paper 1,378
addendum 1,333
dissertation 1,223
clinicalmessage 1,135
debate 1,079
retraction 1,035
literature review 809
partintroduction 776
othertype 766
review article 622
cpappendix 605
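
Because the field is not normalized, the same type shows up under several spellings (for instance book-review, bookreview, book review and bookReview). One possible way to group these in your own analysis is sketched below in Python. The normalization rule (lower-case, then drop spaces, hyphens and underscores) is our assumption for illustration, not something the platform applies, and the function name is hypothetical. The counts are copied from the Portico table above.

```python
import re
from collections import Counter
from typing import Optional


def normalize_subtype(raw: Optional[str]) -> str:
    """Collapse spelling variants of a docSubType into one key.

    Illustrative rule only: lower-case the value and remove spaces,
    hyphens and underscores. Missing values become "null".
    """
    if raw is None:
        return "null"
    return re.sub(r"[\s\-_]+", "", raw.strip().lower())


# A few rows copied from the Portico table above.
portico_counts = {
    "book-review": 579_233,
    "bookreview": 317_160,
    "book review": 25_329,
    "bookReview": 1_964,
    "frontmatter": 153_160,
    "front-matter": 1_892,
}

merged: Counter = Counter()
for subtype, n in portico_counts.items():
    merged[normalize_subtype(subtype)] += n

for subtype, n in merged.most_common():
    print(f"{subtype}\t{n:,}")
```

A rule like this only merges variants that differ in case or punctuation. Pairs such as misc and miscellaneous, or erratum and correction, would still need a hand-built mapping if you decide they mean the same thing for your research.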

Please note that if you are working with a previously created dataset, this data is not included. Just send us your dataset ID and a request, and we’ll delete the files on the system. When you go back to your dataset in the UI, it will rebuild and include this data.

If you use this new feature and data point, let us know!