Data


Description of data fields

General data fields

Display Name API Name Description
PMID pmid PubMed Identifier, an article ID as assigned in PubMed by the National Library of Medicine
DOI doi Digital Object Identifier, if available
Year year Year the article was published
Title title Title of the article
Authors authors List of author names
Journal journal Journal name (ISO abbreviation)
Article is_research_article Flag indicating whether the Publication Type tags for this article are consistent with that of a primary research article
Relative Citation Ratio relative_citation_ratio Relative Citation Ratio (RCR)--OPA's metric of scientific influence. RCR represents the field- and time-normalized citation rate, and is benchmarked to 1.0 for a typical (median) NIH paper in the corresponding year of publication. A paper with an RCR of 1.0 has received the same number of cites/year as the median NIH-funded paper in its field, a paper with an RCR of 2.0 has received twice as many cites/year as the median NIH-funded paper in its field, while an RCR of 0.5 means that it is receiving half as many citations per year. Calculation details are documented in Hutchins et al., PLoS Biol. 2016;14(9):e1002541.
Provisional provisional RCRs for papers published in the previous two years are flagged as "provisional", to reflect that citation metrics for newer articles are not necessarily as stable as they are for older articles. Provisional RCRs are provided for papers published previous year, if they have received with 5 citations or more, despite being, in many cases, less than a year old. All papers published the year before the previous year receive provisional RCRs. The current year is considered to be the NIH Fiscal Year which starts in October. For example, in July 2019 (NIH Fiscal Year 2019), papers from 2018 receive provisional RCRs if they have 5 citations or more, and all papers from 2017 receive provisional RCRs. In October 2019, at the start of NIH Fiscal Year 2020, papers from 2019 receive provisional RCRs if they have 5 citations or more and all papers from 2018 receive provisional RCRs.

Influence module

Display Name API Name Description
Total Citations citation_count Number of unique articles that have cited this one
Citations Per Year citations_per_year Citations per year that this article has received since its publication. If this appeared as a preprint and a published article, the year from the published version is used as the primary publication date. This is the numerator for the Relative Citation Ratio.
Field Citation Rate field_citation_rate Measure of the intrinsic citation rate of this paper's field, estimated using its co-citation network.
Expected Citation Rate expected_citations_per_year Citations per year that NIH-funded articles, with the same Field Citation Rate and published in the same year as this paper, recieve. This is the denominator for the Relative Citation Ratio.
NIH Percentile nih_percentile Percentile rank of this paper's RCR compared to all NIH publications. For example, 95% indicates that this paper's RCR is higher than 95% of all NIH funded publications.

Translation Module

Display Name API Name Description
Human human Fraction of MeSH terms that are in the Human category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)
Animal animal Fraction of MeSH terms that are in the Animal category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)
Mol/Cell molecular_cellular Fraction of MeSH terms that are in the Molecular/Cellular Biology category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)
xCoord x_coord X coordinate of the article on the Triangle of Biomedicine
yCoord y_coord Y Coordinate of the article on the Triangle of Biomedicine
Clinical is_clinical Flag indicating that this paper meets the definition of a clinical article.
Cited By Clin. cited_by_clin PMIDs of clinical articles that this article has been cited by.
APT apt Approximate Potential to Translate is a machine learning-based estimate of the likelihood that this publication will be cited in later clinical trials or guidelines.

Open Citations Module

Display Name API Name Description
Cited By PMIDs cited_by PMIDs of articles that have cited this one.
References references PMIDs of articles in this article's reference list.

Definitions for derived data

Article (is_research_article)

Flag indicating whether the Publication Type tags for this article are consistent with that of a primary research article. To be counted as an Article, a paper's Publication Types must include at least one Type from the qualifying list (e.g. Journal Article or Clinical Trial) and cannot contain any from the disqualifying list (e.g. Editorial or Review):

Publication Type Qualifying/Disqualifying
Case Reports Qualifying
Classical Article Qualifying
Clinical Trial Qualifying
Clinical Trial, Phase I Qualifying
Clinical Trial, Phase II Qualifying
Clinical Trial, Phase III Qualifying
Clinical Trial, Phase IV Qualifying
Comparative Study Qualifying
Controlled Clinical Trial Qualifying
Corrected and Republished Article Qualifying
Dataset Qualifying
Duplicate Publication Qualifying
Evaluation Studies Qualifying
Journal Article Qualifying
Letter Qualifying
Meta-Analysis Qualifying
Multicenter Study Qualifying
Observational Study Qualifying
Pragmatic Clinical Trial Qualifying
Preprint Qualifying
Randomized Controlled Trial Qualifying
Technical Report Qualifying
Twin Study Qualifying
Validation Studies Qualifying
Addresses Disqualifying
Autobiography Disqualifying
Bibliography Disqualifying
Biography Disqualifying
Clinical Conference Disqualifying
Comment Disqualifying
Congresses Disqualifying
Consensus Development Conference Disqualifying
Consensus Development Conference, NIH Disqualifying
Dictionary Disqualifying
Directory Disqualifying
Editorial Disqualifying
Festschrift Disqualifying
Government Publications Disqualifying
Guideline Disqualifying
Historical Article Disqualifying
Interactive Tutorial Disqualifying
Interview Disqualifying
Introductory Journal Article Disqualifying
Lectures Disqualifying
Legal Cases Disqualifying
Legislation Disqualifying
News Disqualifying
Newspaper Article Disqualifying
Overall Disqualifying
Patient Education Handout Disqualifying
Periodical Index Disqualifying
Personal Narratives Disqualifying
Portraits Disqualifying
Practice Guideline Disqualifying
Published Erratum Disqualifying
Retracted Publication Disqualifying
Retraction of Publication Disqualifying
Review Disqualifying
Scientific Integrity Review Disqualifying
Webcasts Disqualifying

Based on manual curation of a random sample of articles published in 2013, this classification strategy yields 92% accuracy. 8% of 'article' assignments on average were incorrect, while 5% of 'non-article' assignments were incorrect. Most errors tend to occur in journals that are not indexed in MedLine, suggesting that persons whose publication records are concentrated in non-MedLine journals may have disproportionately high error rates in this classification schema.


Clinical (is_clinical)

Flag indicating whether the Publication Type tags for this article are consistent with that of a clinical article. Publication Types must include at least one Type from the following list:

  • Clinical Study
  • Clinical Trial
  • Clinical Trial, Phase I
  • Clinical Trial, Phase II
  • Clinical Trial, Phase III
  • Clinical Trial, Phase IV
  • Adaptive Clinical Trial
  • Controlled Clinical Trial
  • Randomized Controlled Trial
  • Clinical Trial Protocol
  • Observational Study
  • Guideline
  • Practice Guideline

MeSH addresses for paper categories in the Translation Module

These definitions have been described in two previous papers. Note that due to annual updates to the MeSH Tree, current definitions may not align perfectly with the MeSH addresses described in the papers, which used MeSH trees from specific fiscal years.


Human
  • M01 (Persons)
  • B01.050.150.900.649.313.988.400.112.400.400 (Human)

Animal
  • B01 (Eukaryota) not B01.050.150.900.649.313.988.400.112.400.400 (Human)

Molecular/Cellular Biology
  • A11 (Cells)
  • B02 (Archaea)
  • B03 (Bacteria)
  • B04 (Viruses)
  • G02 (Chemical Phenomena)
  • G04 (Cell Physiological Phenomena)

Diseases
  • C (Diseases) not C22 (Animal Diseases)

Analytical, Diagnostic and Therapeutic Techniques
  • E (Analytical, Diagnostic and Therapeutic Techniques, and Equipment) not E07 (Equipment and Supplies)

Chemicals and Drugs
  • D (Chemicals and Drugs)