Pharma – PubMed authorship analysis

30 million citations for biomedical literature

5,725,819 articles available (2019)

Released in 1996

Authorship Analysis for PubMed

For one client in the pharma industry, we developed a cross-platform desktop tool that allows a user to import search results from PubMed for a particular term and process them into knowledge graphs.

Fast Data Science - London

The output of the tool can be combined with other data sources, such as conference programmes, to produce a ranking of key opinion leaders (KOLs) in a particular sub-field of pharma. Pharma companies will contact key opinion leaders with proposals to run clinical trials.

The tool uses natural language processing and PubMed’s MeSH tags to identify the most prominent researchers in each sub-field, quantify how active they are, and determine where they are located geographically. Although medical literature is generally tagged with MeSH terms, a lot of the relevant information is found only in a paper’s abstract or full text, so a sophisticated bespoke natural language processing algorithm was necessary to extract the relevant data.
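As a rough illustration of the structured part of this step, the sketch below reads authors, affiliations and MeSH descriptors from a PubMed-style XML export. It is not the client tool's code: the sample record is invented, `parse_records` is a hypothetical helper, and the element names (`MedlineCitation`, `AuthorList`, `AffiliationInfo`, `MeshHeading`, `DescriptorName`) are assumed to match the PubMed XML format. The bespoke NLP applied to abstracts and full text is not shown.

```python
import xml.etree.ElementTree as ET

# Invented sample record, laid out like a PubMed XML export (assumption).
SAMPLE_XML = """<PubmedArticleSet>
  <PubmedArticle>
    <MedlineCitation>
      <PMID>00000001</PMID>
      <Article>
        <ArticleTitle>An invented title used only for illustration</ArticleTitle>
        <AuthorList>
          <Author>
            <LastName>Smith</LastName>
            <ForeName>Jane</ForeName>
            <AffiliationInfo>
              <Affiliation>Example University, London, UK</Affiliation>
            </AffiliationInfo>
          </Author>
          <Author>
            <LastName>Lee</LastName>
            <ForeName>Kim</ForeName>
          </Author>
        </AuthorList>
      </Article>
      <MeshHeadingList>
        <MeshHeading><DescriptorName>Neoplasms</DescriptorName></MeshHeading>
        <MeshHeading><DescriptorName>Clinical Trials as Topic</DescriptorName></MeshHeading>
      </MeshHeadingList>
    </MedlineCitation>
  </PubmedArticle>
</PubmedArticleSet>"""


def parse_records(xml_text):
    """Return one dict per citation with its PMID, authors and MeSH terms."""
    root = ET.fromstring(xml_text)
    records = []
    for citation in root.iter("MedlineCitation"):
        authors = []
        for author in citation.iter("Author"):
            # Join whichever name parts are present; either may be missing.
            parts = [author.findtext("ForeName"), author.findtext("LastName")]
            name = " ".join(p for p in parts if p)
            # findtext returns None when the author has no affiliation.
            affiliation = author.findtext("AffiliationInfo/Affiliation")
            authors.append({"name": name, "affiliation": affiliation})
        mesh_terms = [d.text for d in citation.iter("DescriptorName")]
        records.append({
            "pmid": citation.findtext("PMID"),
            "authors": authors,
            "mesh_terms": mesh_terms,
        })
    return records


records = parse_records(SAMPLE_XML)
```

Affiliation strings such as the one above are free text, so turning them into a geographic location would need a further step, for example matching against a list of place names.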

Using this data, the client was able to generate graphs of researchers’ collaborations and rank researchers by various metrics, allowing the company to target researchers effectively for potential collaborations.
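A collaboration graph and a simple ranking of this kind can be sketched as follows. This is a minimal example under stated assumptions, not the metrics the tool actually used: the `papers` records and author names are made up, `build_coauthor_graph` and `rank_authors` are hypothetical helpers, and the ranking (paper count first, then number of distinct collaborators) is only one of many possible choices.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Made-up records standing in for parsed PubMed search results.
papers = [
    {"pmid": "1", "authors": ["Smith J", "Lee K", "Patel R"]},
    {"pmid": "2", "authors": ["Smith J", "Lee K"]},
    {"pmid": "3", "authors": ["Patel R", "Garcia M"]},
]


def build_coauthor_graph(papers):
    """Count, for each pair of authors, how many papers they share."""
    edges = Counter()
    for paper in papers:
        # Sort so each pair is always stored in the same order.
        authors = sorted(set(paper["authors"]))
        for a, b in combinations(authors, 2):
            edges[(a, b)] += 1
    return edges


def rank_authors(papers, edges):
    """Order authors by paper count, then distinct collaborators, then name."""
    paper_counts = Counter(a for p in papers for a in set(p["authors"]))
    collaborators = defaultdict(set)
    for a, b in edges:
        collaborators[a].add(b)
        collaborators[b].add(a)
    return sorted(
        paper_counts,
        key=lambda a: (-paper_counts[a], -len(collaborators[a]), a),
    )


edges = build_coauthor_graph(papers)
ranking = rank_authors(papers, edges)
print(ranking)
```

The edge weights give the strength of each collaboration, so the same `edges` mapping can feed a graph drawing or a more elaborate centrality measure.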
