Case Studies

Data Science Consulting
Case Studies
Interpreting Land Titles in Land Registry using Natural Language Processing

Interpreting Land Titles in Land Registry using Natural Language Processing

A national Land Registry hired us to use NLP to interpret land title deeds, which are written in unstructured legal language.

Using NLP to predict customer escalation

Using NLP to predict customer escalation

As part of an AI strategy engagement, we explored the potential for NLP and machine learning for a Canadian housing regulator

Drug named entity recognition Python library

Drug named entity recognition Python library

Recognising drug names in unstructured English text with Python We have open-sourced a Python library called Drug Named Entity Recognition for finding drug names in a string.

Open Source Tools for Natural Language Processing

Open Source Tools for Natural Language Processing

Open source projects (MIT license) We have participated in two externally projects which produced open-source code and data, which are available to the public for personal and commercial use.

Country named entity recognition Python library

Country named entity recognition Python library

Recognising country names in unstructured English text with Python We have open-sourced a Python library called Country Named Entity Recognition for finding country names in a string.

Harmony (Wellcome Data Prize in Mental Health entry)

Harmony (Wellcome Data Prize in Mental Health entry)

Harmony is an open source NLP-driven data harmonisation tool developed for the Wellcome Data Prize. What does Harmony do? Psychologists and social scientists often have to match items in different questionnaires, such as “I often feel anxious” and “Feeling nervous, anxious or afraid”.

Clinical Trial Risk Tool

Clinical Trial Risk Tool

Machine learning in clinical trials: We developed a clinical trial risk assessment tool using Natural Language Processing for the Gates Foundation to assist experts to estimate the risk of a clinical trial ending uninformatively.

Machine Learning drag-and-drop GUI Dashboard - Office of Rail and Road

Machine Learning drag-and-drop GUI Dashboard - Office of Rail and Road

Building a machine learning GUI for the Office of Rail and Road The Office of Rail and Road (ORR) is the British national rail regulator, responsible for health and safety on mainline rail, the London Underground, light rail, and trams.

Causal machine learning for Skills Development Scotland

Causal machine learning for Skills Development Scotland

Analysing employment and education outcomes using machine learning and causality models Skills Development Scotland (SDS) is the national body in Scotland which supports people to develop and apply their skills.

Boehringer Ingelheim – NLP clustering on factory error reports

Boehringer Ingelheim – NLP clustering on factory error reports

How we used a Natural Language Processing clustering algorithm to assist pharmaceutical company Boehringer Ingelheim to gain insights into and discover topics in their manufacturing processes.

Information Commissioner's Office - ML email classification model

Information Commissioner's Office - ML email classification model

ICO: Email classification The Information Commissioner’s Office (ICO) is the public body which is responsible for regulating data protection in the UK.

CBT Clinics - Counsellors, Psychiatrists and Therapists Recruitment

CBT Clinics - Counsellors, Psychiatrists and Therapists Recruitment

CBT Clinics: Counsellors, Psychiatrists and Therapists Recruitment CBT Clinics is a UK-based company offering mental healthcare practitioners. They have a roster of counsellors, psychiatrists and therapists, and wanted to expand to recruit more clinicians and also to understand the nuances of the counselling and therapy market in the UK.

Past clients of Fast Data Science

We work with clients all over the world, although the majority of our clients are in the UK, followed by the USA and the rest of Europe.

Industry expertise

We have focused on healthcare and pharmaceuticals but are open to working in a range of industries.

Consulting case studies at Fast Data Science

Some of the projects we have worked on in the past include:

  • A dashboard allowing members of the public to explore survey responses, which have been automatically categorised using machine learning, for White Ribbon Alliance. This dashboard was presented to the United Nations in 2021.
  • An unsupervised learning model to identify recurring topics and errors in the manufacturing and supply chain processes for Boehringer Ingelheim. The errors were written in plain English or the local language of each facility.
  • A predictive model in Microsoft Azure ML which identified which junior doctors (interns/residents) at the UK’s National Health Service (NHS) are at risk of leaving the organisation.
  • A deep learning model, also in Azure ML, to categorise emails from customers for the Information Commissioner’s Office.
  • A neural network based model to extract structured data and statistics from clinical trial protocols, also for Boehringer Ingelheim.
  • A predictive model using neural networks to deduce attributes of jobseekers’ CVs, deployed on the website of CV-Library.
  • A model that predicts customers’ online purchase amounts, for the British supermarket chain Tesco.

Interactive graph of past clients

In our interactive graph you can view and explore where our clients are from and what industries they are in.

More case studies

  • A recommender system to recommend jobs to candidates for CV-Library.
  • A model to predict the unloading time of vehicles, used to improve accuracy of logistics planning for grocery deliveries, also for Tesco.
  • A convolutional neural network based face recognition system, built for Android, iOS and desktop apps and used for biometric security.
  • A voice controlled smart home application.

What we can do for you

Transform Unstructured Data into Actionable Insights

Contact us