Data Science Consultant

What is a data science consultant?

A data science consultant is an external person with expertise and experience in data science and AI, who assists companies and organisations to get the best value out of their data. It’s an interesting job with a mix of the technical side (such as data analysis, statistics, and software engineering) and the business side (understanding problems in a particular industry).

A data science consultant is usually hired by an organisation because they needed an external expert with skills that may be hard to source in-house, or they may not have capacity for the data science projects that they want to complete. For example, an in-house data science team may not have enough experience in natural language processing or another niche area of data science and so they may look for a consultancy which specialises in that field, like Fast Data Science, which focuses on NLP.

Many companies are not necessarily tech-focused. In particular medium-sized corporations are increasingly turning to data science consultants to fulfil their needs in this area.

A Venn diagram showing some of the results a data science consultant can achieve for a business

Above: some of the results a data science consultant can achieve for a business

What are some examples of data science projects that a data science consultant would help with?

A data science consultant’s job is varied and no two days are the same. Here are some real examples of data science projects, from Fast Data Science’s case studies:

  1. designing a natural language processing model to identify clinical trial endpoints in a protocol (PDF document). These are sections of the document which could take any form, but generally describe something that will be measured as the main outcome of the trial. For example, “5 year survival”, “response rate”, “QoL”. For more information about our experience in pharmaceuticals, try the Clinical Trial Risk Tool.
  2. Setting up a recommendation system which identifies best points (latitude and longitude) for an offline marketing company so that they would know where to physically station their brand ambassadors to distribute flyers, based on a database of past conversion rates (QR code scans). So the tool could say, “send brand ambassador 23 to the leafleting spot in front of Oxford Circus on Tuesday at 6pm for a 2% conversion rate”.
  3. Building a vector index of longitudinal studies around the UK, so that research psychologists can easily retrieve past studies that measured a particular variable, such as “depression”, “anxiety”, and “PTSD”, even when those were reported using different wording, such as “feeling nervous”.
  4. Analysing the council tax records and housing benefit and universal credit database of a UK local authority, in order to identify persistent non-payers or duplicate claimants.
  5. Analysing the chat logs of a number of UK councils, and setting up an automated system to identify and triage when a user has requested a replacement green bin, brown bin, parking sticker, or is reporting graffiti.

Data science consultant - London

Need to hire a data science consultant?

We have been consulting in data science since 2018. Please get in touch and we can set up a free call with you.

Below you can see a small demo of a clinical trial cost prediction model which Fast Data Science has been working on.

Clinical Trial Cost Estimation

Trial is for condition
Phase
Vaccine trial?
Country
Sample size
Number of interventions
Click button to estimate cost in dollars

Check out our case studies page if you’re feeling inspired!

What kind of data science consulting does Fast Data Science do?

We can help you get value from your unstructured data, such as PDFs that may be floating around your business. We can analyse your data and present it in a jargon-free way using graphical techniques, and move on to building explainable predictive models.

For example, if you want to understand the causes of customer churn or employee turnover, Fast Data Science can identify key correlations and even find causal mechanisms. We can recommend strategic actions (e.g. “send a coupon to customer #2123 on X date”). We can even train and deploy a machine learning model which can highlight any employee or customer at risk.

Our main area of focus at Fast Data Science is natural language processing (NLP), which means that we do a lot of work with unstructured text data (documents in English or other natural languages).

What qualifications should a data science consultant have?

A data science consultant should ideally have qualifications and experience both in machine learning and in the industry that they’re consulting in. A minimum of a Masters degree in data science or a data science adjacent field is a must.

For example, the director of Fast Data Science, Thomas Wood, studied Physics as his undergraduate degree at Durham in the UK, and then completed a Masters in 2008 at Cambridge University in Computer Speech, Text and Internet Technology. Since then, he has been working exclusively in machine learning and data science, with a large number of engagements in particular in the pharmaceutical industry. He has also completed other professional certifications such as the Bond Solon Expert Witness program and the Microsoft Azure Data Scientist Associate certification.

What does a data science consultant do?

Examples of what a data science consultant can do for a business include:

  • Developing an AI strategy, which could involve identifying and prioritising AI initiatives in collaboration with executives and employees in a business. The business strategy should be based on a combination of data-driven and qualitative approaches. The data science consultant should identify strategic blind spots.
  • Helping to collect data and setting up big data pipelines
  • Analysing the data collected
  • Building and training AI models and deploying them and ensuring that they are fully integrated into the client’s technology and products. The data science consultant should not leave the client organisation with a load of proofs of concept which they can’t use.
  • Analyse unstructured data using techniques such as natural language processing
  • Evaluate existing technology such as machine learning models. This could be related to a past or planned merger or acquisition, or the data science consultant may be taking responsibility for tools that were previously developed in house.
  • Taking on data science due diligence or expert witness or expert advisor work.

Data science consultant case studies

Fast Data Science’s data science consultants have built machine learning models and consulted for companies across industries and of all sizes. Below you can see a selection of some of our past consulting engagements, and you can find out more by reading our case studies.

How can you measure the ROI of a data science consultant (return on investment)?

If a data science consultant is hired for a foundational data science project such as proof of concept, data exploration and data governance, or AI strategy consulting, you should agree on a measure of success which reflects the value for the business. Often a proof of concept may not lead to immediate profit, but it will lead to a more powerful production system further down the line, and so it should be viewed as an investment. However, a proof of concept should still work and be of use. The measure of success for an early stage system could be simply that it works with a reasonable level of accuracy, precision, recall, or some other metric, within an acceptable time.

For a project that involves implementing a production system, such as a live customer analytics model, the measure of success could be something as explicit as change in customer lifetime value, increased customer retention, or even impact on turnover. Fully mature data science projects usually lead to a cost saving (for example, fewer staff needed to do the task that is automated), better forecasting, or risk reduction.

Once a mature data science project has been completed, you can calculate the ROI as:

ROI = net benefit from data science project / total cost of project * 100

Since the cost of a data science project for a company with 10,000 customers is not too far from the same project for a company with 100,000 customers, there’s a huge leverage effect when data science is done at scale, and a data science project can pay for itself many times over.

How do data science consultants work?

Above: a video detailing how natural language processing consulting could benefit a business. This is one area of data science consulting which we do at Fast Data Science.

We always begin with the question you want to answer:

Following a rigorous and numbers-based approach, we proceed to find answers to the question using statistics and machine learning algorithms. At all stages we keep you on-board and provide easily interpretable visualisations of our findings, complete documentation, and interactive handover workshops. We work with both traditional machine learning techniques as well as the state of the art such as neural networks. We normally use Python for our data science consulting work, but can adapt to your technology stack. We often develop front ends for any models that we deploy.

We will not use unnecessarily complex or over-hyped models. If another data science consultant tells you they will solve your problem using an LLM connected to Blockchain, come to us instead and we might be able to give you a model you can work with on pen and paper! We prioritise simple models (Occam’s Razor) and make sure that the client is onboard with all the technology used.

What about Generative AI?

We recognise that generative AI is a tool in the toolkit and we will use it for some of our projects with the client’s permission. At present, organisational policies on generative AI vary and there may be restrictions because of either the opaque (non-explainable) way that it generates predictions, as well as objections to the idea of sending data to a third party provider. However, times are changing. But it should be noted that simple problems that require only a regression model can be solved more cheaply and reliably with that regression model. Generative AI is exciting but it isn’t the answer to everything!

Can a data science consultant analyse unstructured data?

In some industries, such as healthcare, insurance, pharmaceuticals and legal, it is common for large amounts of the data to be stored in an unstructured format, for example PDFs, Excels, and other documents, which may be stored on the cloud, on on-premise servers, on employees’ laptops, or even physically. These kinds of documents are particularly difficult to analyse.

Fast Data Science has a proven track record of extracting useful information from gold-mines such as this for a series of high-profile clients. Our area of focus is natural language processing, which allows us to analyse data in free text, or even in a multitude of languages. Imagine you are an executive in an insurance company and you have a collection of a thousand ship inspection reports, which are written in plain text, submitted in Word or PDF form, and by people of different nationalities, in highly technical language. You want to know how many vessels failed the fire safety inspection. Do you:

  1. read the reports back-to-back for a month (or hire a smart graduate to do this)?
  2. hire a data science consultant who can give you a visualisation of key statistics from the reports, and even develop a drag-and-drop tool which lets you drop a PDF or group of PDFs and immediately spits out the vital statistics (inspection result, causes of failure, location, company name, etc)?

Data Science applications in healthcare

Natural Language Processing applications in healthcare Natural Language Processing applications in healthcare

Data science is being increasingly adopted across the healthcare sector. This technology is sometimes called healthtech or MedTech.

As an example, machine learning models can predict doctors’ workforce attrition, patient outcomes, or A&E or emergency room demand. Natural language processing can be used to analyse clinical data in text form and categorise (code) information by MeSH terms.

We have worked on a number of healthcare data science consulting projects, such as:

Data Science technologies at Fast Data Science

The technology we use for your project depends on you and your organisation’s preferences, but all other things being equal, we will choose the simplest tool to answer your questions in an efficient and explainable way.

  • We work with Python as this is the lingua franca of data science, however we can use other languages and frameworks on request.
  • We are happy to use TensorFlow, Hugging Face, spaCy, NLTK, Scikit-Learn, and any of the other widely used machine learning and NLP frameworks.
  • We are happy to work with any of the following cloud providers:
    • Google Cloud Platform
    • AWS
    • Microsoft Azure
  • We can also use cloud machine learning (no-code) interfaces such as Azure ML.
Topic detection is an NLP technique that allows you to discover common themes in a set of unstructured documents.

What value can data science deliver for my business? Isn’t it all hype?

You’ll be pleased to find out that data science is one of a company’s operations whose impact is most easily quantified – after sales, of course!

Taking an example from one of Fast Data Science’s case studies, the data science consultants identified a key pain point in the client’s customer sign-up form which was causing a percentage of the customers to drop out. The consultants could estimate from UX research and prior knowledge that an improvement would result in a 7% increase in sign-ups.

The CEO of the company was sceptical, so the consultants ran an A/B test for two weeks. Half of potential customers were presented with the old form, and half with the new form. After two weeks, the expected increase of 7% did indeed occur. The client company promptly switched the website over completely to the new design.

You may wish to measure the ROI of a data science project in terms of effect on turnover, customer acquisition, customer retention, or EBITDA and profit and loss. Non-monetary metrics include things like customer or employee satisfaction.

How much would your company benefit from a data science consultant identifying quick wins like this?

Examples of past data science consulting projects

Our data science consultants have worked on a number of projects for major companies recently, including:

  • a dashboard to allow members of the public to explore and analyse survey responses for White Ribbon Alliance.
  • a machine learning model to predict the complexity of clinical trials.
  • a model to identify employees at risk of leaving their organisation.
  • a model to predict customer churn.

Please visit our Portfolio for more information, or simply contact us for a chat.

Can I hire an on-demand data science consultant?

Yes, you can enter into a retainer agreement and have a data science consultancy like Fast Data Science on call. If you have need for C-level expertise in data science in your organisation, you can also hire a fractional chief data scientist or head of AI.

What we can do for you

Transform Unstructured Data into Actionable Insights

Contact us