Publication announced

· Thomas Wood
Publication announced

Our NLP research has been published in BMC Psychiatry!

Fast Data Science has been working on the Harmony project in collaboration with UCL, Ulster University, and Universidade Federal de Santa Maria. Harmony is an open-source natural language processing tool designed to revolutionise mental health research as well as an AI research project.

We are pleased to announce the publication of a paper coming out from this collaboration, Using natural language processing to facilitate the harmonisation of mental health questionnaires: a validation study using real-world data, authored by Eoin McElroy, Thomas Wood, Raymond Bond, Maurice Mulvenna, Mark Shevlin, George B. Ploubidis, Mauricio Scopel Hofmann and Bettina Moltrecht, and published in BMC Psychiatry, which is part of Springer Nature. Our paper demonstrates Harmony’s ability to predict correlations in mental health questionnaires - how the cosine scores coming out of Harmony relate to real-world data correlations.

Open access natural language processing paper

Read BMC paper

Our publication is open access. Click to read online or download as PDF.

You can try Harmony at https://harmonydata.ac.uk

🤷 What does Harmony do?

👉 Psychologists and social scientists often have to match items in different questionnaires, such as ““I often feel anxious”” and ““Feeling nervous, anxious or afraid””.

👉 This is called harmonisation.

👉 Harmonisation is a time consuming and subjective process.

👉 Going through long PDFs of questionnaires and putting the questions into Excel is no fun.

✅ Enter Harmony, a tool that uses natural language processing and generative AI models to help researchers harmonise questionnaire items, even in different languages.

Harmony addresses a critical challenge in psychology and social sciences: harmonising text data from diverse questionnaires often found in PDF format. By leveraging AI and natural language processing, Harmony efficiently compares and aligns questionnaire items, enabling researchers to conduct longitudinal studies using datasets from multiple studies. This groundbreaking tool is helping to unlock the potential of existing data and accelerate mental health research.

Funded by Wellcome and UKRI Economic and Social Research Council, Harmony is freely available for researchers to download and customize. Join the open-source community and help shape the future of mental health research.

Unlock Your Future in NLP!

Dive into the world of Natural Language Processing! Explore cutting-edge NLP roles that match your skills and passions.

Explore NLP Jobs

Clinical AI Interest Group at Alan Turing Institute

Clinical AI Interest Group at Alan Turing Institute

Thomas Wood presents the Clinical Trial Risk Tool before the November meeting of the Clinical AI Interest Group at Alan Turing Institute The Clinical AI Interest group is a community of health professionals from a broad range of backgrounds with an interest in Clinical AI, organised by the Alan Turing Institute.

Fast Data Science at Ireland's Expert Witness Conference on 20 May 2026
Legal aiGenerative ai

Fast Data Science at Ireland's Expert Witness Conference on 20 May 2026

Fast Data Science will appear at Ireland’s Expert Witness Conference on 20 May 2026 in Dublin On 20 May 2026, La Touche Training is running the Expert Witness Conference 2026, at the Radisson Blu Hotel, Golden Lane, Dublin 8, Ireland. This is a full-day event combining practical workshops and interactive sessions, aimed at expert witnesses and legal professionals who want to enhance their expertise. The agenda covers critical topics like recent developments in case law, guidance on report writing, and techniques for handling cross-examination.

Using Natural Language Processing (NLP) to predict the future
Ai for businessNatural language processing

Using Natural Language Processing (NLP) to predict the future

Guest post by Alex Nikic In the past few years, Generative AI technology has advanced rapidly, and businesses are increasingly adopting it for a variety of tasks. While GenAI excels at tasks such as document summarisation, question answering, and content generation, it lacks the ability to provide reliable forecasts for future events. GenAI models are not designed for forecasting, and along with the tendancy to hallucinate information, the output of these models should not be trusted when planning key business decisions. For more details, a previous article on our blog explores in-depth the trade-offs of GenAI vs Traditional Machine Learning approaches.

What we can do for you

Transform Unstructured Data into Actionable Insights

Contact us