Natural Language Processing Data Scientist
Does your company have a large amount of unstructured data, such as unorganised documents? Consider hiring an NLP data scientist to help you extract value from it. Fast Data Science is a data science consultancy offering NLP consulting services.
At Fast Data Science we have a number of data scientists in our team, and our main focus is natural language processing (NLP). The manager, Thomas Wood, studied a Masters in 2008 at Cambridge University in an area of NLP, Computer Speech, Text and Internet Technology, and conducted his research project on pleonastic pronouns using unsupervised learning. Since completing his postgraduate studies he has worked exclusively in data science, maintaining a constant focus on NLP, although he has occasionally worked in computer vision and other areas of data science, including a stint consulting for Tesco, predicting customer purchases. The numerical techniques he has learnt in other disciplines of data science have been incredibly useful in NLP. For example, convolutional neural networks were designed to process image data, but have found a niche for building text classifiers as well as music recommendation systems.
Thomas Wood founded Fast Data Science Ltd in 2018 to deliver data science consultancy focussing on natural language processing problems in large organisations that deal with lots of text data, such as healthcare, pharma, insurance and legal.
A good NLP data scientist is able to perform generalist non-NLP work, such as build a product recommendation system, as well as handle text data. Our team of NLP data scientists has built NLP pipelines from scratch. We have worked on natural language dialogue systems, document classifiers and text-based recommender systems. We use both traditional data science techniques as well as the state of the art NLP data science toolkit which includes neural networks. Python is the tool of choice for an NLP data scientist, due to its abundance of NLP and deep learning libraries – although any language can be used in principle.