Planning data science projects is tricky, and NLP projects can be particularly problematic. Based on our past experience, we have shared an interactive tool which you can use for estimating task durations and dependencies for an NLP project.
It generates a graphical Gantt chart for your project, based on the inputs you give it.
Input the parameters of your Natural Language Processing project
Project and organisation level
What is the goal of the project?
Is the client a large organisation with a complex process of procurements, purchase orders, approvals, etc?
Does the project need to be signed off by a separate executive level in the organisation, or in another organisation?
Who will use the model?
Is the text data multilingual?
Does the text data need to be extracted from PDFs or similar?
Do we need to manually annotate data?
Is the text data sensitive?
Must the data remain on the client's servers?
Is there a risk of AI bias, or is AI bias an issue?
Do we need to classify data into more than 10 classes?
Do we need to extract multiple values from text, such as finding percentages, dosages, addresses, names?
Does a gold standard of model performance exist? For example, do human annotators achieve 85% accuracy?
Must a front end program be developed?
Must the model be deployed and integrated into the existing technology stack?
Does the model need to be retrained regularly?
Do we need to make an explainable AI model?
View your NLP project’s Gantt chart
ethics and privacy management
request access to data and systems
kick off meeting
define metrics for success
develop baseline model
develop a series of models in a leaderboard
select best model
develop front end
Getting your Natural Language Processing project off the ground