Dataset creation
The better the bot understands its users, the better it will perform its tasks. It is important to train it on a dataset resembling the data the bot will encounter during real communication.
JAICP provides two tools for creating a training dataset directly from the platform interface:
Data labeling: process “raw” logs to extract potential intents and their contents.
Intent fine-tuning: update intents with phrases from previous dialogs with the bot.
To go to dataset creation:
Navigate to a project and select CAILA → Data labeling in the dashboard.
Choose a tool appropriate for your needs: data labeling or intent fine-tuning.
Tip: Use data labeling if you have your own training data. Intent fine-tuning is useful if the bot has already been in operation for some time and has accumulated dialog data.
Depending on your choice, upload a file with the data or select Import from analytics. Now you can start working on the dataset.
Tip: The article on How to train intents contains practical recommendations for creating CAILA classifiers. Keep them in mind as you work on your dataset.
If you have already used data labeling or intent fine-tuning in this project but want to process a new dataset, select New set of phrases after navigating to CAILA → Data labeling.