The Test Set Insights in Botium provide a rough estimation on the quality of your training data. A Static Analysis is done, which means that the data is not actually evaluated with an external chatbot engine like IBM Watson or Dialogflow, but just with an integrated state-of-the-art generic language model. Nevertheless, it provides some good first insights what might be wrong with your data.
Test Set Insights vs Botium NLP
Test Set Insights provide:
-
static analysis against an in-memory generic language model
-
quick evaluation
-
provides metrics on the data itself, the keywords and the similarity
Botium provides:
-
evaluation against your chatbot engine of choice
-
possible long-term execution sessions required
-
provides metrics on how the data behaves in real-life
Test Data vs Training Data
Generally spoken, you should use all the training data available for getting the most out of the test set insights. Doing this kind of evaluation on test data only will not yield good results.
Enable Botium Worker
You have to enable the optional component Botium Worker for using the test set insights. In docker-compose.yml or docker-compose-standalone.yml:
-
Uncomment the environment variable COACH_WORKER_ENDPOINT
-
Uncomment the coach container section to enable the container
COACH_WORKER_APIKEY
Using Test Set Insights
The test set insights are available in the Insights section of the test set. You can
see descriptions of the available metrics and the history there.