Dataset size limits and performance

This topic describes the dataset size limits and expected performance of each Arria for MicroStrategy feature.

NLG Apps

NLG Apps generate narratives describing a subset of dimensions and measures (data fields) that you select from your data. The more targeted the dataset is, the more pertinent the narrative will be.

NLG Apps perform best when the size of the dataset sent to the NLG Apps service (i.e., the data payload) is around 2MB or less.

For these reasons, we recommend that you select only those dimensions and measures required for the particular narrative you wish to generate. For example, for a time-based variance analysis of sales drilled down by product and segment, you might select one time dimension, the product and segment dimensions, and the sales measure. This results in a smaller, aggregated dataset.
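To illustrate how field selection affects payload size, the following Python sketch compares the serialized size of a small dataset before and after dropping fields the narrative does not need. It is only a rough stand-in: the sample data, field names, and helper functions are hypothetical, JSON is an assumed serialization (the actual format sent to the NLG Apps service may differ), and "around 2MB" is taken as 2,000,000 bytes.

```python
import json

# "Around 2MB", taken here as decimal megabytes (an assumption).
RECOMMENDED_MAX_BYTES = 2 * 1_000_000


def select_fields(rows, fields):
    """Return copies of the rows that keep only the named fields."""
    return [{name: row[name] for name in fields} for row in rows]


def payload_size_bytes(rows):
    """Approximate payload size: the rows serialized as UTF-8 JSON.

    JSON is an assumed stand-in for the real wire format.
    """
    return len(json.dumps(rows).encode("utf-8"))


# Hypothetical sample data with two fields the narrative does not need.
full = [
    {
        "Month": f"2023-{month:02d}",
        "Product": product,
        "Segment": segment,
        "Sales": 1000.0 + 10 * month,
        "Region": "North",
        "Comment": "not needed for this narrative",
    }
    for month in range(1, 13)
    for product in ("Widget", "Gadget")
    for segment in ("Retail", "Wholesale")
]

# Keep one time dimension, the two drill-down dimensions, and the measure.
needed = select_fields(full, ["Month", "Product", "Segment", "Sales"])

full_size = payload_size_bytes(full)
needed_size = payload_size_bytes(needed)
print(f"all fields: {full_size} bytes, selected fields: {needed_size} bytes")
print("within recommendation:", needed_size <= RECOMMENDED_MAX_BYTES)
```

In practice you select fields in MicroStrategy itself; the sketch only shows why fewer fields lead to a smaller payload.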

Arria Answers

Arria Answers allows you to gain insights into a subset of dimensions and measures (data fields) that you select from your data, according to the questions you ask.

Arria Answers performs best when the size of the dataset sent to the Arria Answers service (i.e., the data payload) is around 5MB or less. Performance will vary based on dataset size.

For this reason, we recommend that you select only those dimensions and measures required to answer your queries.

Custom Narratives

Various factors affect the time it takes to generate custom narratives — mainly network speed, the complexity of the Studio project, and the size of the data payload sent to the Custom Narratives service.

For example, assuming download/upload speeds of 48MBps/3MBps:

  • generating a custom narrative from a dataset of around 10,000 rows and 8 columns (data payload approx. 1MB) may take up to 4 seconds.

  • generating a custom narrative from a dataset of around 100,000 rows and 8 columns (data payload approx. 10MB) may take up to 33 seconds.
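
As a rough illustration of the figures above, both examples work out to about 12.5 bytes per cell if 1MB is taken as 1,000,000 bytes (an assumption). The Python sketch below derives that ratio and uses it to estimate a payload from row and column counts. The function names are hypothetical, and real payloads depend on the data types and values in your dataset, so treat the result as a ballpark only.

```python
BYTES_PER_MB = 1_000_000  # assumption: decimal megabytes


def bytes_per_cell(rows, cols, payload_mb):
    """Average bytes per cell implied by a payload of a given size."""
    return payload_mb * BYTES_PER_MB / (rows * cols)


def estimate_payload_mb(rows, cols, per_cell=12.5):
    """Estimate payload size in MB, assuming a fixed average cell size."""
    return rows * cols * per_cell / BYTES_PER_MB


# The two examples from the list above imply the same ratio.
small = bytes_per_cell(10_000, 8, 1)
large = bytes_per_cell(100_000, 8, 10)
print(small, large)

# Ballpark payload for a hypothetical dataset of 20,000 rows and 8 columns.
print(estimate_payload_mb(20_000, 8), "MB")
```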