Configure Data Summarization

Data summarization refers to the process of creating a concise version of raw data and field data. Data can be summarized as a function of time or when a change occurs. You can improve the performance and disk space utilization of the Paragon Insights (formerly HealthBot) time series database (TSDB) by configuring data summarization methods to summarize the raw data and field data collected by Paragon Insights.

Paragon Insights collects data by using push or pull data collection methods. You can create Paragon Insights rules or use the available predefined rules to determine how and when data is collected. This collected telemetry data provides information about the state of network devices and its components. For more information on data collection methods, see Paragon Insights Data Ingest Guide.

You can create a raw data summarization profile to improve the performance and disk space utilization of the TSDB. Starting with Paragon Insights Release 4.0.0, you can create a rollup summarization profile to summarize processed data that is stored in fields in the TSDB. Field data is processed data that is stored in fields in the TSDB. A field is a single piece of information that forms a record in a database. In TSDB, multiple fields of processed data make a record. In releases earlier than Paragon Insights Release 4.0.0, only raw data can be summarized.

Table 1 provides a list of the supported data summarization algorithms and a description of their output:

Table 1: Descriptions of the Data Summarization Algorithms
Algorithm	Description of output
Latest	Value of the last data point collected within the time span.
Count	Total number of data points collected within the time span.
Mean	Average value of the data points collected within the time span.
Min	Minimum value of the data points collected within the time span.
Max	Maximum value of the data points within the time span.
On-change	Value of the data point whenever the value is different from the previous data point (occurs independently from the user-defined time span).
Stddev	Standard deviation of the data points collected within the time span.
Sum	Sum of the data points collected within the time span.

If no summarization algorithm is associated with the data, the following algorithms are used by default:

Data type	Data summarization algorithm
Float, integer, unsigned	Mean
Boolean, string	On-change

You can use data summarization profiles to apply specific summarization algorithms to raw data and field data collected by Paragon Insights for a specific device group:

These topics provide instructions on how to create a data summarization profile.

Creating a Raw Data Summarization Profile
Creating a Data Rollup Summarization Profile

After you have created a data summarization profile, you can apply the profile to a device group. For more information, see Applying Data Summarization Profiles to a Device Group.

Creating a Raw Data Summarization Profile

To create a raw data summarization profile that can be applied to a device group:

Click Settings > Summarization Profiles link in the left-nav bar.
Select Raw Data from the Summarization Profiles list.

The Raw Data Summarization Profiles page is displayed.
Click (+) icon to add a summarization profile.

The Add Raw Data Summarization Profile page is displayed.
In the Name text box, enter the name of the profile.
Click Add Type Aggregate to add an aggregate type.

The Name and Function drop-down lists are displayed.

Follow these steps to select a name data type and associate it with a data summarization algorithm.

The algorithm configured for a specific sensor path name overrides the algorithm configured for the corresponding data type.
1. Select a name data type from the Name drop-down list.
  
  The available name data types to choose from are string, integer, boolean, float, and unsigned integer.
  
  Starting in Paragon Insights Release 4.2.0, you can also select unsigned integer as a name data type. An unsigned integer is a data type that can contain values from 0 through 4,294,967,295.
2. After you have selected a name data type, you associate it with a data summarization function.
  
  To associate a name data type with a data summarization function, select a function from the Functions drop-down list.
  
  The available functions to choose from are latest, count, mean, min, max, on-charge, stddev, and sum.
3. (Optional) To add another aggregate type, click Add Type Aggregate, and repeat step 5.a and step 5.b.

Click Add Path Aggregate to add an aggregate path.

The Name and Function drop-down lists are displayed.

To assign a sensor path name and associate it with a data summarization algorithm:

Note:

The algorithm configured for a specific sensor path name overrides the algorithm configured for the corresponding data type.

Enter a sensor path name in the Name text box.

You can enter a path name for a sensor that is not supported by Paragon Insights. For sensors supported by Paragon Insights, the path name must be entered in the following format:

Sensor	Path Name Format	Example
Open Config	sensor-path	/components/component/name
Native GPB	sensor-name:sensor-path	jnpr_qmon_ext:queue_monitor_element_info.percentage
iAgent	yaml-table-name:sensor-path	REutilizationTable:15_min_cpu_idle
SNMP	snmp-table-name:sensor-path	.1.3.6.1.2.1.2.2:jnxLED1Index ospfNbrTable:ospfNbrIpAddr
Syslog	pattern-set: sensor-path	interface_link_down:operational-status
Flow (NetFlow)	template-name:sensor-path	hb-ipfix-ipv4-template:sourceIPv4Address

After you have entered a sensor path name, you associate it with a data summarization function.

To associate a sensor path name with a data summarization function, select a function from the Functions drop-down list.

The available functions to choose from are latest, count, mean, min, max, on-charge, stddev, and sum.
(Optional) To add another aggregate path, click Add Path Aggregate, and repeat step 6.a and step 6.b.

Click Save to only save the configuration.

Click Save & Deploy to save and immediately deploy the configuration.
You can now apply the raw data summarization profile that you created to a specific device group. For more information, see Applying Data Summarization Profiles to a Device Group.

Creating a Data Rollup Summarization Profile

Paragon Insights Release 4.0.0 supports data rollup summarization. You can create a rollup summarization profile to summarize processed data that is stored in fields in the TSDB. Field data is processed data that provides information on network devices and its components, and is stored in fields in the TSDB. A field is a single piece of information that forms a record in a database. In TSDB, multiple fields of processed data make a record. Data rollup summarization enables efficient data storage and also ensures retaining of data for a longer duration.

You can create a data rollup summarization profile to apply to a device group from the:

Paragon Insights graphical user interface (GUI)
Command line interface (CLI)

Create a Data Rollup Summarization Profile by using Paragon Insights UI

To create a data rollup summarization profile:

Click Settings > Summarization Profiles link in the left-nav bar.
Select Data Rollup from the Summarization Profiles list.

The Data Rollup Summarization Profiles page is displayed.
Click (+) icon to add a summarization profile.

The Add Data Rollup Summarization Profile page is displayed.
Enter the name of the profile in the Name text box.

The maximum length is 64 characters.

Regex pattern: “[a-zA-Z][a-zA-Z0-9_-]*”
Click Add Rule to add an existing Paragon Insights rule for which rollup summarization must be applied.

The Name and Apply on Existing Data drop-down lists are displayed.

Follow these steps to select a rule, and to apply the rule to the profile. You can also apply the rule to existing data.
1. Select an existing Paragon Insights from the Name drop-down list.
2. To apply the rule that you selected to existing data, select True from the Apply on Existing Data drop-down list.
  
  The default value is False.
3. Click Add Field to define fields of the rule configured for which data rollup summarization must be applied.
  
  To associate a field to an aggregate function:
  1. Select a field from the Name drop-down list for which data must be aggregated.
  2. Select one or more aggregate functions from the Aggregate Function drop-down list that you want to apply to a field.
4. (Optional) To add another rule to the profile, click Add Rule, and repeat step 5.a through step 5.c.
Click Add Data Rollup Order to define the frequency at which rollup summarization should occur.

The Name and Retention Policy drop-down lists, and the Rollup Interval text box are displayed.

To define a data rollup order:
1. Enter a name to identify the data rollup order in the Name text box.
  
  The maximum length is 64 characters.
  
  Regex pattern: “[a-zA-Z][a-zA-Z0-9_-]*”
2. Enter a value in the Rollup Interval text box to define the interval in which data is summarized.
  
  Regex pattern: “[1-9][0-9]*[mhdw]”, where m is minutes, h is hours, d is days, and w is weeks.
  
  Minimum value is 30m. Maximum value is 52w.
3. Select the retention policy for the rollup order from the Retention Policy drop-down list. A retention policy defines how long you want to retain the rolled-up data.
  
  Selecting a retention policy is optional. If you do not select a retention policy, the device group retention policy is considered by default.
4. (Optional) To define another data rollup order, click Add Data Rollup Order, and repeat step 6.a through step 6.c.
Click Save to only save the configuration.

Click Save and Deploy to save and immediately deploy the configuration.
You can now apply the data rollup summarization profile that you created to a specific device group. For more information, see Applying Data Summarization Profiles to a Device Group.

Create a Data Rollup Summarization Profile by using CLI

Figure 1 is an example configuration of how you can configure a data rollup summarization profile from the CLI.

Figure 1: Example CLI Configuration of Creating a Data Rollup Summarization Profile

Applying Data Summarization Profiles to a Device Group

After you create a data summarization profile, you can apply the profile to a specific device group to start summarizing TSDB data:

Click Configuration > Device Group in the left-nav bar.

The Device Group Configuration page is displayed.
Select the check box next to the name of the device group to which you want to apply the data summarization profile.
Click the Edit Device Group icon to edit the device group.

The Edit <device-group-name> page is displayed.

Apply a raw data summarization profile.

To apply a raw data summarization profile to a device group:

Click Summarization.

The Time Span and Data Summarization text boxes are displayed.
Enter the Time Span in seconds (s), minutes (m), hours (h), days (d), weeks (w), or years (y).

Choose the data summarization profiles from the drop-down list to apply the ingest data. To edit or view details about saved data summarization profiles, go to the Data Summarization page and click the Settings menu option in the left-nav bar.

If you select two or more profiles, the following guidelines apply:

If the same data type or sensor path name is configured in two or more profiles, the associated algorithms will be combined.

The table that stores the summarization output includes columns of summarized data for each algorithm associated with each data field collected by Paragon Insights. The naming convention for each column is as follows:

Number of algorithms associated with a data field	Column name for the summarized output
1	field-name Example: 5_sec_cpu_idle
2	field-name_first-algorithm-name, field-name_ second-algorithm-name Example: 5_sec_cpu_idle_MIN, 5_sec_cpu_idle_MAX
3	field-name_first-algorithm-name, field-name_ second-algorithm-name, field-name_ third-algorithm-name...

Number of algorithms associated with a data field

Column name for the summarized output

field-name

Example: 5_sec_cpu_idle

field-name_first-algorithm-name, field-name_ second-algorithm-name

Example: 5_sec_cpu_idle_MIN, 5_sec_cpu_idle_MAX

field-name_first-algorithm-name, field-name_ second-algorithm-name, field-name_ third-algorithm-name...

Apply a data rollup summarization profile

Points to remember before you apply a data rollup summarization profile to a device group:

Ensure that the rules present in the rollup profile are already associated with the device group.
You can add one or more than one rollup summarization profile to a device group.
Rules configured across all the profiles associated with the device group must be unique.
While associating a rollup profile with a device group, the interval of the first data rollup order must be less then the device group retention policy to avoid data overflow. The device group retention policy is set to 7 days by default.
When you want to remove a rule that is associated to a device group, you must first remove the data rollup summarization profile.

To apply a data rollup summarization profile to a device group:

Click Rollup Summarization.

The Rollup Summarization Profiles drop-down list is displayed.
Select the rollup summarization profiles you want to associate to this device group from the Rollup Summarization Profiles drop-down list.
(Optional) You can also deploy rollup configuration at the device group-level by using the CLI.

See Figure 2 for an example CLI configuration.

Figure 2: Example CLI Configuration

Click Save to only save the configuration.

Click Save and Deploy to save and immediately deploy the configuration.

Change History Table

Feature support is determined by the platform and release you are using. Use Feature Explorer to determine if a feature is supported on your platform.

Release

Description

4.0.0

Paragon Insights Release 4.0.0 supports data rollup summarization. You can create a rollup summarization profile to summarize processed data that is stored in fields in the TSDB.

ON THIS PAGE

Configure Data Summarization

Creating a Raw Data Summarization Profile

Creating a Data Rollup Summarization Profile

Applying Data Summarization Profiles to a Device Group

Change History Table