Basic functions of group summary- FineBI Help Document FineBI Document

Recent Updates: April 15, 2022

1. Overview

1.1 Version

FineBI Version	JAR	Function changes
5.1	-	-
5.1	2020-08-04	Remove copy-to-group feature from a custom grouping of self-service datasets
5.1.6	2020-09-30	Add "Approximate de-duplication" function to the "Real-time data" field summary method Note: Supported database versions: CLICKHOUSE, ORACLE (12.1 or above), SQLSERVER (2019 or above), PRESTO, REDSHIFT, VERTICA, MAXCOMPUTE.
5.1.10	2021-02-05	Group summary supports setting "Custom Same Ring Ratio", and supports setting same ring ratio by "Year", "Quarter", "Month", and "Week".
5.1.11	2021-04-02	Renaming a field has no effect on the display name of the field in the previous steps

1.2 Application Scenarios

For example, if a user needs to combine data with the same product name and calculate the corresponding remittance amount, the sample effect is shown below.

image (10).png

1.3 Function Introduction

Grouped aggregation means that the original data are combined into one group first according to the conditions, and then aggregated and calculated according to the grouped data.

This is achieved by setting up grouping fields and summary fields in BI.

Note: From 2018-11-16 onwards, the weekly calculation logic of the annual week granularity adopts the iso-standard weekly logic, where the first day of the week starts on a Sunday and contains 52 or 53 weeks per year, and the first week of the year must contain the 4th of January. This ensures that there are 7 days of data per week, which is more in line with the business scenario. The week start time can be modified in Section 2.9 of the General Parameters.

2. Example

This example shows the average value of order amount for different payment method for each quarter

2.1 Add data table

1）Log into the Data Decision System, go to "Data Preparation > Package", and click "Add Table > Self-Service Dataset", as shown below.

image (11).png

2）Enter the self-service dataset configuration interface, select the required fields from the data table of the business package, and the selected fields are displayed in the preview on the right, as shown in the following figure.

Note: Only data tables that have been added and updated with data can be selected here; data tables that have not been updated cannot be used to create self-service data sets.

image (12).png

2.2 Add group fields

2.2.1 Select group fields

1）Select the grouping summary from the operational flow on the left, as shown in the following figure.

image (13).png

2）Enter the group summary configuration interface and drag the "SIGNDATE" and "PAYMENTMETHOD" fields into the group column, as shown in the following figure.

image (14).png

2.2.2 Set the group method

Click on the field to drop down and select the "year quarter" grouping for "SIGNDATE", as shown below.

image (15).png

2.2.3 Group field settings support range

Group fields support adding "text field", "numeric field", "time field", no fields are filled in by default, all support renaming and deleting field operations.

Field types	Support group method	Description
text filed	Same value as a group Custom group	Same values as a group mean grouped by the same values in the text field Custom group means that you define your own groups in the fields contained in the text
numeric field	Same value as a group Interval group	The same value as a group i.e. the same value as a group according to the numeric field. Interval grouping contains two ways: automatic and custom grouping. By default, the automatic grouping system sets the interval according to the maximum and minimum values, divided into 5 groups, where the value of the interval can also be modified. By default, custom groups are set according to the interval of automatic grouping, and the interval can be modified. It supports adding, modifying, and deleting groups, as well as custom naming of groups and grouping the remaining values of undefined groups into a custom named group, and this item is enabled by default.
time field	Support 17 group types, including year, month, year, quarter, month, day, day, week, hour, minute, second, year quarter, year month, year week, year month day hour, year month day hour minute, year month day hour minute second

2.3 Add summary field

2.3.1 Select the summary field

Drag the "Amount" field into the summary column, as shown below：

image (16).png

2.3.2 Set the summary method

Click on the field to drop down and select the "average" summary method for "Amount" as shown below.

image (17).png

2.3.3 Summary field setting support range

Grouped fields support adding "text field", "numeric field", "time field", no fields are filled in by default, all support renaming and deleting field operations.

Field types	Support group method	Description
text field	Distinct Count Records Count String Splicing Note: Only "Extract Data" is supported. Approximate counting unrepeated Note: Only "Real-time data" is supported	The distinct count is to count the same value in the field only once, that is, count the number of different values, can be understood as count(distinct field) The number of records is the number of records in the field, equivalent to count(*) String stitching is the stitching of data values under this field by grouping columns into strings that are stitched together into a single value. When the calculated data is very large, the traditional exact de-duplication count may not be calculated, and the result can be calculated quickly using approximate de-duplication count. Note: The accuracy of the calculation result depends on the database type and data volume, theoretically the smaller the data volume the smaller the difference with the de-duplication count result.
numeric field	Sum Average Maximum Minimum Distinct Count Approximate counting unrepeated Note: Only "Real-time data" is supported. Records count Variance Standard deviation Median
time field	Distinct Count Approximate counting unrepeated Note: Only "Real-time data" is supported. Records count Earliest time Latest time	The distinct count is to count the same value in the field only once, that is, count the number of different values, can be understood as count(distinct field) When the calculated data is very large, the traditional exact number of de-duplicated records may not be calculated, and the result can be calculated quickly using the approximate number of de-duplicated records. Note: The accuracy of the calculation result depends on the database type and data volume, theoretically the smaller the data volume the smaller the difference with the de-duplication count result. The records count is the number of records in the field, equivalent to count(*) The earliest time and latest time refer to the earliest and latest time within the group.

Field types

Support group method

Description

text field

Distinct Count
Records Count
String Splicing Note: Only "Extract Data" is supported.
Approximate counting unrepeated Note: Only "Real-time data" is supported

The distinct count is to count the same value in the field only once, that is, count the number of different values, can be understood as count(distinct field)
The number of records is the number of records in the field, equivalent to count(*)
String stitching is the stitching of data values under this field by grouping columns into strings that are stitched together into a single value.
When the calculated data is very large, the traditional exact de-duplication count may not be calculated, and the result can be calculated quickly using approximate de-duplication count.

Note: The accuracy of the calculation result depends on the database type and data volume, theoretically the smaller the data volume the smaller the difference with the de-duplication count result.

numeric field

Sum
Average
Maximum
Minimum
Distinct Count
Approximate counting unrepeated Note: Only "Real-time data" is supported.
Records count
Variance
Standard deviation
Median

time field

Distinct Count
Approximate counting unrepeated Note: Only "Real-time data" is supported.
Records count
Earliest time
Latest time

The distinct count is to count the same value in the field only once, that is, count the number of different values, can be understood as count(distinct field)
When the calculated data is very large, the traditional exact number of de-duplicated records may not be calculated, and the result can be calculated quickly using the approximate number of de-duplicated records.

Note: The accuracy of the calculation result depends on the database type and data volume, theoretically the smaller the data volume the smaller the difference with the de-duplication count result.

The records count is the number of records in the field, equivalent to count(*)
The earliest time and latest time refer to the earliest and latest time within the group.

Note: The database versions supported by the "Approximate counting unrepeated" function are: CLICKHOUSE, ORACLE (12.1 or above), SQLSERVER (2019 or above), PRESTO, REDSHIFT, VERTICA, MAXCOMPUTE. restriction, only the same field at the same time to do the need to sort the aggregation operation (such as to find the de-duplication count, median, percentile, approximate de-duplication count), at the same time for two or more fields to do such aggregation operation will occur errors. When the database system is VERTICA, only one of the (exact) distinct count or approximate counting unrepeated can be used at the same time due to the limitations of VERTICA.

2.4 Effect view

After the group summary is set up, click the Save button in the upper right corner and after updating the data, you can see the newly created self-service dataset in the package, as shown in the following figure.

image (18).png

3. Caution

If the selected group field is numeric and the value size is infinite, the interval grouping option cannot be selected and there is no response after clicking interval grouping.

Already the First

Already the Last

Helpful
Not helpful
Only read

售前咨询电话

400-811-8890转1

在线技术支持

在线QQ：800049425

热线电话：400-811-8890转2

总裁办24H投诉

热线电话：173-1278-1526

文档反馈

鼠标选中内容，快速反馈问题

鼠标选中存在疑惑的内容，即可快速反馈问题，我们将会跟进处理。

不再提示

10s后关闭

Basic functions of group summary