反馈已提交
网络繁忙
Note: Supported database versions: CLICKHOUSE, ORACLE (12.1 or above), SQLSERVER (2019 or above), PRESTO, REDSHIFT, VERTICA, MAXCOMPUTE.
For example, if a user needs to combine data with the same product name and calculate the corresponding remittance amount, the sample effect is shown below.
Grouped aggregation means that the original data are combined into one group first according to the conditions, and then aggregated and calculated according to the grouped data.
This is achieved by setting up grouping fields and summary fields in BI.
Note: From 2018-11-16 onwards, the weekly calculation logic of the annual week granularity adopts the iso-standard weekly logic, where the first day of the week starts on a Sunday and contains 52 or 53 weeks per year, and the first week of the year must contain the 4th of January. This ensures that there are 7 days of data per week, which is more in line with the business scenario. The week start time can be modified in Section 2.9 of the General Parameters.
This example shows the average value of order amount for different payment method for each quarter
1)Log into the Data Decision System, go to "Data Preparation > Package", and click "Add Table > Self-Service Dataset", as shown below.
2)Enter the self-service dataset configuration interface, select the required fields from the data table of the business package, and the selected fields are displayed in the preview on the right, as shown in the following figure.
Note: Only data tables that have been added and updated with data can be selected here; data tables that have not been updated cannot be used to create self-service data sets.
1)Select the grouping summary from the operational flow on the left, as shown in the following figure.
2)Enter the group summary configuration interface and drag the "SIGNDATE" and "PAYMENTMETHOD" fields into the group column, as shown in the following figure.
Click on the field to drop down and select the "year quarter" grouping for "SIGNDATE", as shown below.
Group fields support adding "text field", "numeric field", "time field", no fields are filled in by default, all support renaming and deleting field operations.
Same value as a group
Custom group
Same values as a group mean grouped by the same values in the text field
Custom group means that you define your own groups in the fields contained in the text
Interval group
The same value as a group i.e. the same value as a group according to the numeric field.
Interval grouping contains two ways: automatic and custom grouping.
By default, the automatic grouping system sets the interval according to the maximum and minimum values, divided into 5 groups, where the value of the interval can also be modified.
By default, custom groups are set according to the interval of automatic grouping, and the interval can be modified. It supports adding, modifying, and deleting groups, as well as custom naming of groups and grouping the remaining values of undefined groups into a custom named group, and this item is enabled by default.
Drag the "Amount" field into the summary column, as shown below:
Click on the field to drop down and select the "average" summary method for "Amount" as shown below.
Grouped fields support adding "text field", "numeric field", "time field", no fields are filled in by default, all support renaming and deleting field operations.
Distinct Count
Records Count
String Splicing Note: Only "Extract Data" is supported.
Approximate counting unrepeated Note: Only "Real-time data" is supported
The distinct count is to count the same value in the field only once, that is, count the number of different values, can be understood as count(distinct field)
The number of records is the number of records in the field, equivalent to count(*)
String stitching is the stitching of data values under this field by grouping columns into strings that are stitched together into a single value.
When the calculated data is very large, the traditional exact de-duplication count may not be calculated, and the result can be calculated quickly using approximate de-duplication count.
Note: The accuracy of the calculation result depends on the database type and data volume, theoretically the smaller the data volume the smaller the difference with the de-duplication count result.
Sum
Average
Maximum
Minimum
Approximate counting unrepeated Note: Only "Real-time data" is supported.
Records count
Variance
Standard deviation
Median
Earliest time
Latest time
When the calculated data is very large, the traditional exact number of de-duplicated records may not be calculated, and the result can be calculated quickly using the approximate number of de-duplicated records.
The records count is the number of records in the field, equivalent to count(*)
The earliest time and latest time refer to the earliest and latest time within the group.
Note: The database versions supported by the "Approximate counting unrepeated" function are: CLICKHOUSE, ORACLE (12.1 or above), SQLSERVER (2019 or above), PRESTO, REDSHIFT, VERTICA, MAXCOMPUTE. restriction, only the same field at the same time to do the need to sort the aggregation operation (such as to find the de-duplication count, median, percentile, approximate de-duplication count), at the same time for two or more fields to do such aggregation operation will occur errors. When the database system is VERTICA, only one of the (exact) distinct count or approximate counting unrepeated can be used at the same time due to the limitations of VERTICA.
After the group summary is set up, click the Save button in the upper right corner and after updating the data, you can see the newly created self-service dataset in the package, as shown in the following figure.
If the selected group field is numeric and the value size is infinite, the interval grouping option cannot be selected and there is no response after clicking interval grouping.
售前咨询电话
400-811-8890转1
在线技术支持
在线QQ:800049425
热线电话:400-811-8890转2
总裁办24H投诉
热线电话:173-1278-1526
文 档反 馈
鼠标选中内容,快速反馈问题
鼠标选中存在疑惑的内容,即可快速反馈问题,我们将会跟进处理。
不再提示
10s后关闭