反馈已提交

网络繁忙

You are viewing 5.1 help doc. More details are displayed in the latest help doc.

Basic functions of group summary

  • Recent Updates: April 15, 2022
  • 1. Overview

    1.1 Version

    FineBI VersionJARFunction changes
    5.1--
    2020-08-04Remove copy-to-group feature from a custom grouping of self-service datasets
    5.1.62020-09-30Add "Approximate de-duplication" function to the "Real-time data" field summary method

    Note: Supported database versions: CLICKHOUSE, ORACLE (12.1 or above), SQLSERVER (2019 or above), PRESTO, REDSHIFT, VERTICA, MAXCOMPUTE.

    5.1.102021-02-05Group summary supports setting "Custom Same Ring Ratio", and supports setting same ring ratio by "Year", "Quarter", "Month", and "Week".
    5.1.112021-04-02Renaming a field has no effect on the display name of the field in the previous steps

    1.2 Application Scenarios

    For example, if a user needs to combine data with the same product name and calculate the corresponding remittance amount, the sample effect is shown below.

     image (10).png

     

    1.3 Function Introduction

    Grouped aggregation means that the original data are combined into one group first according to the conditions, and then aggregated and calculated according to the grouped data.

    This is achieved by setting up grouping fields and summary fields in BI.

    Note: From 2018-11-16 onwards, the weekly calculation logic of the annual week granularity adopts the iso-standard weekly logic, where the first day of the week starts on a Sunday and contains 52 or 53 weeks per year, and the first week of the year must contain the 4th of January. This ensures that there are 7 days of data per week, which is more in line with the business scenario. The week start time can be modified in Section 2.9 of the General Parameters.

    2. Example

    This example shows the average value of order amount for different payment method for each quarter

    2.1 Add data table

    1)Log into the Data Decision System, go to "Data Preparation > Package", and click "Add Table > Self-Service Dataset", as shown below.

    image (11).png

    2)Enter the self-service dataset configuration interface, select the required fields from the data table of the business package, and the selected fields are displayed in the preview on the right, as shown in the following figure.

    Note: Only data tables that have been added and updated with data can be selected here; data tables that have not been updated cannot be used to create self-service data sets.

    image (12).png

    2.2 Add group fields

    2.2.1 Select group fields

    1)Select the grouping summary from the operational flow on the left, as shown in the following figure.

    image (13).png

    2)Enter the group summary configuration interface and drag the "SIGNDATE" and "PAYMENTMETHOD" fields into the group column, as shown in the following figure.

    image (14).png

    2.2.2 Set the group method

    Click on the field to drop down and select the "year quarter" grouping for "SIGNDATE", as shown below.

    image (15).png

    2.2.3 Group field settings support range

    Group fields support adding "text field", "numeric field", "time field", no fields are filled in by default, all support renaming and deleting field operations.

    Field typesSupport group methodDescription
    text filed
    • Same value as a group

    • Custom group

    • Same values as a group mean grouped by the same values in the text field

    • Custom group means that you define  your own groups in the fields contained in the text

    numeric field
    • Same value as a group

    • Interval group

    • The same value as a group i.e. the  same value as a group according to the numeric field.

    • Interval grouping contains two ways: automatic and custom grouping.

    • By default, the automatic grouping system sets the interval according to the maximum and minimum values, divided into 5 groups, where the value of the interval can also be modified.

    • By default, custom groups are set according to the interval of automatic grouping, and the interval can be modified. It supports adding, modifying, and deleting groups, as well as custom naming of groups and grouping the remaining values of undefined groups into a custom named group, and this item is enabled by default.

    time fieldSupport 17 group types, including year, month, year, quarter, month, day, day, week, hour, minute, second, year quarter, year month, year week, year month day hour, year month day hour minute, year month day hour minute second

    2.3 Add summary field

    2.3.1 Select the summary field

    Drag the "Amount" field into the summary column, as shown below

    image (16).png

    2.3.2 Set the summary method

    Click on the field to drop down and select the "average" summary method for "Amount" as shown below.

    image (17).png

    2.3.3 Summary field setting support range

    Grouped fields support adding "text field", "numeric field", "time field", no fields are filled in by default, all support renaming and deleting field operations.


    Field types
    Support group methodDescription
    text field
    • Distinct Count

    • Records Count

    • String Splicing Note: Only "Extract Data" is supported.

    • Approximate counting unrepeated Note: Only "Real-time data" is supported

    • The distinct count is to count the same value in the field only once, that is, count the number of different values,      can be understood as count(distinct field)

    • The number of records is the number of records in the field, equivalent to count(*)

    • String stitching is the stitching of data values under this field by grouping columns into strings that are stitched together into a single value.

    • When the calculated data is very large, the traditional exact de-duplication count may not be calculated, and the result can be calculated quickly using approximate de-duplication count.

    Note: The accuracy of the calculation result depends on the database type and data volume, theoretically the smaller the data volume the smaller the difference with the de-duplication count result.

    numeric field
    • Sum

    • Average

    • Maximum

    • Minimum

    • Distinct Count

    • Approximate counting unrepeated Note: Only "Real-time data" is supported.

    • Records count

    • Variance

    • Standard deviation

    • Median


    time field
    • Distinct Count

    • Approximate counting unrepeated Note: Only "Real-time data" is supported.

    • Records count

    • Earliest time

    • Latest time

    • The distinct count is to count the same value in the field only once, that is, count the number of different values, can be understood as count(distinct field)

    • When the calculated data is very large, the traditional exact number of de-duplicated records may not be calculated, and the result can be calculated quickly using the approximate number of de-duplicated records.

    Note: The accuracy of the calculation result depends on the database type and data volume, theoretically the smaller the data volume the smaller the difference with the de-duplication count result.

    • The records count is the number of records in the field, equivalent to count(*)

    • The earliest time and latest time refer to the earliest and latest time within the group.


    Note: The database versions supported by the "Approximate counting unrepeated" function are: CLICKHOUSE, ORACLE (12.1 or above), SQLSERVER (2019 or above), PRESTO, REDSHIFT, VERTICA, MAXCOMPUTE. restriction, only the same field at the same time to do the need to sort the aggregation operation (such as to find the de-duplication count, median, percentile, approximate de-duplication count), at the same time for two or more fields to do such aggregation operation will occur errors. When the database system is VERTICA, only one of the (exact) distinct count or approximate counting unrepeated can be used at the same time due to the limitations of VERTICA.

    2.4 Effect view

    After the group summary is set up, click the Save button in the upper right corner and after updating the data, you can see the newly created self-service dataset in the package, as shown in the following figure.

    image (18).png

    3. Caution

    If the selected group field is numeric and the value size is infinite, the interval grouping option cannot be selected and there is no response after clicking interval grouping.

    Attachment List


    Theme: Data Processing
    Already the First
    Already the Last
    • Helpful
    • Not helpful
    • Only read

    售前咨询电话

    400-811-8890转1

    在线技术支持

    在线QQ:800049425

    热线电话:400-811-8890转2

    总裁办24H投诉

    热线电话:173-1278-1526

    文 档反 馈

    鼠标选中内容,快速反馈问题

    鼠标选中存在疑惑的内容,即可快速反馈问题,我们将会跟进处理。

    不再提示

    10s后关闭