Missing Value Handling

  • Last update: July 29, 2025
  • Overview

    A data warehouse receives massive amounts of data from diverse sources, where discrepancies in formats, statistical calibers, and value representation methods may exist.

    Therefore, data with missing values is inevitable. This document introduces how to handle such data.

    You can click to download the example data: Internet.xls.

    Missing Data Deletion

    If feasible, simply deleting a small amount of sample data is the most effective way to handle data with missing values.

    For example, the Event_Date field in the Internet table has null values. If invalid, data with missing values can be directly deleted.

    2-1.png

    Add a Data Transformation node, and drag in a DB Table Input operator to extract data from the Internet table, as shown in the following figure.

    2-2.png

    Use a Data Filtering operator to remove data records whose Event_Date are null, as shown in the following figure.

    The effect of the Data Filtering operator is equivalent to that of the SQL statement: Event_Date is not null.

    2-3.png

    Missing Data Imputation

    When excessive missing values make deletion impractical, imputation with specified data becomes necessary.

    For example, in the following table, the Session_Duration field has many null values, and you want to replace these null values with 0.

    3-1.png

    Use a New Calculation Column operator to add a new column of data. Enter the formula IF(ISNULL(#{Session_Duration}),0.0,#{Session_Duration}). This means that if Session_Duration is empty, the function returns 0.0; otherwise, it returns the specific value of Session_Duration, as shown in the following figure.

    iconNote:
    The last two parameters of the IF function must be of the same type. For example, if the values of Session_Duration in the database are of double type, 0 replacing the null values also needs to be written in the 0.0 format.

    3-2.png

    Click Data Preview. The null values have been replaced with 0, as shown in the following figure.

     3-3.png

    附件列表


    主题: Data Development - Scheduled Task
    Previous
    Next
    • Helpful
    • Not helpful
    • Only read

    滑鼠選中內容,快速回饋問題

    滑鼠選中存在疑惑的內容,即可快速回饋問題,我們將會跟進處理。

    不再提示

    10s後關閉

    Get
    Help
    Online Support
    Professional technical support is provided to quickly help you solve problems.
    Online support is available from 9:00-12:00 and 13:30-17:30 on weekdays.
    Page Feedback
    You can provide suggestions and feedback for the current web page.
    Pre-Sales Consultation
    Business Consultation
    Business: international@fanruan.com
    Support: support@fanruan.com
    Page Feedback
    *Problem Type
    Cannot be empty
    Problem Description
    0/1000
    Cannot be empty

    Submitted successfully

    Network busy