Overview of Real-Time Task

  • Last update: June 26, 2026
  • iconNote:

    The Real-Time Task function in Data Development is available in FineDataLink only with a V5.0 license.

    To apply for a trial or learn details, contact technical support. For details, see Technical Support Channel Introduction.

    Overview

    Application Scenario

    1. You want to read data from source systems in real time and write real-time data to the target database. For example:

    • New order and contract data needs to be synchronized from the MySQL database of a CRM business system to the data warehouse in real time.

    • Data from each branch of a group-level enterprise needs to be written to Kafka in real time for the headquarters' data lake to consume.

    2. You want to process the read data in real time and then write the result data to the data warehouse. For example:

    • Water data of a water utility, such as upstream and downstream water levels and water levels of multiple tributaries, needs to be aggregated and calculated in real time before being presented.

    • Manufacturing enterprises need to display metrics, such as production line yield rates, utilization rates, factory efficiency, losses, and working hours, in real time.

    Function Description

    You can use Real-Time Task to deliver data from Point A to Point B in real time. Data cleaning, such as real-time data parsing in the data warehouse, is supported during delivery. Downstream business processes can use the final results. Real-time delivery with in-flight cleansing reduces latency and converts raw data into consumable data, improving both timeliness and utilization.

    Usage Restriction

    1. Function points related to Real-Time Task must be registered. For details, see the "Function Point Limit" section of Registration Introduction.

    2. To create a real-time task in Data Development, you must have Management permission on a specific folder or all real-time tasks. For details, see Scheduled Task Management Permission.

    3. Cluster projects of V5.0.1.2 and later versions support real-time tasks.

    4. From FineDataLink V5.0.1.5, if a WebSocket Input, MQTT Input, IBM MQ Input, or Pulsar Input operator is used in a real-time task, you cannot use Group Summary, Flink SQL, or Data Association in the task.

    Differences from Other Modules

    iconNote:
    For details, see FineDataLink Module Introduction.

    Function Description

    1. Data source:

    For data sources supported by real-time tasks, see Real-Time Task.

    2. Synchronization scenario:

    In a real-time task, only single-table real-time synchronization is supported. The synchronization types are described below:

    • Full + Incremental Synchronization: First synchronize all inventory data, and then continuously synchronize the changes.

    • Incremental Synchronization Only: Continuously synchronize data from the specified start point.

    3. Task development:

    The Real-Time Task module of FineDataLink provides various types of operators that can be used in combination to meet your diverse data processing needs.

    An operator is a basic unit of a real-time task. Multiple operators connected by lines define the execution flow and form a complete real-time task. The following table describes currently available operators:

    Category
    OperatorDescriptionRemark
    Data InputKafka InputIt enables real-time data reading from Kafka.
    CDC InputIt enables real-time data reading from databases.
    Paimon InputIt enables real-time data reading from Paimon.
    Pulsar InputIt enables real-time data reading from Pulsar.
    MQTT InputIt enables real-time data reading from MQTT.
    IBM MQ InputIt enables real-time data reading from IBM MQ.
    Dimension Table InputIt enables real-time data reading from dimension tables, facilitating cross-database joins using Data Association.
    WebSocket InputIt enables data reading from WebSocket.
    Data OutputDB Table Output (Real-Time Task)It enables real-time data output to a specified database.
    Kafka OutputIt enables real-time data output to Kafka.
    Paimon OutputIt enables real-time data output to Paimon.
    WebSocket OutputIt enables real-time data output to WebSocket.
    Scheduled Task InvocationIt enables calling a scheduled task within a real-time task.
    ConnectionData AssociationIt enables real-time cross-database joins.If all upstream input operators of Data Association are real-time data sources, a Flink engine must be configured.
    Data UnionIt enables merging tables row-wise into a single table.
    LaboratoryFlink SQLYou can query and process data in a real-time task by writing SQL statements.A Flink engine must be configured.
    TransformationColumn to RowIt enables converting columns in a table to rows, same as the Column to Row operator in scheduled tasks.
    JSON ParsingYou can parse JSON data and output data in the row-column format.
    XML ParsingYou can parse XML data into row-column format data.
    Field SettingYou can select and rename fields, and change data types.
    New Calculation ColumnIt enables creating a field by referencing existing fields or performing calculations based on them without modifying the original fields.
    Data FilteringYou can filter data to get records that meet specified conditions.
    Field-to-Column SplittingIt is used to split values of a field by delimiter or character count, where the split result forms multiple new columns with values.
    Group SummaryIt enables real-time aggregation calculations on the data flow.A Flink engine must be configured.
    Field-to-Row SplittingIt is used to split values of a field by delimiter, where the split result forms a new column with values.
    OthersRemarkYou can add remarks to tasks and operators.

    4. Task configuration:

    After developing a real-time task, you can configure Retry After Failure, Dirty Data Tolerance, Result Notification, Log Level Setting, and other settings for it. For details, see Single Real-Time Task Management.

    5. Task O&M:

    • You can view the execution status of real-time tasks and start or pause them in batches. For details, see Scheduled Task O&M – Task Management.

    • Two startup modes are available for real-time tasks:

      • Start: The task starts from the last checkpoint.

      • Restart: You can click Restart to initialize the task, in which case the synchronization will restart based on the synchronization type set in the input operator.

    • If the upstream input operators of Data Association include multiple real-time data sources, or if the Group Summary operator is used, the following applies after the real-time task starts:    

      • If the task stops before the full synchronization phase completes (for example, due to dirty data), you must restart it to ensure data consistency.

    Procedure

    iconNote:
    For details, see Registration Introduction.
    Step
    DescriptionHelp Document
    1Configure the data connections to the required data source and destination.Data Source Creation and Management
    2Configure the middleware for staging data.Transmission Queue Configuration
    3Enable logging for the data source to be read.Overview of Database Environment Preparation
    4Deploy and start Kafka.

    Kafka Deployment - KRaft Mode 

    Kafka Deployment - ZooKeeper Mode 

    5

    (Optional) Configure the Flink engine.

    • If the upstream input operators of Data Association include multiple real-time data sources, you must configure the Flink engine.

    • If you need to use the Group Summary operator in a real-time task in Data Development, you must configure the Flink engine.

    Flink Engine
    6Create and develop a real-time task.See the document of each operator.
    7Start the task./
    8Understand the real-time capture task.Function Description of Real-Time Capture Task
    Manage individual real-time tasks and control resources for individual tasks. Single Real-Time Task Management 

    Manage real-time tasks in batches.

    /

    附件列表


    主题: Data Development - Real-Time Task
    • Helpful
    • Not helpful
    • Only read

    滑鼠選中內容,快速回饋問題

    滑鼠選中存在疑惑的內容,即可快速回饋問題,我們將會跟進處理。

    不再提示

    10s後關閉

    Get
    Help
    Online Support
    Professional technical support is provided to quickly help you solve problems.
    Online support is available from 9:00-12:00 and 13:30-17:30 on weekdays.
    Page Feedback
    You can provide suggestions and feedback for the current web page.
    Pre-Sales Consultation
    Business Consultation
    Business: international@fanruan.com
    Support: support@fanruan.com
    Page Feedback
    *Problem Type
    Cannot be empty
    Problem Description
    0/1000
    Cannot be empty

    Submitted successfully

    Network busy