The Scheduled Task module in Data Development supports data extraction, transformation, and loading using various nodes and operators on a visual page. It enables the automatic execution of scheduled tasks through the Timed Scheduling function, helping you build offline data warehouses effortlessly and ensuring efficient and stable data production.
This article describes the capabilities of the Scheduled Task module.
Function Description
For details, see the "Types of Data Sources Supported by Scheduled Task" section of Data Sources Supported by FineDataLink.
The data sources in the first and second columns in the following table can be combined flexibly. For example, you can read file data and write the data into a database after processing (optional).
API (API)
Writing data into web services is not supported currently.
Connector (Jodoo)
Writing data into SAP RFC is not supported currently.
Others
1. The Union All and Data Association operators can be used to merge two different tables.
2. The SQL Script node allows for processing multi-table data by writing SQL statements.
The Data Distribution function can be used to save the processed data to multiple tables.
You can execute the scheduled task by calling the API.
You can view the status of the scheduled task instance by calling the API.
You can terminate the running of the scheduled task by calling the API.
For details, see Calling Scheduled Task Related APIs.
Scheduled Task lets you convert data flexibly among structured, semi-structured, and unstructured formats.
Example:
Clean the raw data to output standardized data.
Data Cleaning★★★★★
The data update methods include incremental update, full update, and comparison-based update. For details, see Overview of Data Synchronization Method.
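The difference between the three update methods can be illustrated with a minimal Python sketch. It is not FineDataLink's implementation: the function names, the `id` key, and the `updated_at` column are hypothetical, and the sketch only shows the general idea usually associated with each term.

```python
from typing import Dict, List, Tuple

Row = Dict[str, object]


def full_update(source: List[Row]) -> List[Row]:
    """Full update: the target is emptied and reloaded with every source row."""
    return [dict(row) for row in source]


def incremental_rows(source: List[Row], last_sync: str,
                     column: str = "updated_at") -> List[Row]:
    """Incremental update: pick only rows changed after the last sync mark.

    Assumes the column holds ISO-8601 date strings, which sort correctly
    as plain strings.
    """
    return [dict(row) for row in source if str(row[column]) > last_sync]


def comparison_diff(target: List[Row], source: List[Row],
                    key: str = "id") -> Tuple[List[Row], List[Row], List[object]]:
    """Comparison-based update: compare source and target by key and
    report rows to insert, rows to update, and keys to delete."""
    target_by_key = {row[key]: row for row in target}
    source_by_key = {row[key]: row for row in source}
    inserts = [r for k, r in source_by_key.items() if k not in target_by_key]
    updates = [r for k, r in source_by_key.items()
               if k in target_by_key and r != target_by_key[k]]
    deletes = [k for k in target_by_key if k not in source_by_key]
    return inserts, updates, deletes
```

Full update is the simplest but moves the most data. Incremental update needs a reliable change marker in the source. Comparison-based update needs a stable key but can also detect deleted rows.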
Integrating various types of heterogeneous data is supported to achieve data association and dimensional data modeling. For details, see Union All, Data Association, SQL Script, and Spark SQL.
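As a rough illustration of the two merge styles, the following Python sketch assumes that Union All behaves like SQL `UNION ALL` (rows are stacked and duplicates are kept) and that Data Association behaves like a SQL join. A left join is chosen here only for illustration. The function names and sample fields are hypothetical and are not part of FineDataLink.

```python
from typing import Dict, List

Row = Dict[str, object]


def union_all(first: List[Row], second: List[Row]) -> List[Row]:
    """Stack two tables with the same columns; duplicates are kept."""
    return [dict(row) for row in first] + [dict(row) for row in second]


def left_join(left: List[Row], right: List[Row], key: str) -> List[Row]:
    """Attach matching right-hand rows to each left-hand row by key.

    Left rows without a match are kept unchanged. On a column name
    clash, the left-hand value wins.
    """
    index: Dict[object, List[Row]] = {}
    for row in right:
        index.setdefault(row[key], []).append(row)

    result: List[Row] = []
    for row in left:
        matches = index.get(row[key])
        if matches:
            for match in matches:
                result.append({**match, **row})
        else:
            result.append(dict(row))
    return result
```

For example, `left_join(orders, customers, "customer_id")` would enrich each order row with the customer's attributes, while `union_all(orders_2023, orders_2024)` would produce one combined order table.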
Scheduled Task provides instant alerts for key business data when limits are reached. For details, see Notification.
Notification of scheduled task execution failure is also supported. For details, see Task Control - Result Notification.
Data can also be output as files for storage. For details, see Function Description of File Output Operator and File Transfer.
1. Scheduled Task can detect DDL changes in the source table of a scheduled task and notify specified users. For details, see Synchronizing DDL Changes Using Scheduled Task.
2. Data Lineage helps you understand the entire data production process and the impact scope of modifications to data tables. For details, see Lineage Analysis.
3. The front-end Database Table Management function enables visual management of multi-source data and helps understand data conditions quickly by searching for specific database tables.
Task development and editing are separated from task execution. You can edit and modify scheduled tasks in a development environment, where you can also perform task commissioning, and then publish them to a stable production environment, ensuring that task development and execution do not interfere with each other.
For details, see Development Mode and Production Mode.
The canvas-like development page enables efficient task development and easy O&M.
For details about differences between Data Synchronization and Data Transformation, see Differences Between Data Synchronization and Data Transformation.
You can define the execution order of data flow nodes and scripts (such as serial and parallel execution) by connecting nodes.
Logic nodes for task orchestration, such as Conditional Branch and Loop Container, are provided.
Examples of the step flow are shown in the following figure.
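The way node connections determine serial and parallel execution can be sketched with a topological sort. This is a conceptual illustration only, not how FineDataLink schedules nodes internally. The node names are hypothetical, and the sketch relies on the standard `graphlib` module (Python 3.9 or later).

```python
from graphlib import TopologicalSorter
from typing import Dict, List, Set


def execution_waves(upstream: Dict[str, Set[str]]) -> List[List[str]]:
    """Group nodes into waves.

    `upstream` maps each node to the set of nodes connected before it.
    Nodes in the same wave have no connection between them, so they
    could run in parallel. Waves themselves run one after another.
    """
    sorter = TopologicalSorter(upstream)
    sorter.prepare()  # raises graphlib.CycleError if connections form a loop
    waves: List[List[str]] = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted only for stable output
        waves.append(ready)
        sorter.done(*ready)
    return waves


# Hypothetical step flow: two loads fan out from "start", then merge.
flow = {
    "load_orders": {"start"},
    "load_customers": {"start"},
    "merge": {"load_orders", "load_customers"},
}
waves = execution_waves(flow)
```

Here `waves` contains three groups: `start` alone, then the two load nodes together, then `merge`.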
A data flow focuses on data processing, such as data synchronization and transformation and file transfer.
Various visual operators are provided for fast data input, transformation, and output.
The data flow can flexibly meet all data cleaning and transformation needs.
FineDataLink provides three scheduling strategies, namely time-based, event dependency-based, and trigger-based scheduling, which can be flexibly combined to meet diverse scheduling needs.
For details, see Overview of Scheduling Plan.
You can set task priority based on the business data priority and set the log level to ensure focused support for critical businesses.
For details, see Task Control - Task Attribute.
FineDataLink offers fault tolerance mechanisms, including Timeout Limit, Retry After Failure, and Dirty Data Tolerance. When a task encounters controllable risks such as network fluctuations or a small amount of dirty data, the platform automatically retries the task to keep it running smoothly.
For details, see Task Control - Fault Tolerance Mechanism.
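To make the ideas behind Retry After Failure and Dirty Data Tolerance concrete, here is a minimal Python sketch. It does not reflect FineDataLink's actual implementation or configuration. The function names, the `max_retries` and `dirty_limit` parameters, and the use of `ValueError` to mark a dirty row are all assumptions made for illustration. Timeout handling is left out to keep the example short.

```python
import time
from typing import Callable, Iterable, List, Tuple, TypeVar

T = TypeVar("T")


def run_with_retry(task: Callable[[], T], max_retries: int,
                   wait_seconds: float = 0.0) -> T:
    """Run the task and retry up to `max_retries` times after a failure."""
    attempt = 0
    while True:
        try:
            return task()
        except Exception:
            if attempt >= max_retries:
                raise  # retries exhausted: surface the failure
            attempt += 1
            time.sleep(wait_seconds)


def write_with_dirty_tolerance(rows: Iterable[object],
                               write_row: Callable[[object], None],
                               dirty_limit: int) -> Tuple[int, List[object]]:
    """Write rows one by one and tolerate up to `dirty_limit` bad rows.

    A row counts as dirty when `write_row` raises ValueError
    (an assumption of this sketch).
    """
    written = 0
    dirty: List[object] = []
    for row in rows:
        try:
            write_row(row)
            written += 1
        except ValueError:
            dirty.append(row)
            if len(dirty) > dirty_limit:
                raise RuntimeError("dirty data threshold exceeded")
    return written, dirty
```

The two mechanisms address different risks. Retrying helps with transient failures such as a network fluctuation, while the dirty data threshold lets a run continue when only a few rows are invalid.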
With result notification enabled, the platform notifies the relevant personnel when a task fails.
Notification channels include platform messages, email, SMS, and DingTalk, Lark, and WeCom chatbots.
Notification content covers task exceptions, dirty data, and source table structure changes.
For details, see Task Control - Result Notification.
In Recycle Bin, you can restore or permanently delete tasks that were deleted from Scheduled Task, Data Pipeline, and Data Service.
For details, see Recycle Bin.
1. You can monitor and manage scheduled tasks in real time under O&M Center > Scheduled Task > Running Record. There you can view task running information such as the running status, duration, and triggering method, and retry scheduled tasks (for example, to backfill data or to re-execute a task after resolving the dirty data that caused a data writing failure).
For details, see Running Record.
2. Under O&M Center > Scheduled Task > Scheduling Plan, you can set, modify, and delete the execution frequency and the event-based scheduling plan for one or more scheduled tasks.
For details, see Scheduled Task O&M - Scheduling Plan.
3. You can view task information such as the number of scheduled tasks, task scheduling status, and scheduling plan execution status, and configure result notification for tasks in batches under O&M Center > Scheduled Task > Task Management.
For details, see Scheduled Task O&M - Task Management.
4. You can manage the memory resources used by scheduled tasks for an independently deployed FineDataLink project.
For details, see Load Distribution.
Permissions related to the Scheduled Task module are explained in the following table.
Data Connection Use Permission
After being granted Use permission on a data connection, the user can use the data connection to create scheduled tasks, pipeline tasks, and API tasks after login.
Data Connection Management Permission
After being granted Management permission on a data connection, the user can copy, rename, modify, and delete the data connection under System Management > Data Connection > Data Connection Management after login.
Data Connection Authorization Permission
After being granted Authorization permission on a data connection, the user (subordinate admin) can assign corresponding permission on the data connection to manageable users under System Management > Permission Management after login.
In FineDataLink, you (the admin) may want to assign Use permission on Scheduled Task in Data Development, Data Pipeline, Data Service, and Database Table Management to other users so that they can:
Use Scheduled Task and Data Pipeline for data processing.
Use Data Service to publish APIs.
Use Database Table Management to write/debug SQL statements, view created tables, as well as delete, clear, and copy and paste tables in FineDataLink.
You (the super admin) can assign Authorization permission on Data Platform to a user to make the user a subordinate admin.
The subordinate admin can assign Use permission on modules on the data platform to subordinate members.
You (the super admin) can assign Authorization permission on Scheduled Task to a user to make the user a subordinate admin.
The subordinate admin can assign Management permission on scheduled tasks to subordinate members to allow them to view and edit the scheduled task.