Availability Indicator

  • Last update: 2025-12-23
  • Overview

    Version

    FineOps VersionFunctional Change

    V2.12.0

    /

    Application Scenario

    Most enterprises have clear requirements for the stability of their business systems. O&M teams leverage system availability indicators to assess the quality of services provided by these systems.

    System availability directly impacts user experience. If system availability falls short of expectations during specific periods or if frequent downtime occurs, it not only affects end-user satisfaction but may also lead to business interruptions and customer loss.

    So, how can O&M teams effectively monitor system availability to ensure both the stability of business systems and user satisfaction?

    Function Description

    FineOps provides an Availability Statistics function to help address this challenge, by which O&M teams can:

    1. Define and calculate system availability: It enables the definition and calculation of system availability, offering availability rates across different time periods to help teams gain comprehensive insight into the system operation.

    2. Configure shutdown plans: You can create shutdown plans to distinguish scheduled shutdowns and crashes for accurate downtime statistics, enabling a more holistic assessment of system stability.

    3. Obtain detailed downtime records: FineOps displays detailed downtime records, assisting O&M teams in identifying potential issues and vulnerabilities for timely adjustments and maintenance.

    With these functions, O&M teams can effectively monitor the system or device operation, promptly identify and resolve issues, and ultimately improve system availability and user satisfaction.

     

    Availability Indicator

    Prerequisite

    System availability is determined from O&M project monitoring data collected by FineOps.

    Therefore, to obtain accurate system availability indicators, you must ensure that FineOps' O&M project monitoring is functioning properly.

    For details, see Prerequisites of Using Monitoring Dashboards.

    Defining Availability Indicators

    Definition

    System availability indicator = ∑Available time slices of the system/ (∑Available time slices of the system + ∑Unplanned unavailable time slices of the system)

    iconNote:
    Server downtime is excluded from the statistics.
    • Available time slices of the system: periods during which FineOps' monitoring agent is alive and successfully collects monitoring data from the O&M project

    • Unplanned unavailable time slices of the system: two calculation methods available

    All Unplanned Stop

    Definition: Unavailable period – (Scheduled downtime ∩ Unavailable period)

    Example:

    Assume that scheduled downtime is from 3:00 to 4:00, and the actual unavailable periods are from 1:00 to 2:00 and from 2:30 to 3:30.

    In this case, the periods from 1:00 to 2:00 and from 2:30 to 3:00 are counted as unplanned downtime.

    Unavailable period: A period is identified as unavailable if during which:

    • The FineOps monitoring agent is alive, but fails to collect monitoring data from the O&M project for more than 3 minutes.

    • The load score of components ( FineReportFineDataLinkFineBI - Application NodeEngine - Calculation Node, or Engine - Metadata Node) exceeds 100 after five consecutive Full GC events.

    • The FineReportFineDataLinkFineBI - Application Node, Engine - Calculation Node, or Engine - Metadata Node component disappears.

    Scheduled downtime ∩ Unavailable period:

    It refers to the intersection of the scheduled downtime window and unavailable periods.

    System Crash Only

    Definition: unavailable periods caused by a system crash 

    A node will be identified as experiencing a crash if any of the following occurs:

    • The load score of components ( FineReportFineDataLinkFineBI - Application NodeEngine - Calculation Node, or Engine - Metadata Node) exceeds 100 after five consecutive Full GC events.

    • The FineReportFineDataLinkFineBI - Application Node, Engine - Calculation Node, or Engine - Metadata Node component disappears.

    Setting Method

    1. Log in to FineOps as the admin, click an O&M project, and choose Availability Statistics > Availability Indicator.

    2. Click the  icon. You can customize the calculation logic of Unplanned Unavailable Time Slices of the System, which defaults to All Unplanned Stops.

    3. Click OK to apply the change.

    Viewing Availability Indicators

    1. Log in to FineOps as the admin, click an O&M project, and choose Availability Statistics > Availability Indicator.

    2. For multi-node O&M projects, you can switch the application node in the top-right corner to view its availability indicators. By default, the overall indicators of the entire project are displayed.

    3. Availability indicators of four periods are displayed, including Availability Indicator of Yesterday, Availability Indicator of Last 7 Days, Availability Indicator of Last 30 Days, and Availability Indicator of Last Year.

    4. Clicking the availability indicator of any period will update data in the availability column chart below accordingly, allowing you to view availability indicator details in the selected period.

    Downtime Record

    Prerequisite

    To view downtime records of an O&M project, you must ensure the project is available.

    Otherwise, the message will be displayed: "Abnormal project status. Use this project after restoring it or view other projects."

     

    Viewing Downtime Records

    The Downtime Record list displays detailed downtime entries within the selected time period.

    1. Log in to FineOps as the admin, click an O&M project, and choose Availability Statistics > Availability Indicator.

    2. You can select the query period in the upper right corner of the Downtime Record module. By default, records from the past week are displayed.

    2. For multi-node O&M projects, you can switch the application node to view its downtime records. By default, the downtime records of the entire project are displayed.

    4. All downtime records within the selected period are displayed in a detail table format.

    ContentDescription

    Stop Time

    The start time of the unavailable period
    Recovery TimeThe end time of the unavailable period
    DurationThe length of the unavailable period (Recovery Time - Start Time)
    Downtime Type

    Downtime occurs in the following three scenarios:

    Planned Shutdown: It refers to shutdowns caused by plans created in Shutdown Plan. Hover over it to view the plan details.

    System Crash: FineOps has identified a project crash. Common causes and recommended solutions are listed in the following table.

    Unplanned Stop: FineOps has detected abnormal monitoring data collection for the project, but has not identified a crash. Hover over it, and you can create a shutdown plan.

    Automatic Restart

    It indicates whether the strategy (that enables automatic restarts upon crashes) set in Crash Handling Strategy is triggered.

    You can click No (if any) to view specific reasons.

    The following table describes the common crash causes and recommended solutions.

    Crash CauseRecommended Solution

    Overflow errors caused by insufficient memory

    You can view the specific templates that caused the issue, including:

    • Aborted templates

    • Templates that consume excessively high memory for calculation

    • Templates with excessively long calculations

    • Templates with excessively long SQL statement execution

    • Templates with dataset row counts exceeding the recommended limit

    • Templates with cell counts exceeding the recommended limit

    1. Troubleshoot the template performance and optimize the content.

    For details about template performance, see Template Performance.

    2. Enable template limits and configure reasonable limit ranges.

    For details, see Template Limit.

    3. You are advised to use the System Inspection function to diagnose the FanRuan application. If the on-heap memory size of the current system is less than the recommended value, set the on-heap memory size to the recommended value.

    For details about system inspection, see System Inspection.

    JDK version issueYou are advised to use JDK8 of V1.8.0_181 or later versions.
    Unreasonable memory configuration in the system

    You are advised to use the System Inspection function to diagnose the FanRuan application and set the memory size to the recommended value.

    For details about system inspection, see System Inspection.

    Insufficient disk space

    Check server disk space and use the Resource Cleanup function to remove unnecessary files.

    For details about resource cleanup, see Resource Cleanup.

    Low memory-mapped file limit

    Use the System Inspection function to diagnose the FanRuan application and set the vm.max_map_count configuration to the recommended value.

    For details about system inspection, see System Inspection.

    Known bugs in the current version

    You are advised to upgrade the FanRuan application to the latest minor version.

    For upgrades of FineOps-deployed projects,  see Extranet-Based O&M Project Upgrade.

    For FineBI updates and upgrades, see FineBI Version Upgrade Overview.

    For FineReport updates and upgrades, see FineReport Upgrade/Update Instruction.

    For FineDataLink updates and upgrades, see Minor Version Upgrade Instruction for V4.2.x.

    Unknown crash causesYou are advised to contact technical support personnel for feedback.
    Application exits due to SSH session closure

    If an application is started in an SSH terminal session, closing that terminal will also terminate the application.

    You are advised to use alternative command-line remote tools, such as SecureCRT, or configure the FineDataLink server to start automatically upon system boot.

    For details, see Automatic Tomcat Startup in Windows Upon System Boot.

    Thread blockage due to slow queries in the data source

    You are advised to improve data-fetching performance by using functions such as Extracted Data CacheSQL Optimization, and Data Preprocessing.

    Thread blockage due to excessive logging

    You are advised to adjust the log output level to reduce log volume or check if the disk is running out of space.

    For details about log levels, see Log Introduction.

    Prolonged system memory release

    1. You are advised to use the System Inspection function to diagnose the FanRuan application. If the on-heap memory size of the current system is unreasonable, set the on-heap memory size to the recommended value.

    For details about system inspection, see System Inspection.

    2. You are advised to use a CPU with higher performance.

    Attachment List


    Theme: Project Management
    Previous
    Next
    • Helpful
    • Not helpful
    • Only read

    滑鼠選中內容,快速回饋問題

    滑鼠選中存在疑惑的內容,即可快速回饋問題,我們將會跟進處理。

    不再提示

    10s後關閉

    Get
    Help
    Online Support
    Professional technical support is provided to quickly help you solve problems.
    Online support is available from 9:00-12:00 and 13:30-17:30 on weekdays.
    Page Feedback
    You can provide suggestions and feedback for the current web page.
    Pre-Sales Consultation
    Business Consultation
    Business: international@fanruan.com
    Support: support@fanruan.com
    Page Feedback
    *Problem Type
    Cannot be empty
    Problem Description
    0/1000
    Cannot be empty

    Submitted successfully

    Network busy