Introduction
Objective: configure the monitoring of the ServiceNav monitoring solution by itself.
This document is intended for administrators of the ServiceNav solution.
It describes the checkpoints to be deployed on the ServiceNav solution devices to ensure that they are working properly.
For security reasons, some checkpoints can only be implemented inside the ServiceNav VPN network.
This document describes the control points to be implemented for each equipment of the solution. For each equipment, it is described :
- What equipment oversees it
- On which network: VPN, Internet
- Name of the checkpoint
- Checkpoint arguments
- Inspection point frequency
- Criticality of the control point
- If a notification policy is to be implemented on a mandatory basis: the nature of the notification policy is the responsibility of the solution administrator.
Prerequisites
Some control points are to be positioned on each supervision box so that they supervise themselves.
On the other hand, it is necessary to have a supervision box that supervises all the other components of the ServiceNav solution. This supervision box will be called in this document : the supervisor.
In order to ensure that the supervisor is itself operational, it must be cross-supervised by another supervision box. This second supervision box will be called sentinel in the rest of the document.
The ServiceNav solution therefore requires at least two supervision boxes in order to be supervised.
It is recommended to locate the sentry on a different infrastructure from the rest of the ServiceNav solution.
The following diagram shows a typical implementation.
ServiceNav Box
The following checkpoints are to be implemented for all supervision units.
From | Model | Arguments | Control interval (min) | Notif. obligatory. |
Create/record a device with IP address: 127.0.0.1 on the "coservit" community | ||||
She herself | CPU | Model Ident. | Model Ident. | |
She herself | LIN-DiskIO | Model Ident. | Model Ident. | |
She herself | LIN-Diskspace | Model Ident. | Model Ident. | X |
She herself | LIN-Network_traffic | Model Ident. | Model Ident. | |
She herself | LIN-RAM | Model Ident. | Model Ident. | |
She herself | LIN-Swap | Model Ident. | Model Ident. | |
She herself | check_vsb_remote_health | Model Ident. | Model Ident. | X |
Create/record a device with IP address = ServiceNav VPN network on the "coservit" community | ||||
Box | LIN-Proc-CPU-Usage | process :nscaapache2nagiosremoteOperationBoxalert threshold : 70 critical threshold : 80 | 5 | X |
Box | LIN-Proc-RAM-Usage | process :nscaapache2nagiosremoteOperationBoxalert threshold : 70 critical threshold : 80 | 10 | X |
Box | Lin-Process-SWAP | process :nscaapache2nagiosremoteOperationBoxalert threshold : 10 critical threshold : 20 | 15 | X |
Box | Flax-Process-Nb-Byname | process : apache2 alert threshold : 20 critical threshold : 80 | 5 | X |
Box | Flax-Process-Nb-Byname | process : nagios alert threshold : 50 critical threshold : 100 | 5 | X |
Box | VSBox-Live-Status | Name of the box defined on the website | 15 | X |
ServiceNav Shared Box
Implement the checkpoints defined in the previous chapter.
In addition, add the following checkpoints:
From | Model | Arguments | Control interval (min) | Notif. obligatory. |
Create/record a device with IP address = ServiceNav VPN network on the "coservit" community | ||||
Box | TCP-Port | 5667 | 5 | X |
If the supervisor also performs the role of a shared ServiceNav Box, the checkpoint must be created on the sentinel.
ServiceNav Box - supervisor
From | Model | Arguments | Control interval (min) | Notif. obligatory. |
Create/record a device with IP address: 127.0.0.1 on the "coservit" community | ||||
She herself | CPU | Model Ident. | Model Ident. | |
She herself | LIN-DiskIO | Model Ident. | Model Ident. | |
She herself | LIN-Diskspace | Model Ident. | Model Ident. | X |
She herself | LIN-Network_traffic | Model Ident. | Model Ident. | |
She herself | LIN-RAM | Model Ident. | Model Ident. | |
She herself | LIN-Swap | Model Ident. | Model Ident. | |
She herself | check_vsb_remote_health | Model Ident. | Model Ident. | X |
Create/record a device with IP address = ServiceNav VPN network on the "coservit" community | ||||
Sentinel | LIN-Proc-CPU-Usage | process :nscaapache2nagiosremoteOperationBoxalert threshold : 70 critical threshold : 80 | 5 | X |
Sentinel | LIN-Proc-RAM-Usage | process :nscaapache2nagiosremoteOperationBoxalert threshold : 70 critical threshold : 80 | 10 | X |
Sentinel | Lin-Process-SWAP | process :nscaapache2nagiosremoteOperationBoxalert threshold : 10 critical threshold : 20 | 15 | X |
Sentinel | Flax-Process-Nb-Byname | process : apache2 alert threshold : 20 critical threshold : 80 | 5 | X |
Sentinel | Flax-Process-Nb-Byname | process : nagios alert threshold : 50 critical threshold : 100 | 5 | X |
Sentinel | VSBox-Live-Status | Name of the box defined on the website | 15 | X |
ServiceNav Box - sentry
From | Model | Arguments | Control interval (min) | Notif. obligatory. |
Create/record a device with IP address: 127.0.0.1 on the "coservit" community | ||||
She herself | CPU | Model Ident. | Model Ident. | |
She herself | LIN-DiskIO | Model Ident. | Model Ident. | |
She herself | LIN-Diskspace | Model Ident. | Model Ident. | X |
She herself | LIN-Network_traffic | Model Ident. | Model Ident. | |
She herself | LIN-RAM | Model Ident. | Model Ident. | |
She herself | LIN-Swap | Model Ident. | Model Ident. | |
She herself | check_vsb_remote_health | Model Ident. | Model Ident. | X |
Create/record a device with IP address = ServiceNav VPN network on the "coservit" community | ||||
Box | LIN-Proc-CPU-Usage | process :nscaapache2nagiosremoteOperationBoxalert threshold : 70 critical threshold : 80 | 5 | X |
Box | LIN-Proc-RAM-Usage | process :nscaapache2nagiosremoteOperationBoxalert threshold : 70 critical threshold : 80 | 10 | X |
Box | Lin-Process-SWAP | process :nscaapache2nagiosremoteOperationBoxalert threshold : 10 critical threshold : 20 | 15 | X |
Box | Flax-Process-Nb-Byname | process : apache2 alert threshold : 20 critical threshold : 80 | 5 | X |
Box | Flax-Process-Nb-Byname | process : nagios alert threshold : 50 critical threshold : 100 | 5 | X |
Sentinel | VSBox-Live-Status | Name of the box defined on the website | 15 | X |
ServiceNav - Website
From | Model | Arguments | Control interval (min) | Notif. obligatory. |
Create/record a device with IP address: the one of the ServiceNav VPN network or local IP on the "coservit" community | ||||
Box | CPU | Model Ident. | Model Ident. | |
Box | LIN-DiskIO | Model Ident. | Model Ident. | |
Box | LIN-Diskspace | Model Ident. | Model Ident. | X |
Box | LIN-Network_traffic | Model Ident. | Model Ident. | |
Box | LIN-RAM | Model Ident. | Model Ident. | |
Box | LIN-Swap | Model Ident. | Model Ident. | |
Box | LIN-Proc-CPU-Usage | process :rsyncsshdcronexim4openvpnsnmpdalert threshold : 10 critical threshold : 20 | 5 | X |
Box | LIN-Proc-CPU-Usage | process : mysqldapache2alert threshold : 70 critical threshold : 90 | 5 | X |
Box | LIN-Proc-CPU-Usage | process :ODS_PerfData,ODS_StatusDataVS_UpdateBoxProviderVSB_InitializationVS_CommandProcessingVS_ExternalSynchroVS_ITDiscoveryVS_ITInventoryalert threshold : 50 critical threshold : 70 | 5 | X |
Box | LIN-Proc-CPU-Usage | process : ndo2db alert threshold : 70 critical threshold : 90 | 5 | X |
Box | LIN-Proc-CPU-Usage | process : beam.smpepmdinet_gethostalert threshold : 15 critical threshold : 50 | 5 | X |
Box | LIN-Proc-RAM-Usage | process : rsyncsshdcronexim4openvpnsnmpdalert threshold : 5 critical threshold : 10 | 10 | X |
Box | LIN-Proc-RAM-Usage | process :ODS_PerfDataODS_StatusDataVS_UpdateBoxProviderVSB_InitializationVS_CommandProcessingVS_ExternalSynchroVS_ITDiscoveryVS_ITInventoryalert threshold : 50 critical threshold : 70 | 10 | X |
Box | LIN-Proc-RAM-Usage | process : apache2alert threshold : 5 scritical eye : 10 | 5 | X |
Box | LIN-Proc-RAM-Usage | process : mysqldalert threshold : 50 critical threshold : 70 | 5 | X |
Box | LIN-Proc-RAM-Usage | process : ndo2dbalert threshold : 50 critical threshold : 70 | 10 | X |
Box | LIN-Proc-RAM-Usage | process : beam.smpepmdinet_gethostalert threshold : 10 critical threshold : 20 | 10 | X |
Box | Lin-Process-Swap | process : rsyncsshdcronexim4openvpnsnmpdalert threshold : 5 critical threshold : 10 | 15 | X |
Box | Lin-Process-Swap | process : mysqld,apache2alert threshold : 10 critical threshold : 20 | 15 | X |
Box | Lin-Process-Swap | process :ODS_PerfDataODS_StatusDataVS_UpdateBoxProviderVSB_InitializationVS_CommandProcessingVS_ExternalSynchroVS_ITDiscoveryVS_ITInventoryalert threshold : 50 critical threshold : 70 | 15 | X |
Box | Lin-Process- Swap | process : ndo2db alert threshold : 10 critical threshold : 20 | 10 | X |
Box | Lin-Process-Swap | process : beam.smpepmdinet_gethostalert threshold : 5 critical threshold : 10 | 15 | X |
Box | Flax-Process-Nb-Byname | process : apache2 alert threshold : 70 critical threshold : 100 | 5 | X |
Box | Flax-Process-Nb-Byname | process : ndo2db alert threshold : 1000 critical threshold : 1500 | 5 | X |
Box | TCP-Port | Port: 80Threshold alert : 2 Critical Threshold : 4 | 1 | X |
Box | TCP-Port | Port: 443Threshold alert : 2 Critical Threshold : 4 | 1 | X |
Box | TCP-Port | Port : 9465Threshold alert : 2 Critical Threshold : 4 | 1 | X |
Box | check_vsp_process_health | User name : supervision Password : supervision Vhost : %2f Process : ODS_PerfData Message ready" thresholds : 10:50 Message processing thresholds : 2:5 IDLE time thresholds : 300:900 | 10 | X |
Box | check_vsp_process_health | User name : supervision Password : supervision Vhost : %2f Process ODS_StatusData Message ready" thresholds : 10:50 Message processing thresholds : 2:5 IDLE time thresholds : 300:900 | 10 | X |
Box | check_vsp_process_health | User name : supervision Password : supervision Vhost : %2f Process : VSB_Initialization Message ready" thresholds : 2:5 Message processing thresholds : 2:5 IDLE time thresholds : 300:900 | 10 | X |
Box | check_vsp_process_health | User name : supervision Password : supervision Vhost : %2f Process : VS_CommandProcessing Message ready" thresholds : 2:5 Message processing thresholds : 2:5 IDLE time thresholds : 300:900 | 10 | X |
Box | check_vsp_process_health | User name : supervision Password : supervision Vhost : %2f Process : VS_ITDiscovery Message ready" thresholds : 10:20 Message processing thresholds : 2:5 IDLE time thresholds : 300:900 | 10 | X |
Box | check_vsp_process_health | User name : supervision Password : supervision Vhost : %2f Process : VS_ITInventory Message ready" thresholds : 10:20 Message processing thresholds : 2:5 IDLE time thresholds : 300:900 | 10 | X |
Box | check_vsp_process_health | User name : supervision Password : supervision Vhost : %2f Process : VS_ExternalSynchro Message ready" thresholds : 10:20 Message processing thresholds : 2:5 IDLE time thresholds : 300:900 | 10 | X |
Box | Check_Aliveness_RabbitMQ | Same as model | 2 | X |
Create/record a device with IP address: public IP address on the "coservit" community | ||||
Sentinel | TCP-Port | Port: 80Threshold alert : 2 Critical Threshold : 4 | 1 | X |
Sentinel | TCP-Port | Port: 443Threshold alert : 2 Critical Threshold : 4 | 1 | X |
ServiceNav - Business Intelligence
From | Model | Arguments | Control interval (min) | Notif. obligatory. |
Create/record a device with IP address: the one of the ServiceNav VPN network or local IP on the "coservit" community | ||||
Box | CPU | Model Ident. | Model Ident. | |
Box | LIN-DiskIO | Model Ident. | Model Ident. | |
Box | LIN-Diskspace | Model Ident. | Model Ident. | X |
Box | LIN-Network_traffic | Model Ident. | Model Ident. | |
Box | LIN-RAM | Model Ident. | Model Ident. | |
Box | LIN-Swap | Model Ident. | Model Ident. | |
Box | Lin-Process-CPU | process :rsyncsshdcronexim4openvpnsnmpdalert threshold : 10 critical threshold : 20 | 5 | X |
Box | LIN-Proc-CPU-Usage | process : mysqldalert threshold : 70critical threshold : 90 | 5 | X |
Box | LIN-Proc-CPU-Usage | process : javaalert threshold : 70critical threshold : 90timeperiod : 00:00 - 08:00 | 5 | |
Box | LIN-Proc-RAM-Usage | process : rsyncsshdcronexim4openvpnsnmpdalert threshold : 5 critical threshold : 10 | 10 | X |
Box | LIN-Proc-RAM-Usage | process : mysqldalert threshold : 50 critical threshold : 70 | 10 | X |
Box | LIN-Proc-RAM-Usage | process : javaalert threshold : 50 critical threshold : 70 timeperiod : 00:00 - 08:00 | 5 | |
Box | Lin-Process-Swap | process : rsyncsshdcronexim4openvpnsnmpdalert threshold : 5 critical threshold : 10 | 15 | X |
Box | Lin-Process-Swap | process : mysqldalert threshold : 10 critical threshold : 20 | 15 | X |
Box | Lin-Process- Swap | process : javaalert threshold : 10 critical threshold : 20timeperiod : 00:00 - 08:00 | 5 | |
Box | VS_VBI_check_dw_vs_param: kpi_performance_scheduler | User name : supervision Password: Process name kpi_performance_scheduler_status Expected status(es) : SUCCESS Time to expiry of status: 1440 | 720 | X |
Box | VS_VBI_check_dw_vs_param: kpi_status_scheduler | User name : supervision Password: Process name : kpi_status_scheduler_status Expected status(es) : SUCCESS Time to expiry of status: 1440 | 720 | X |
Box | VS_VBI_check_dw_vs_param : inventory_quantity_synchronizer | User name : supervision Password: Process name inventory_quantity_synchronizer_status Expected status(es) : SUCCESS Time to expiry of status: 1440 | 720 | X |