ServiceNav artificial intelligence boosted by experts from the Grenoble IA Institute
MIAI Grenoble Alpes is the multidisciplinary institute in Artificial Intelligence, based in Grenoble. This center of expertise is headed by Eric Gaussier, formerly head of the Grenoble Computer Science Laboratory (LIG).
In 2014, within the framework of a governmental project "Fond Unique Interministériel" - a program allowing to finance R&D projects for a short or medium term market launch by associating the skills of large companies, SMEs and laboratories, Coservit and the LIG have gotten closer and started a succession of common projects: in 6 years, 4 Artificial Intelligence topics have been treated, involving about 20 experts and 10 publications of articles in scientific journals!
We reveal to you the stakes of these collaborative projects of high expertise...
Rachid Mokhtari, R&D Director at Coservit: "As ServiceNav is a SaaS solution, the volume of objects monitored is not controlled by Coservit: the platform must therefore be scalable, robust, automated and autonomous for a minimum cost of maintenance and time spent in order to ensure the availability rate contracted with customers (99.75%). This is also the case for OnPremise platforms.
Today the elements supervised are essentially computer equipment (servers, switches, firewalls, routers, IP cameras...). Tomorrow it will be a question of supervising connected objects in large numbers! We must be ready for these future challenges. »
Beyond the scalability of ServiceNav, the research work with the LIG has aimed to add Artificial Intelligence to supervision in order to meet the following uses:
- Reduction of false positives and false negatives
The market reports a number of false positives ranging from 30 to 80% for a conventional monitoring solution.
For example: for an average of 50 incidents per day, 25 of them are really not incidents. This is due to incorrectly configured thresholds: either too high or too low.
This means that operators spend half of their time on alerts that are not incidents or that are not a priority: time spent on unnecessary actions, loss of confidence in the supervision solution ...
- Incident prediction
When a critical problem occurs, the operating crews are in "firefighter" mode, focused on that emergency. Whether during the day or during on-call duty, the stress generated is enormous and the pressure from management or users is complicated to manage.
Predicting incidents with enough anticipation is therefore a solution for an organized and serene way of working, allowing at the same time to obtain a better quality of service and therefore higher business application availability rates.
- Identify the source of a problem as quickly as possible - Root Cause Analysis
Getting to the source of a complex problem takes time. The objective is to reduce this time to a minimum through analysis and reasoning assisted by Artificial Intelligence.
Eric: "The LIG is one of the most important computer laboratories in France, working in particular on distributed computing and the processing of large volumes of data.
LIG research teams are interested in real production data to test and prove their algorithms. Coservit with its ServiceNav SaaS monitoring platform and its real-time flows of millions of data was therefore the right candidate. »
As Eric explains, "the new issues are focused on causal relationships and not just on correlations between objects. This approach is new, difficult and little addressed at the LIG level. ServiceNav's "root cause analysis" project has therefore enabled the LIG to make progress in the vast field of causality with real use cases. »
- Scalability and robustness of the monitoring platform
As a result of these 4 years of research, Coservit now has a strong Big Data competence. Few publishers on the market have successfully made the switch to these complex technologies.
- Increase availability rates of critical applications
By injecting algorithms into this "big engine" that is Big Data, the forecasting of alerts in the near future (+2h, +15h...) or more distant future (+3 months, +15 months...) allows to save uptime, to reduce the pressure on the operating teams both in working hours but also and especially during on-call periods.
Rachid: "Working on causality simplifies the analysis of complex systems by focusing on the essential root cause alerts..."
- Improve the RoI of monitoring systems through operator efficiency and automation
As Rachid points out, "ServiceNav is now able to reduce noise, i.e. reduce false positives without generating false negatives. Automatic threshold adjustment is the key to achieving up to 70% of noise reduction without changing the current paradigm and organization of operators. Our recommendation system fits with current user processes with little change management.
We are now working on behavioural analysis to automatically detect abnormal system changes over a large data set. »