We have running Dynatrace for a couple of months already with a lot of success, however one of the main challenges we face is the managing of alerts; We really have not done any extra configuration other than adding an email address for the delivery of various alerts such as High Overall Failed Transaction Rate and Host Memory Unhealthy/
We get on a daily basis a significant amount of alerts most of which we don't fully understand how the event was triggered and in some cases were we do have some understanding of the event and know its a false alert (i.e machine being rebooted due to maintenance)
Most of the alerts seem to have predefined thresholds/Rule (ie evaluation timeframe) and or use calculated values for setting a threshold that can change over time, We are wondering if anyone knows of a document or source that explain in detail the nature/objective of each alert, how it gets triggered and how it can be adjusted if applicable
Appreciate any guidance, thanks
Answer by Andreas G. ·
Hi. Most of these out-of-the-box Incidents that you get are based on our baselines. There is a good documentation on this - you can start here: Baseline and Smart Alerting Explained
JANUARY 15, 3:00 PM GMT / 10:00 AM ET