Question by amol k. · Jul 05, 2014 at 07:06 PM

Fine Tuning Incidents

Hi,

I have created BTMs for a few critical transactions in Dynatrace, and I have built a dashboard with incident dashlets so that a transaction is highlighted on the incident dashboard whenever its response time or error rate breaches the threshold.

My problem is that the incidents are set on average response time and on failure-rate percentage, which gives me false alerts. If any single request has a high response time, the average automatically shows a high response time, and if there was only one request in the last minute and that request failed, the failure rate shows 100%. Please suggest how I can avoid such false alerts.

I know that I can put these transactions under the baselining feature of Dynatrace, but that will not give me separate response time and failure rate alerts for each BTM, which is what I want to highlight on the dashboard.

Also, while reading the documentation I found a sensitivity field for incidents that was available prior to DT 4. Is there any way to enable that field so that I can check whether it meets my requirement?

Please suggest.

 

Regards,

Amol Khawre  
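To make the two effects described in the question concrete, here is a minimal Python sketch. It is plain arithmetic, not Dynatrace configuration; the function name and the sample values are made up for illustration.

```python
from statistics import mean


def failure_rate(errors: int, total: int) -> float:
    """Percentage of failed requests; 0.0 when there was no traffic."""
    return 100.0 * errors / total if total else 0.0


# A single failed request in an otherwise idle minute reads as 100 %.
single_request_rate = failure_rate(errors=1, total=1)

# One slow outlier among otherwise fast requests drags the average up.
response_times_ms = [200, 210, 190, 205, 5000]
avg_ms = mean(response_times_ms)

print(single_request_rate, avg_ms)
```

With these hypothetical numbers, four of the five requests are close to 200 ms, yet the average lands well above 1000 ms, and a lone failure is indistinguishable from a total outage when only the percentage is considered.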


8 Replies


Answer by amol k. · Sep 11, 2014 at 11:21 PM

Hi,

To handle this scenario I created a BTM and added the web request count measure to it. However, with this measure in the BTM I no longer get threshold-breached incidents, even when the thresholds are violated.

Please see the attachment for details.

Please suggest.

IncidentsDetails.zip

Amol.


Answer by Andreas G. · Jul 28, 2014 at 06:24 PM

Your incident uses two different conditions connected with OR. Is it possible that the other measure exceeded its threshold and therefore triggered the incident? Please have a look at all the measures that you use in your incident definition.

As for percentiles: no, this is currently not supported.
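The effect of OR-connected conditions can be sketched in a few lines of Python. This models only the boolean logic; the `Condition` class, the `incident_fires` function, and the values are hypothetical and do not represent the product's internal evaluation. The AND case is included purely for contrast.

```python
from dataclasses import dataclass


@dataclass
class Condition:
    """One measure compared against an upper threshold (hypothetical model)."""
    name: str
    value: float
    threshold: float

    def breached(self) -> bool:
        return self.value > self.threshold


def incident_fires(conditions, logic="OR"):
    """OR fires when any condition is breached, AND only when all are."""
    results = [c.breached() for c in conditions]
    return any(results) if logic == "OR" else all(results)


conditions = [
    Condition("avg response time (ms)", value=800.0, threshold=2000.0),
    Condition("failure rate (%)", value=100.0, threshold=5.0),
]

fires_with_or = incident_fires(conditions, "OR")    # failure rate alone is enough
fires_with_and = incident_fires(conditions, "AND")  # response time is still fine

print(fires_with_or, fires_with_and)
```

In this sketch the response time is well under its threshold, but the incident still fires under OR because the failure-rate condition is breached on its own.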


Answer by amol k. · Jul 26, 2014 at 10:49 PM

Hi,

Please suggest whether we can set alerts on a percentile of response time, as the average is giving false alerts.

Amol.


Answer by amol k. · Jul 24, 2014 at 10:37 PM

Hi Andi,

Thanks for explaining the logic. Please find attached the details you asked for, and help me understand this issue.

IncidentsDetails.zip

Amol khawre


Answer by amol k. · Jul 23, 2014 at 09:48 PM


Hi Andi,

Thanks for the reply, but I have already shared the incident configuration details in the previous attachment, where I highlighted that I have used the Avg aggregation in the incidents. It is still not working for me.

Please suggest.

Amol

 

Comment by Andreas G. · Jul 23, 2014 at 09:54 PM

Can you create another chart, but this time use a 10s aggregation? I am interested in the individual data points. The reason this is interesting is that incidents are evaluated by constantly looking at the full time window of your evaluation timeframe. It's like a "rolling window". So it is possible that a very high failure rate will push the Avg. number across your threshold if you analyze the right 5-minute time window.

Andi
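The rolling-window effect described above can be illustrated with a small Python sketch. It assumes 10-second data points, a 5-minute window (30 points), and a simple unweighted average of the points; the data, the 30% threshold, and the function names are invented for illustration and are not taken from the product.

```python
def fixed_bucket_averages(points, size):
    """Averages over non-overlapping buckets, as a coarse chart would show."""
    return [sum(points[i:i + size]) / size
            for i in range(0, len(points) - size + 1, size)]


def rolling_averages(points, size):
    """Averages over every window of `size` consecutive points."""
    return [sum(points[i:i + size]) / size
            for i in range(len(points) - size + 1)]


WINDOW = 30        # 5 minutes of 10-second data points
THRESHOLD = 30.0   # hypothetical failure-rate threshold in percent

# Ten minutes of failure-rate samples: a two-minute burst of 100 %
# that straddles the boundary between the two 5-minute buckets.
failure_rate = [0.0] * 60
for i in range(24, 36):
    failure_rate[i] = 100.0

charted = fixed_bucket_averages(failure_rate, WINDOW)
rolling = rolling_averages(failure_rate, WINDOW)

print(max(charted), max(rolling))
```

Each fixed 5-minute bucket contains only half of the burst and stays below the threshold, while a rolling window positioned over the whole burst exceeds it. This is why a chart with coarse aggregation can look fine even though an incident was raised.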

avatar image

Answer by amol k. · Jul 23, 2014 at 04:32 PM

Please suggest.

Amol Khawre

 


 

Comment by Andreas G. · Jul 23, 2014 at 09:28 PM

Hi Amol

Is it possible that you chart e.g. "Average %" but in the incident you chose something like "Max"?

Remember that in both charting and incidents you have the option to specify which "Aggregation" dynaTrace should use to evaluate a measure. We can do min, max, avg, count,

So please double-check the "Aggregation" setting in your incidents to align it with what you see in the chart.

avatar image

Answer by amol k. · Jul 12, 2014 at 06:27 PM

Hi Richard,

I have changed the config as you suggested, but it is still not working for me. Please find the attached Excel file for details, and help me understand whether there is any gap in my understanding of incidents.

 

IncidentFineTuning.xlsx

Regards,

Amol Khawre


Answer by amol k. · Jul 08, 2014 at 12:00 AM

Please help.

Amol

Comment by Rick B. · Jul 08, 2014 at 12:50 AM

Hi Amol,

Fine-tuning incidents comes down to balancing the condition measure aggregation with the evaluation timeframe.

For instance, if you want to soften your incident to reduce false positives, you may want to extend your existing "average" condition over a timeframe of 5 minutes, so that there is certainly a problem before the alert is thrown. Likewise, if you have a lot of traffic and some of it is usually slow, but you want to track slowness across the board, you may want to consider a "minimum" aggregation over a timeframe of 1 minute to be sure that all requests were slow.

Rick B
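The trade-off between aggregation and timeframe can be sketched in Python as follows. The sample response times, the 1000 ms threshold, and the `breaches` helper are hypothetical and stand in for whatever the incident rule actually evaluates.

```python
AGGREGATIONS = {
    "min": min,
    "max": max,
    "avg": lambda values: sum(values) / len(values),
}


def breaches(values, aggregation, threshold):
    """True when the aggregated value of the timeframe exceeds the threshold."""
    return AGGREGATIONS[aggregation](values) > threshold


THRESHOLD_MS = 1000

# Response times (ms) in a timeframe containing one slow outlier.
one_spike = [4000, 300, 280, 310, 290, 305]
# Response times (ms) in a timeframe where every request was slow.
all_slow = [1500, 1800, 1200, 2500, 1700, 1600]

short_avg = breaches(one_spike[:1], "avg", THRESHOLD_MS)  # short timeframe: spike dominates
long_avg = breaches(one_spike, "avg", THRESHOLD_MS)       # longer timeframe: spike is diluted
spike_min = breaches(one_spike, "min", THRESHOLD_MS)      # fastest request was fine
slow_min = breaches(all_slow, "min", THRESHOLD_MS)        # even the fastest request was slow

print(short_avg, long_avg, spike_min, slow_min)
```

With these numbers, the average over the longer timeframe absorbs the single outlier, and the minimum only breaches when every request in the timeframe was slow.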

Comment by amol k. (in reply to Rick B.) · Jul 08, 2014 at 01:08 AM

Thanks Richard, I will check and update.
