Hi,
I am checking the availability of the individual nodes, if app server(tomcat) goes down needs to trigger the health rule violation as critical and send an email but it is not working. Please help me. I have attached my configuration.
For Your Information : AppDynamics Version 3.9.6.0
AppServerAgent Version 3.9.6.0
Using AppDynamics Trial Version of SAAS.
Hi,
We have known bug for similar case for agent app availability metric but that is already fixed in 3.9.6 ui version and your ui is in fixed version, Can you please file help ticket at https://help.appdynamics.com/tickets/new Or send email to help@appdynamics.com (we assume you have permissions to log tickets as i could not locate this account in portal) with machine agent logs collected in debug mode from machine agent node reporting to "Rad_Demo2_Node3" to speed up the process
We confirm we could see the issue from your saas UI for affected metrics and yes the rule should have triggered as you stated.
Team would file bug based on analysis of splunk logs and machine agent logs, this helps us to better track bug if any we file on help ticket , hope that is fine with you
Regards,
Arun
Hi,
It was worked the day before with same configuration. Please refer the attached screen shot for the same.
Thanks,
Logu
Health Rule always shows green color while app server agent not available (Red). Please refer the attached image.
Please give me any suggestion or idea on locating the problem which is really helpful for me. Thanks in advance.
Hi,
From screenshot http://community.appdynamics.com/appdynamics/attachments/appdynamics/appdynamics-discussions/2460/1/... looks like the alert triggered recent time as we could see on going status duration as 1 min. We do have bug around the availability metric but you are using controller UI version 3.9.6 which already fix around this.
Can you please confirm the availability metric in metirc browser and screenshot from policy violations screen for latest view if there is any disconnect in latest alert status.
Regards,
Arun
Hi,
I don't know what is going on health rule evaluation. Please refer the attached screenshot and start time. It is wrongly getting executed periodically. It seems my saas controller getting corrupted ? is there any workaround or reset ? . Kindly, help me. I don't have any idea.
Please AppDynamics Team help me.
Hi,
Can you please provide the saas account name / controller UI URL details? so that we will login from our portal user and check the behavior in UI for more clarity.
Regards,
Arun
Hi Arun,
I've made some changes in the health rule configuration after that the old event violation state changed as Cancelled Please refer the below event which causes the problem. Now it is working fine.
My Controller : https://rad.saas.appdynamics.com/controller
user name/account name : RAD
Hi,
Glad to hear that news from you, Can you let us know that changes you did, we suspect either resaving or updating metric path in condition could have helped? please let us know the changes did
Regards,
Arun
Hi,
I have added new health rule but still having the problem in execution of health rule violation. I've verified with metric browser and it is showing correctly. is there any bug ? or is there any solution to fix ?
Please refer my controller : https://rad.saas.appdynamics.com/controller
account/user name : RAD
Hi,
Somehow we could not locate your saas accunt login details from our portal , Can you please create a test user login and password for us (user with read only permisions should be enough) and share and also send the screenshot from controller UI depicting the issue to assist you better.
Regards,
Arun
Hi,
The problems :
1) Custom Apache Server availability violation not executing. but it is showing the value correctly in metric browser.Please check the below rest uri.
2) After violation executed we have configured two actions 1) email and 2) api call (remediation script). but occationally api called and most of the time it is not getting executed.
Hi,
- We see data is up till 12/22/2014 2:07 pm , started reporting back at 12/22/2014 2:14 pm and is down from 12/22/2014 2:23 pm and is up from 12/22/2014 2:40 pm to current time
- We see the rule have failed to trigger because of the data to looks i set to 1ast 1 minute and evaluation frequency is 30 mts It could be case that when rule evaluated the the value is not zero , say it evaluated at 2:06 pm or 2:42pm today than
the health rule will not violate , Hope that make sense to you
refer screenshots listed below depicting the same
- Regarding other issue, we understood that you are referring to "Demo2 App Server Down Policy" policy we see only on action and is defined on action "Create App Server Down Alarm" with CreateAlarmDownAPI.py script pointed and we do not see any email digests on this, Please send the machine agent logs and screenshot of the rule with actions you are referring to assist you further on this
Regards,
Arun
Hi,
Thanks for the explanation. I'll make the evaluation frequency to 5min to verify. If I've any issues further will send you the necessary logs.
I have added apache monitor extension and it is working fine but the rule is not executed. I've verified in metric browser and it is showing the value correctly. Please verify this one.
Thanks.
Hi,
We see only two rules "Demo2 App Server Down Health Rule" and "Demo1 App Server Down Health Rule" on Apache custom metrics and both looks to be updated just now to evaluate every 5 minutes, can you point us the rule you are referring in UI?
Regards,
Arun
Hi,
for last 5min, I am getting the value as 0 while executing below the rest but health rule not working.
Hi,
I am refering Demo2 App Server Down Health Rule.
Please see the below screenshot :
Hi,
We have known bug for similar case for agent app availability metric but that is already fixed in 3.9.6 ui version and your ui is in fixed version, Can you please file help ticket at https://help.appdynamics.com/tickets/new Or send email to help@appdynamics.com (we assume you have permissions to log tickets as i could not locate this account in portal) with machine agent logs collected in debug mode from machine agent node reporting to "Rad_Demo2_Node3" to speed up the process
We confirm we could see the issue from your saas UI for affected metrics and yes the rule should have triggered as you stated.
Team would file bug based on analysis of splunk logs and machine agent logs, this helps us to better track bug if any we file on help ticket , hope that is fine with you
Regards,
Arun
Hi,
Thanks for your kindly support and I'll email the problem to help@appdynamics.com. I don't have permission for help center.
Hopefully, if it is the problem, we should have the solution too. Thanks again.
Thanks,
Loganathan.
Hi Loganathan,
I just searched your name instead searchng account from portal and if i am not wrong loganathan.ideas2it@gmail.com is your email id and you are using self service trial account, yes we just see now you do not have permissions from portal
Unfortunately you would need to upgrade Pro trial to file help tickets, i will try to reproduce similar issue in local and will file an internal request and will keep you posted any updates on internal bug if any , Hope that is fine with you for now.
We appreciate your cooperation with us on this.
Regards,
Arun
Hi Arun,
If I understood correctly, are you going to file the bug after reproducing in your local ?
Unfortunately my trial version going to expire soon, so please post me if you get any progress on this one and I am waiting and keep watching this post.
Thanks,
Loganathan.
Hi Loganathan,
Yes i will try to reproduce the issue in local and will bug accordingly and will update in this post any progress on the replication or bug.
Regards,
Arun