For Internal AppDynamics Audiences
If the Events Service is dropping events, determine why:
Events Service is losing events... | Then... |
Essentially all the time | T-shirt size chosen is clearly too small |
During peak load times | Determine whether or not losing some events is tolerable |
Sometimes the analytical value of the aggregate of events matters more than any single event. In these cases, dropping some events may be fine.
This sort of reasoning applies to deficiencies in any KPI or criterion.
The answer is not always to scale up—you may discover that you are over-provisioned. In that case, you can scale down.
Sizing must be an iterative process. The sizing that you come up with initially might be right over the longer term, or it might not. Try to get a sense of how similarly or differently traffic is behaving as time goes on.
Bear in mind that the sizing estimates you obtain from this series of articles are based on testing one particular set of infrastructure—namely, EC2 instances—which may differ in many ways from the infrastructure found in on-prem deployments.
In the field, you may encounter virtual machines or bare metal—each of which may behave differently even if the specs are superficially similar. For example, a given deployment might be on AWS while another might be on GCP—and different clouds behave differently.