About tophercullen

tophercullen · ‎12-05-2018

I've been using the this app fine for many things but appear to have run into a case where it fails catastrophically. I have a alert setup to trigger when there are no results for a given search. When this triggers and the webhook triggers, it results in this error: Obviously, nothing makes it to slack. I've played with the options I can see (such as message formating). As far as I can tell, this issue occurs regardless.

tophercullen · ‎07-13-2018

I understand what you are saying. However, you misunderstand what is going on. The add-on, with what it is doing, tries to be efficient about log stream events by check a log stream attribute to determine if it should recurse into a log stream. Thats smart. However, It can't do anything about the log streams themselves. The api doesn't allow you to filter on the logs streams of a log group in your request. You can only request all the streams in a log group, with optional sort or paginate. If you have too many streams in a group, yes, you will run into potential problems because it can't be smart about that. It can only ever get all the streams in a group. That all said, what i'm suggesting doesn't change any 'efficiency'. You query log groups for their streams and then, instead of "lastEventTimestamp", you check the "lastIngestionTime" to determine if you should recurse. As far as i know, that attribute is a more clear indicator of change. From their API docs "lastEventTime updates on an eventual consistency basis. It typically updates in less than an hour from ingestion, but may take longer in some rare situations."

tophercullen · ‎07-11-2018

Are the log coming from lambda? could be this issue: https://answers.splunk.com/answers/671220/lambda-cloudwatch-logs-often-missing-due-to-edge-c.html?minQuestionBodyLength=80

tophercullen · ‎07-11-2018

If these logs are coming from lambda functions, it won't work. I litreally just made a post about it. https://answers.splunk.com/answers/671220/lambda-cloudwatch-logs-often-missing-due-to-edge-c.html

tophercullen · ‎07-11-2018

My use case: I have several very small lambda functions that run hourly and output ~20 lines each time. I've configured a cloudwatch input for each functions log group with a 600s frequency. These are the only things using cloudwatch api. Problem: While the setup technically 'works', more often than not, only the first log line of each invocation is indexed. The remaining lines are conspicuously missing. Occasionally, it does get all the log lines. Root cause: I looked over the internal logs and I only see normal operations. I think the issue is how the add-on tests for new logs. Each invocation of lambda creates a new log stream. See example my example log stream (some data redacted for privacy) { "firstEventTimestamp": 1531317415254, "lastEventTimestamp": 1531317415254, "creationTime": 1531317406089, "uploadSequenceToken": "Atoken", "logStreamName": "MyLogStream", "lastIngestionTime": 1531317582675, "arn": "someARN", "storedBytes": 0 }, This example is of a log stream that has completed its lambda execution, meaning there are some 20+ lines in there. Yet, we can clearly see that the "EventTimestamp" attributes are identical. Thats odd, lets look at the events. { "nextForwardToken": "sometoken", "events": [ { "ingestionTime": 1531317415315, "timestamp": 1531317415254, "message": "START RequestId somestuff \n" }, { "ingestionTime": 1531317430720, "timestamp": 1531317415255, "message": "Mylogline \n" }, ......... <redacted event list> ......... { "ingestionTime": 1531317597749, "timestamp": 1531317582672, "message": "Final log line \n" }, { "ingestionTime": 1531317597749, "timestamp": 1531317582673, "message": "END RequestId: somestuff \n" }, { "ingestionTime": 1531317597749, "timestamp": 1531317582673, "message": "REPORT RequestId: some more stuff" } ], "nextBackwardToken": "sometoken" } Here can clearly see the each event's timestamp is different, and the last events timestamp does NOT match the "lastEventTimestamp" attribute of the log stream. The "lastEventTimestamp" matches the first event. I believe that the add-on is using the "lastEventTimestamp" to determine if there are more log events to index. This will cause missing logs under certain conditions. In my case, when the lambda executions are still running when it indexes the log stream. Lucky me, since my hourly lambda executions happen on the same interval as the check (*/10), I'm missing nearly all my logs. Best i can tell, it should use the "lastIngestionTime" attribute which actually seems to indicate some sort of change. EDIT: I modified the execution interval several times to no avail. Maybe 1/10 streams are ever indexed properly, making this input pretty useless.

Posts	5
Solutions	0
Karma Given	0
Karma Received	1
Member Since	‎03-19-2015

Online Status	Offline
Date Last Visited	‎06-05-2020 02:04 AM

Slack Webhook v3 fails when triggered with an empt...

Lambda Cloudwatch logs often missing due to edge c...

Slack Webhook v3 fails when triggered with an empt...

Re: Lambda Cloudwatch logs often missing due to ed...

Re: Splunk addon for AWS: Only last message in log...

Re: Splunk addon for AWS: Only last message in log...

Lambda Cloudwatch logs often missing due to edge c...

Join the Conversation

Slack Webhook v3 fails when triggered with an empt...

Lambda Cloudwatch logs often missing due to edge c...

Slack Webhook v3 fails when triggered with an empt...

Re: Lambda Cloudwatch logs often missing due to ed...

Re: Splunk addon for AWS: Only last message in log...

Re: Splunk addon for AWS: Only last message in log...

Lambda Cloudwatch logs often missing due to edge c...