Hello,
I'm using the Splunk Add-on for Unix and Linux, a 6.4.x Universal Forwarder as the forwarder, and Splunk Enterprise 6.5 as the indexer.
I found the search results of index=os
are duplicated a lot, so I investigated the detail:
- the "os" rawdata which is named "journal.gz" includes duplicated all fields data.
- the number of duplicated data increased as time proceeds.
- if I change the index name from "os", the data are not duplicated.
- if forward stand-alone Splunk Enterprise 6.4 which is configured same as above, the data are not duplicated.
This issue occurs to only for the "os" index, so I'm guessing that the cause of duplication exists at the indexing process using the *nix add-on, but I don't have any idea how to solve this problem, and I would not like to solve with a search statement (like dedup
command).
Please kindly tell me any idea to solve?
Thank you,
I solved this problem.
I changed useACK
configuration to false, then that duplication stopped.
I know the potential of duplication,
but in this case, index file is not double and one raw data includes same data over multiline.
I did not understand why the duplication occurred and I hope that duplication stopped with useACK=true
...
I solved this problem.
I changed useACK
configuration to false, then that duplication stopped.
I know the potential of duplication,
but in this case, index file is not double and one raw data includes same data over multiline.
I did not understand why the duplication occurred and I hope that duplication stopped with useACK=true
...
Sorry, there are some misunderstandings and new findings in my question:
My guessing is below:
If my understanding is correct, I should configure the forwarder outputs.conf not to send duplicated data while indexer is stopped. But I don't understand why "_internal" index is not duplicated.
Could you validate my understanding and tell me how to configure Splunk?
Thank you,