My team has duplicate events in our index (~600 GB). We have fixed the source of the duplicates and now need to remove the existing duplicates from the index.
What are the best practices for managing duplicates over a large index? So far we've explored two options:

- Create a summary index with duplicates removed. It's a large compute load to run this deduplication job and populate a new index all at once; how can we do this efficiently and prevent our job from auto-cancelling (see the first sketch below)?
  - We would also like to be able to update the new index from the one containing duplicates on ingest. Are there best practices for doing this reliably (second sketch below)?
- Delete duplicate events from the current index. This is less attractive due to the permanent deletion (third sketch below).
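For the summary-index route (assuming Splunk, since "summary index" is Splunk terminology), the usual way to avoid one monster job is to backfill in small time windows, e.g. a day or an hour at a time, so no single search runs long enough to hit search quotas or get auto-cancelled. A minimal sketch; index=main, the dedup key _raw, and the target index main_dedup are illustrative assumptions:

    index=main earliest=-90d@d latest=-89d@d  ``` one-day backfill window ```
    | dedup _raw                              ``` keep one copy of each identical raw event ```
    | collect index=main_dedup                ``` write the survivors to the new index ```

Splunk ships a backfill script ($SPLUNK_HOME/bin/fill_summary_index.py) that can drive a saved search like this across a date range in exactly these chunks. If your duplicates are not byte-identical, dedup on the fields that define "the same event" rather than _raw. Note that events written via collect arrive with sourcetype=stash by default and typically do not count against license.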
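For keeping the new index current on ingest, a common pattern is a scheduled saved search that dedups a recent, non-overlapping window and collects it into the new index. A sketch assuming a 5-minute schedule with a 10-minute lag for late-arriving events (again, the names and window sizes are illustrative):

    index=main earliest=-15m@m latest=-10m@m  ``` fixed 5-minute slice, lagged 10 minutes ```
    | dedup _raw
    | collect index=main_dedup

Reliability here comes from the windows tiling exactly (no gaps, no overlap) and the lag being longer than your worst-case indexing delay; anything that arrives later than the lag will be missed, so measure indexing latency before picking the value.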
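If you do decide to delete in place, be aware that Splunk's delete command only masks events from search; it does not reclaim disk space. A community-circulated sketch that keeps the first copy of each event and deletes the rest (requires the can_delete role; run it chunked by time, since streamstats over ~600 GB in a single search is itself a heavy job):

    index=main
    | streamstats count AS dupcount BY _raw   ``` number each repeat of an identical event ```
    | search dupcount > 1                     ``` keep only the 2nd, 3rd, ... copies ```
    | delete                                  ``` mark those copies unsearchable ```

Run the search without the final | delete first to confirm it matches only the duplicates you expect.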