Ideally you want to have twice the number of input pipelines as you have indexer cluster members. If you are using two HFs for input, then these should be configured for at least 4 parallel ingestion pipelines per HF. Depending on your HF hardware resources, you can run a large number of parallel pipelines (e.g., 8 or more) to get a more even data distribution. Keep in mind like the other comments: if you have only one input stream then this will possibly not help you. Line breaker and event breaker are you friends here too.
... View more