<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Decrease bundle size in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353365#M64705</link>
    <description>&lt;P&gt;It will be the contents of your SH apps.&lt;/P&gt;

&lt;P&gt;You should be able to find prospective big-file candidates by looking through the app folders on the SH.&lt;BR /&gt;
Large lookup files are normally a good candidate, so you could blacklist *.csv - but you would loose the ability to execute remote lookups.&lt;/P&gt;

&lt;P&gt;Another thing to check is that you dont have any binaries or archive files in there by mistake.&lt;/P&gt;</description>
    <pubDate>Tue, 19 Dec 2017 21:51:28 GMT</pubDate>
    <dc:creator>nickhills</dc:creator>
    <dc:date>2017-12-19T21:51:28Z</dc:date>
    <item>
      <title>Decrease bundle size</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353362#M64702</link>
      <description>&lt;P&gt;The bundle in the search head has grown upto 776 MB. Its not getting pushed as a result. &lt;BR /&gt;
How to reduce the bundle size. The files are populated in the bundle even if we delete the files in tar. &lt;BR /&gt;
why are two bundles getting created on its own and about the same size even if we don't work on the search head. &lt;/P&gt;

&lt;P&gt;Please provide examples of a replication blacklist which can be very specific. &lt;/P&gt;</description>
      <pubDate>Tue, 19 Dec 2017 20:36:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353362#M64702</guid>
      <dc:creator>nawazns5038</dc:creator>
      <dc:date>2017-12-19T20:36:59Z</dc:date>
    </item>
    <item>
      <title>Re: Decrease bundle size</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353363#M64703</link>
      <description>&lt;P&gt;Normally the last 5 copies of the bundle are dropped onto peers, so it can end up consuming a fair bit of space.&lt;BR /&gt;
Have you been able to work out what the large files are - are they lookups, and whats the path/name of the files?&lt;/P&gt;

&lt;P&gt;If it was a big lookup - eg yourApp/lookups/big.csv&lt;/P&gt;

&lt;P&gt;in distsearch.conf in yourApp/local&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[replicationBlacklist]
bigFile = big.csv
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;would do the trick&lt;/P&gt;

&lt;P&gt;edit: to add link&lt;BR /&gt;
&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/DistSearch/Limittheknowledgebundlesize"&gt;http://docs.splunk.com/Documentation/Splunk/latest/DistSearch/Limittheknowledgebundlesize&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 19 Dec 2017 20:51:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353363#M64703</guid>
      <dc:creator>nickhills</dc:creator>
      <dc:date>2017-12-19T20:51:14Z</dc:date>
    </item>
    <item>
      <title>Re: Decrease bundle size</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353364#M64704</link>
      <description>&lt;P&gt;There are tons of files in the bundle. Do you know of any way to view the contents of tar in descending order of size. &lt;/P&gt;</description>
      <pubDate>Tue, 19 Dec 2017 21:43:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353364#M64704</guid>
      <dc:creator>nawazns5038</dc:creator>
      <dc:date>2017-12-19T21:43:01Z</dc:date>
    </item>
    <item>
      <title>Re: Decrease bundle size</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353365#M64705</link>
      <description>&lt;P&gt;It will be the contents of your SH apps.&lt;/P&gt;

&lt;P&gt;You should be able to find prospective big-file candidates by looking through the app folders on the SH.&lt;BR /&gt;
Large lookup files are normally a good candidate, so you could blacklist *.csv - but you would loose the ability to execute remote lookups.&lt;/P&gt;

&lt;P&gt;Another thing to check is that you dont have any binaries or archive files in there by mistake.&lt;/P&gt;</description>
      <pubDate>Tue, 19 Dec 2017 21:51:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353365#M64705</guid>
      <dc:creator>nickhills</dc:creator>
      <dc:date>2017-12-19T21:51:28Z</dc:date>
    </item>
    <item>
      <title>Re: Decrease bundle size</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353366#M64706</link>
      <description>&lt;P&gt;Ya i can see some huge lookup files &lt;/P&gt;

&lt;P&gt;Will there be any impact if i blacklist the lookups in the bundle . &lt;/P&gt;</description>
      <pubDate>Tue, 19 Dec 2017 22:00:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353366#M64706</guid>
      <dc:creator>nawazns5038</dc:creator>
      <dc:date>2017-12-19T22:00:47Z</dc:date>
    </item>
    <item>
      <title>Re: Decrease bundle size</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353367#M64707</link>
      <description>&lt;P&gt;if they are used in lookups, you would have to run them them on the SH, not the remote peers:&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;P&gt;local Syntax: local=&lt;BR /&gt;
Description: If local=true, forces the&lt;BR /&gt;
lookup to run on the search head and&lt;BR /&gt;
not on any remote peers. Default:&lt;BR /&gt;
false&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;&lt;A href="https://docs.splunk.com/Documentation/SplunkCloud/6.6.3/SearchReference/Lookup"&gt;https://docs.splunk.com/Documentation/SplunkCloud/6.6.3/SearchReference/Lookup&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Your best bet is to blacklist the huge ones, and see what impact that has on size/usability&lt;/P&gt;</description>
      <pubDate>Tue, 19 Dec 2017 22:06:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353367#M64707</guid>
      <dc:creator>nickhills</dc:creator>
      <dc:date>2017-12-19T22:06:35Z</dc:date>
    </item>
    <item>
      <title>Re: Decrease bundle size</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353368#M64708</link>
      <description>&lt;P&gt;ya if we upload a lookup in a search head it will not get pushed into search peers right ? &lt;/P&gt;

&lt;P&gt;should we write local=true in the query while using the command ? &lt;/P&gt;

&lt;P&gt;How do we avoid the lookup not getting pushed into the searchpeers &lt;/P&gt;</description>
      <pubDate>Tue, 19 Dec 2017 22:12:24 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353368#M64708</guid>
      <dc:creator>nawazns5038</dc:creator>
      <dc:date>2017-12-19T22:12:24Z</dc:date>
    </item>
    <item>
      <title>Re: Decrease bundle size</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353369#M64709</link>
      <description>&lt;P&gt;In a distributed env, the lookups are shared with all the peers, so lookup processing is distributed across all the peers.&lt;/P&gt;

&lt;P&gt;By default all lookups will do this, including lookups you manually upload.&lt;/P&gt;

&lt;P&gt;Generally speaking large lookup files are discouraged in favour of summary indexing, but this is not always convenient if the lookup data is coming from a remote source. (or another search)&lt;/P&gt;

&lt;P&gt;What I do is blacklist the big CSV in distsearch, and then on the SH run a scheduled search which does &lt;CODE&gt;|inputlookup local=true ...|collect...&lt;/CODE&gt; to write it to a summary index. My main searches, then use the SI rather than the lookup.&lt;/P&gt;</description>
      <pubDate>Tue, 19 Dec 2017 22:19:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Decrease-bundle-size/m-p/353369#M64709</guid>
      <dc:creator>nickhills</dc:creator>
      <dc:date>2017-12-19T22:19:19Z</dc:date>
    </item>
  </channel>
</rss>

