<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Good Data Input .. No Indexing in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97314#M20289</link>
    <description>&lt;P&gt;Thanks for the response, the data seems to at least be indexing now (updated in the body above), it just never appears on the Search page. Currently the only "Source type" is syslog, though there are seven other enabled data sources with files.  Perhaps I am missing some step to get other source types to appear in the search?&lt;/P&gt;</description>
    <pubDate>Wed, 01 Aug 2012 19:27:34 GMT</pubDate>
    <dc:creator>vbrtrmn</dc:creator>
    <dc:date>2012-08-01T19:27:34Z</dc:date>
    <item>
      <title>Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97308#M20283</link>
      <description>&lt;P&gt;I have a data source on the local file system configured as such..&lt;/P&gt;

&lt;P&gt;Path:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;/data/splunk/rrsearch/server-01/processed.1341878400.gz
/data/splunk/rrsearch/server-01/processed.1341964800.gz
/data/splunk/rrsearch/server-02/processed.1341878400.gz
/data/splunk/rrsearch/server-02/processed.1341964800.gz
/data/splunk/rrsearch/server-03/processed.1341878400.gz
/data/splunk/rrsearch/server-03/processed.1341964800.gz
...etc...
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;UL&gt;
&lt;LI&gt;Path: /data/logs/rrsearch&lt;/LI&gt;
&lt;LI&gt;Set Host: Segment on Path / 4&lt;/LI&gt;
&lt;LI&gt;Source type: Manual / Baseline Search&lt;/LI&gt;
&lt;LI&gt;Index: baseline_search&lt;/LI&gt;
&lt;LI&gt;Whitelist: .+processed.+gz$&lt;/LI&gt;
&lt;LI&gt;Blacklist: &lt;EM&gt;left empty&lt;/EM&gt;&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;The Data Inputs - Files &amp;amp; Directories screen shows 620 files.&lt;/P&gt;

&lt;P&gt;The problem is none of the data ever seems to get indexed, other data in the /data/splunk path does get indexed for other projects.  I feel I'm missing one small step, can anyone throw me a bone?&lt;/P&gt;

&lt;P&gt;Per @Lamar's request, inputs.conf&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[default]
host = wsi-hub

[monitor:///data/splunk/remote]
host_segment = 4
sourcetype = syslog
blacklist = .*.gz
disabled = 0
host = 

[monitor://$SPLUNK_HOME/var/log/splunk]
blacklist = *.gz
disabled = false

[monitor:///data/logs/rrsearch]
disabled = false
followTail = 0
host = 
host_regex = 
index = baseline_search
whitelist = .+processed.+gz$
sourcetype = Baseline Search
host_segment = 4
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;In indexes:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;Index Name: baseline_search
Max Size: 500,000
Frozen Archive: None 
Current Size: 3,807
Event Count: 54,237,503
Earliest Event: May 13, 2012 7:59:59 PM
Latest Event: Jul 30, 2012 7:59:59 PM
Home Path: /opt/splunk/var/lib/splunk/baseline_search/db
App: search
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 12 Jul 2012 16:40:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97308#M20283</guid>
      <dc:creator>vbrtrmn</dc:creator>
      <dc:date>2012-07-12T16:40:52Z</dc:date>
    </item>
    <item>
      <title>Re: Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97309#M20284</link>
      <description>&lt;P&gt;Without being able to see your 'actual' input configuration I'll take a guess and say that you've got to make sure you're searching on index=baseline_search unless you've set your default indexes to include that one.&lt;/P&gt;

&lt;P&gt;Include your inputs.conf and we may be able to get a bit further.&lt;/P&gt;</description>
      <pubDate>Thu, 12 Jul 2012 18:14:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97309#M20284</guid>
      <dc:creator>Lamar</dc:creator>
      <dc:date>2012-07-12T18:14:59Z</dc:date>
    </item>
    <item>
      <title>Re: Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97310#M20285</link>
      <description>&lt;P&gt;I'll give a nod to Lamar's answer, but I also notice that your whitelist doesn't match the filenames...  You have&lt;/P&gt;

&lt;P&gt;Whitelist: &lt;CODE&gt;.+processed.+gz$&lt;/CODE&gt;&lt;/P&gt;

&lt;P&gt;Which should be&lt;/P&gt;

&lt;P&gt;Whitelist: &lt;CODE&gt;.+parsed.+gz$&lt;/CODE&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 12 Jul 2012 20:14:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97310#M20285</guid>
      <dc:creator>lguinn2</dc:creator>
      <dc:date>2012-07-12T20:14:57Z</dc:date>
    </item>
    <item>
      <title>Re: Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97311#M20286</link>
      <description>&lt;P&gt;I put in the file names incorrectly. &lt;EM&gt;DOH&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 31 Jul 2012 19:24:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97311#M20286</guid>
      <dc:creator>vbrtrmn</dc:creator>
      <dc:date>2012-07-31T19:24:54Z</dc:date>
    </item>
    <item>
      <title>Re: Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97312#M20287</link>
      <description>&lt;P&gt;Finally got sudo access on the server, I updated the question.&lt;/P&gt;</description>
      <pubDate>Tue, 31 Jul 2012 19:29:08 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97312#M20287</guid>
      <dc:creator>vbrtrmn</dc:creator>
      <dc:date>2012-07-31T19:29:08Z</dc:date>
    </item>
    <item>
      <title>Re: Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97313#M20288</link>
      <description>&lt;P&gt;I would first, clean up your input for the processed files.&lt;/P&gt;

&lt;P&gt;There are a few issues with it --&lt;BR /&gt;
First, the monitoring stanza won't pick up the data since the directory that you're monitoring is invalid (/data/logs)&lt;BR /&gt;
Additionally, I would define the fourth segment in your monitor.&lt;BR /&gt;
Lastly, I wouldn't put spaces in my sourcetype as Splunk doesn't respond well to spaces in sourcetypes.&lt;/P&gt;

&lt;P&gt;Fixes Below:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[monitor:///data/splunk/rrsearch/*/]
disabled = false
index = baseline_search
whitelist = .+processed.+gz$
sourcetype = Baseline_Search
host_segment = 4
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;That should get you a little closer to where you want to be.&lt;/P&gt;

&lt;P&gt;Hope it helps.&lt;/P&gt;</description>
      <pubDate>Tue, 31 Jul 2012 19:44:41 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97313#M20288</guid>
      <dc:creator>Lamar</dc:creator>
      <dc:date>2012-07-31T19:44:41Z</dc:date>
    </item>
    <item>
      <title>Re: Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97314#M20289</link>
      <description>&lt;P&gt;Thanks for the response, the data seems to at least be indexing now (updated in the body above), it just never appears on the Search page. Currently the only "Source type" is syslog, though there are seven other enabled data sources with files.  Perhaps I am missing some step to get other source types to appear in the search?&lt;/P&gt;</description>
      <pubDate>Wed, 01 Aug 2012 19:27:34 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97314#M20289</guid>
      <dc:creator>vbrtrmn</dc:creator>
      <dc:date>2012-08-01T19:27:34Z</dc:date>
    </item>
    <item>
      <title>Re: Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97315#M20290</link>
      <description>&lt;P&gt;Yeah, you'll probably want to enable this index 'baseline_search' to be searched by default by your user/role.&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/4.3.3/Admin/Addandeditroles"&gt;http://docs.splunk.com/Documentation/Splunk/4.3.3/Admin/Addandeditroles&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;In particular, these two parameters:&lt;/P&gt;

&lt;P&gt;srchIndexesDefault&lt;BR /&gt;
srchIndexesAllowed&lt;/P&gt;</description>
      <pubDate>Wed, 01 Aug 2012 19:38:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97315#M20290</guid>
      <dc:creator>Lamar</dc:creator>
      <dc:date>2012-08-01T19:38:45Z</dc:date>
    </item>
    <item>
      <title>Re: Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97316#M20291</link>
      <description>&lt;P&gt;I would be curious why you decided to segment this data off from your syslog data.&lt;/P&gt;

&lt;P&gt;Again, just curious.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Aug 2012 19:39:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97316#M20291</guid>
      <dc:creator>Lamar</dc:creator>
      <dc:date>2012-08-01T19:39:56Z</dc:date>
    </item>
    <item>
      <title>Re: Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97317#M20292</link>
      <description>&lt;P&gt;Adding a new role worked great!&lt;/P&gt;

&lt;P&gt;Splunk will be used by manager/marketing types making reports and such. I wanted to make my search engine data as segregated as possible from any syslog data. The search engine data is scrubbed to disassociate individual IPs from their searches. Some of the data in syslog may contain individually identifiable information which they are strictly forbidden from viewing.&lt;/P&gt;

&lt;P&gt;I can view the data because I have ethical standards &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;

&lt;P&gt;Thanks a lot for taking time to help me with this.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Aug 2012 20:03:43 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97317#M20292</guid>
      <dc:creator>vbrtrmn</dc:creator>
      <dc:date>2012-08-01T20:03:43Z</dc:date>
    </item>
    <item>
      <title>Re: Good Data Input .. No Indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97318#M20293</link>
      <description>&lt;P&gt;No problem, remember to flag this as your answer so that the next group of folks that run into this issue can easily figure out what to do.&lt;/P&gt;

&lt;P&gt;Take care.&lt;/P&gt;</description>
      <pubDate>Fri, 03 Aug 2012 00:53:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Good-Data-Input-No-Indexing/m-p/97318#M20293</guid>
      <dc:creator>Lamar</dc:creator>
      <dc:date>2012-08-03T00:53:14Z</dc:date>
    </item>
  </channel>
</rss>

