<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Data onboarding, splunk-server field massing up the whole data in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660285#M111654</link>
    <description>&lt;P&gt;1. There are no samples of neither orignal data nor search results so we can't know what you mean,&lt;/P&gt;&lt;P&gt;2. Splunk does not manipulate data on its own unless it's configured to do so. We don't know your configuration so we can't tell you what's going on during the onboarding process.&lt;/P&gt;&lt;P&gt;Did you check the configuration for sourcetype, source and host in question? Do you even refer to raw data, search-time extracted fields or indexed fields?&lt;/P&gt;&lt;P&gt;We have no idea what's going on because you haven't shown anything apart from a simple search (which we have no idea of knowing what to expect from not knowing the events) and some random timestamps.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 10 Oct 2023 19:14:46 GMT</pubDate>
    <dc:creator>PickleRick</dc:creator>
    <dc:date>2023-10-10T19:14:46Z</dc:date>
    <item>
      <title>Data onboarding, splunk-server field massing up the whole data</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660244#M111649</link>
      <description>&lt;DIV&gt;can someone help me with this issue where splunk is reading the file, but 'adding' a information that is NOT in the original file.&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;P&gt;&lt;SPAN&gt;If you search below&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;index="acob_controls_summary" sourcetype=acob:json source="/var/log/acobjson/*100223*rtm*"

|search system=CHE control_id=AU2_A_2_1 compliance_status=100%&lt;/LI-CODE&gt;
&lt;P&gt;&lt;SPAN&gt;You will get two result, and mainly separated by “last_test_date”&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;BR /&gt;one showing&lt;BR /&gt;"2023-10-02 15:42:30.784049"&lt;BR /&gt;and other showing&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;"2023-10-02 14:56:45.047265"&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;BR /&gt;ironically,&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;attached file is the SAME file (just changed the file name after copied onto my machine), that we are seeing from the splunk,&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;yet there is only ONE entry which is the second the one "2023-10-02 14:56:45.047265&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;where does that “2023-10-02 15:42:30.784049” came from?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;we have a cluster environment therefore many splunk-server auto creates but why is it making a new 'test date' which actually separates one entry into two, AND give one a good return yet another one with wrong info.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 10 Oct 2023 14:13:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660244#M111649</guid>
      <dc:creator>yohhpark</dc:creator>
      <dc:date>2023-10-10T14:13:12Z</dc:date>
    </item>
    <item>
      <title>Re: Data onboarding, splunk-server field massing up the whole data</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660273#M111651</link>
      <description>&lt;P&gt;I am not seeing attachments or other screenshots.&amp;nbsp;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Usually when I see duplicate events like this it has been because a file was replicated somehow "underneath" Splunk within a directory where Splunk thinks it is a new file and starts indexing it again. Or, I've seen this happen if you have the log files going to a shared mount point and two different Forwarders are pointing at the same files.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;A few questions to help you troubleshoot:&lt;BR /&gt;- You mention splunk-server.&amp;nbsp; What does the&amp;nbsp;&lt;EM&gt;splunk_server&lt;/EM&gt; field, along with values of things like&amp;nbsp;&lt;EM&gt;host,&lt;/EM&gt; sourcetype, and&lt;EM&gt; source &lt;/EM&gt;look like for these events?&amp;nbsp;&lt;/P&gt;&lt;P&gt;- Your timestamps are wildly off, and not necessarily in a predictable way (e.g. just by 1 hour).&amp;nbsp; Does your log data have timestamps within it, or are you relying on the timestamp being derived from when Splunk "sees" your log?&lt;BR /&gt;- Have you poked around in the _internal index to see where Splunk "saw" any files matching the following:&lt;/P&gt;&lt;PRE&gt;/var/log/acobjson/*100223*rtm*&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;NOTE:&amp;nbsp; Don't look for&lt;EM&gt; source=/var/log/acobjson/*100223*rtm*&lt;/EM&gt; in &lt;EM&gt;index=_internal&lt;/EM&gt;, because the&lt;EM&gt; source=&lt;/EM&gt; in this context refers to the Splunk log files that were indexed.&amp;nbsp; You can start without specifying a field, but you can also try something like&amp;nbsp;&lt;EM&gt;&lt;SPAN&gt;index=_internal series=/var/log/acobjson/*100223*rtm*&amp;nbsp;&lt;/SPAN&gt;&lt;/EM&gt;since that is one field Splunk will log this info in as it is monitoring files.&lt;/P&gt;</description>
      <pubDate>Tue, 10 Oct 2023 18:32:18 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660273#M111651</guid>
      <dc:creator>_JP</dc:creator>
      <dc:date>2023-10-10T18:32:18Z</dc:date>
    </item>
    <item>
      <title>Re: Data onboarding, splunk-server field massing up the whole data</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660275#M111652</link>
      <description>&lt;P&gt;I think the issue is more complicated than that. I understand not to look for internal.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;that is not the issue. the issue is that splunk generates different data from the orginal source with different test date. which is NOT in the file.&amp;nbsp;&lt;/P&gt;&lt;P&gt;it has to do with the cluster environment. anyone super expert in such?&lt;/P&gt;</description>
      <pubDate>Tue, 10 Oct 2023 18:49:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660275#M111652</guid>
      <dc:creator>yohhpark</dc:creator>
      <dc:date>2023-10-10T18:49:45Z</dc:date>
    </item>
    <item>
      <title>Re: Data onboarding, splunk-server field massing up the whole data</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660278#M111653</link>
      <description>&lt;P&gt;Can you provide a screenshot of the event data within Splunk, and what it looks like within the file?&amp;nbsp; If necessary redact anything private. It would also help if you could have the Splunk default fields selected so they appear in-line with your event data (host, index, linecount, punct, source, sourcetype, splunk_server, timestamp)&lt;BR /&gt;&lt;BR /&gt;I'm having a difficult time visualizing only the timestamp portion being different between two events and one log file.&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 10 Oct 2023 18:58:09 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660278#M111653</guid>
      <dc:creator>_JP</dc:creator>
      <dc:date>2023-10-10T18:58:09Z</dc:date>
    </item>
    <item>
      <title>Re: Data onboarding, splunk-server field massing up the whole data</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660285#M111654</link>
      <description>&lt;P&gt;1. There are no samples of neither orignal data nor search results so we can't know what you mean,&lt;/P&gt;&lt;P&gt;2. Splunk does not manipulate data on its own unless it's configured to do so. We don't know your configuration so we can't tell you what's going on during the onboarding process.&lt;/P&gt;&lt;P&gt;Did you check the configuration for sourcetype, source and host in question? Do you even refer to raw data, search-time extracted fields or indexed fields?&lt;/P&gt;&lt;P&gt;We have no idea what's going on because you haven't shown anything apart from a simple search (which we have no idea of knowing what to expect from not knowing the events) and some random timestamps.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 10 Oct 2023 19:14:46 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660285#M111654</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2023-10-10T19:14:46Z</dc:date>
    </item>
    <item>
      <title>Re: Data onboarding, splunk-server field massing up the whole data</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660913#M111727</link>
      <description>&lt;P&gt;No. Unless explicitly configured to do so (which may be the case but it's beyond us to know how your environment is configured) splunk doesn't "generate" data. It ingests (and possibly modifies if it's configured that way) data it's given for ingestion.&lt;/P&gt;&lt;P&gt;And clustering doesn't change the data. It merely replicates (if needed) data to other nodes.&lt;/P&gt;</description>
      <pubDate>Mon, 16 Oct 2023 17:10:33 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Data-onboarding-splunk-server-field-massing-up-the-whole-data/m-p/660913#M111727</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2023-10-16T17:10:33Z</dc:date>
    </item>
  </channel>
</rss>

