<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: summary indexing created multi value in orig_host entries in Knowledge Management</title>
    <link>https://community.splunk.com/t5/Knowledge-Management/summary-indexing-created-multi-value-in-orig-host-entries/m-p/78533#M782</link>
    <description>&lt;P&gt;Thanks for the clarification.  I modified my question to be a little clearer.  I had never noticed that multi value behavior in any results I've examined previously so it threw me.&lt;BR /&gt;&lt;BR /&gt;
Really just want to eliminate unwanted fields when creating summary data.&lt;/P&gt;</description>
    <pubDate>Wed, 13 Apr 2011 23:36:41 GMT</pubDate>
    <dc:creator>ualbanytech</dc:creator>
    <dc:date>2011-04-13T23:36:41Z</dc:date>
    <item>
      <title>summary indexing created multi value in orig_host entries</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/summary-indexing-created-multi-value-in-orig-host-entries/m-p/78531#M780</link>
      <description>&lt;P&gt;I'm using Splunk 4.1.6 and getting started with creating summary data.&lt;/P&gt;

&lt;P&gt;&lt;STRONG&gt;Edit&lt;/STRONG&gt;: What I'm trying to do is eliminate fields I don't need when writing to a summary index.
  I originally tried using the fields command but, I continued to see fields I did not specify in my summary index.&lt;/P&gt;

&lt;P&gt;I created a scheduled search which runs daily for "yesterday" and writes to a summary index.&lt;/P&gt;

&lt;P&gt;The search (I have replaced my real host names with &lt;CODE&gt;&amp;lt;hostA_3&amp;gt;&lt;/CODE&gt;, &lt;CODE&gt;&amp;lt;hostA_4&amp;gt;&lt;/CODE&gt;, &lt;CODE&gt;&amp;lt;hostB_5&amp;gt;&lt;/CODE&gt;, &lt;CODE&gt;&amp;lt;hostB_7&amp;gt;&lt;/CODE&gt;&lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;splunk_server=splunk-uad* index=uad-* host=&amp;lt;hostB&amp;gt;* OR host=&amp;lt;hostA&amp;gt;* sourcetype=access_combined_rsptime NOT netid="-" | dedup netid, clientip, host |sort - _time|  stats values(host) AS host by _time, req_time, clientip, netid
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;This went fine.&lt;/P&gt;

&lt;P&gt;However when I started playing with some reports against this summary data I noticed that I seemed to get 2 sporadic events out of 9,998 whose "orig_host" value is mangled.&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;orig_host="&amp;lt;hostA_3&amp;gt;.itsli.albany.edu &amp;lt;hostB_5&amp;gt;.itsli.albany.edu"
orig_host="&amp;lt;hostA_4&amp;gt;.itsli.albany.edu &amp;lt;hostB_7&amp;gt;.itsli.albany.edu"
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;It seems splunk concatenated two of my host names together for two arbitrary events.&lt;/P&gt;

&lt;P&gt;I checked the events the summary events were created from and there is no "host" fields with those bad values in the original events.&lt;/P&gt;

&lt;P&gt;Am I doing something subtle/ignorant in my summary search that caused this?&lt;/P&gt;</description>
      <pubDate>Sat, 09 Apr 2011 05:57:25 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/summary-indexing-created-multi-value-in-orig-host-entries/m-p/78531#M780</guid>
      <dc:creator>ualbanytech</dc:creator>
      <dc:date>2011-04-09T05:57:25Z</dc:date>
    </item>
    <item>
      <title>Re: summary indexing created multi value in orig_host entries</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/summary-indexing-created-multi-value-in-orig-host-entries/m-p/78532#M781</link>
      <description>&lt;P&gt;your stats is split by _time, req_time, clientip, and netid. If you happen to have two entries with identical values for those four, which is possible (and is in fact what probably happened, for two sets of those values), then all values of &lt;CODE&gt;host&lt;/CODE&gt; that have the same value set will be made into a multi-valued field with a list of the values. That's what the &lt;CODE&gt;values()&lt;/CODE&gt; function does and is supposed to do. If that's not what you want, then you might want to use &lt;CODE&gt;first()&lt;/CODE&gt; or &lt;CODE&gt;last()&lt;/CODE&gt; instead.&lt;/P&gt;</description>
      <pubDate>Sat, 09 Apr 2011 07:11:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/summary-indexing-created-multi-value-in-orig-host-entries/m-p/78532#M781</guid>
      <dc:creator>gkanapathy</dc:creator>
      <dc:date>2011-04-09T07:11:06Z</dc:date>
    </item>
    <item>
      <title>Re: summary indexing created multi value in orig_host entries</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/summary-indexing-created-multi-value-in-orig-host-entries/m-p/78533#M782</link>
      <description>&lt;P&gt;Thanks for the clarification.  I modified my question to be a little clearer.  I had never noticed that multi value behavior in any results I've examined previously so it threw me.&lt;BR /&gt;&lt;BR /&gt;
Really just want to eliminate unwanted fields when creating summary data.&lt;/P&gt;</description>
      <pubDate>Wed, 13 Apr 2011 23:36:41 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/summary-indexing-created-multi-value-in-orig-host-entries/m-p/78533#M782</guid>
      <dc:creator>ualbanytech</dc:creator>
      <dc:date>2011-04-13T23:36:41Z</dc:date>
    </item>
    <item>
      <title>Re: summary indexing created multi value in orig_host entries</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/summary-indexing-created-multi-value-in-orig-host-entries/m-p/78534#M783</link>
      <description>&lt;P&gt;I went back and adjusted my search as follows:&lt;BR /&gt;
   index=uad-* host=&lt;HOSTB&gt;* OR host=&lt;HOSTA&gt;* sourcetype=access_combined_rsptime NOT netid="-" | dedup netid, clientip, host | stats values(netid)  AS netid  by clientip, host&lt;/HOSTA&gt;&lt;/HOSTB&gt;&lt;/P&gt;

&lt;P&gt;I decided I don't care about the time since req time is irrelevant for what I'm trying to answer with my sum. data. Uniq users on our hosts per day.&lt;/P&gt;

&lt;P&gt;Still not sure why I couldn't control the fields being written to summary index with the fields command.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 09:27:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/summary-indexing-created-multi-value-in-orig-host-entries/m-p/78534#M783</guid>
      <dc:creator>ualbanytech</dc:creator>
      <dc:date>2020-09-28T09:27:53Z</dc:date>
    </item>
  </channel>
</rss>

