<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Proper reg-ex to extracts cisco_ironport_web.log fields like - user, domain and url in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359420#M106264</link>
    <description>&lt;P&gt;cisco_ironport_web.log contains the following events:&lt;/P&gt;

&lt;P&gt;&lt;STRONG&gt;Event - 1&lt;/STRONG&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;1489714117.601 56 27.1.11.11 TCP_REFRESH_HIT/200 54491 GET http://www.flatbed-scanner-review.org/inter-banner_flatbed.jpg bhussain@buttercupgames.com DIRECT/www.flatbed-scanner-review.org image/jpeg DEFAULT_CASE-DefaultGroup-Demo_Clients-NONE-NONE-DefaultRouting &amp;lt;nc,ns,0,-,-,-,-,0,-,-,-,-,-,-,-,nc,-&amp;gt; - http://www.flatbed-scanner-review.org/
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;&lt;STRONG&gt;Event - 2&lt;/STRONG&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;1489713615.376 809 211.166.11.101 TCP_MISS/200 147639 GET http://www.vindy.com/ myuan@buttercupgames.com DIRECT/www.vindy.com text/html DEFAULT_CASE-DefaultGroup-Demo_Clients-NONE-NONE-DefaultRouting &amp;lt;IW_news,3.4,0,-,-,-,-,0,-,-,-,-,-,-,-,IW_news,-&amp;gt; - -
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;I use the following field list to extract user, url and domain:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;"field1","field2","field3","field4","field5","field6","url","user","field9","field10","field11","field12","field13","domain"
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;It doesn't work for the second event, because the domain field contains '-'. How do I fix it?&lt;/P&gt;</description>
    <pubDate>Tue, 29 Sep 2020 13:15:49 GMT</pubDate>
    <dc:creator>jagadeeshm</dc:creator>
    <dc:date>2020-09-29T13:15:49Z</dc:date>
    <item>
      <title>Proper reg-ex to extracts cisco_ironport_web.log fields like - user, domain and url</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359420#M106264</link>
      <description>&lt;P&gt;cisco_ironport_web.log contains the following events:&lt;/P&gt;

&lt;P&gt;&lt;STRONG&gt;Event - 1&lt;/STRONG&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;1489714117.601 56 27.1.11.11 TCP_REFRESH_HIT/200 54491 GET http://www.flatbed-scanner-review.org/inter-banner_flatbed.jpg bhussain@buttercupgames.com DIRECT/www.flatbed-scanner-review.org image/jpeg DEFAULT_CASE-DefaultGroup-Demo_Clients-NONE-NONE-DefaultRouting &amp;lt;nc,ns,0,-,-,-,-,0,-,-,-,-,-,-,-,nc,-&amp;gt; - http://www.flatbed-scanner-review.org/
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;&lt;STRONG&gt;Event - 2&lt;/STRONG&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;1489713615.376 809 211.166.11.101 TCP_MISS/200 147639 GET http://www.vindy.com/ myuan@buttercupgames.com DIRECT/www.vindy.com text/html DEFAULT_CASE-DefaultGroup-Demo_Clients-NONE-NONE-DefaultRouting &amp;lt;IW_news,3.4,0,-,-,-,-,0,-,-,-,-,-,-,-,IW_news,-&amp;gt; - -
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;I use the following field list to extract user, url and domain:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;"field1","field2","field3","field4","field5","field6","url","user","field9","field10","field11","field12","field13","domain"
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;It doesn't work for the second event, because the domain field contains '-'. How do I fix it?&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2020 13:15:49 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359420#M106264</guid>
      <dc:creator>jagadeeshm</dc:creator>
      <dc:date>2020-09-29T13:15:49Z</dc:date>
    </item>
    <item>
      <title>Re: Proper reg-ex to extracts cisco_ironport_web.log fields like - user, domain and url</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359421#M106265</link>
      <description>&lt;P&gt;Hi jagadeeshm,&lt;BR /&gt;
try &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;(GET|POST)\s(?&amp;lt;url&amp;gt;[^ ]*)\s(?&amp;lt;user&amp;gt;[^ ]*)\s([^ ]*\s){4}(?&amp;lt;domain&amp;gt;[^ ]*)
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;you can test it at &lt;A href="https://regex101.com/r/1qW58r/1"&gt;https://regex101.com/r/1qW58r/1&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Bye.&lt;BR /&gt;
Giuseppe&lt;/P&gt;</description>
      <pubDate>Fri, 17 Mar 2017 07:45:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359421#M106265</guid>
      <dc:creator>gcusello</dc:creator>
      <dc:date>2017-03-17T07:45:01Z</dc:date>
    </item>
    <item>
      <title>Re: Proper reg-ex to extracts cisco_ironport_web.log fields like - user, domain and url</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359422#M106266</link>
      <description>&lt;P&gt;It doesn't actually extract domain name, which is my core issue.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Mar 2017 12:49:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359422#M106266</guid>
      <dc:creator>jagadeeshm</dc:creator>
      <dc:date>2017-03-17T12:49:29Z</dc:date>
    </item>
    <item>
      <title>Re: Proper reg-ex to extracts cisco_ironport_web.log fields like - user, domain and url</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359423#M106267</link>
      <description>&lt;P&gt;Hi jagadeeshm,&lt;BR /&gt;
sorry, the correct repetition count is 5 instead of 4 (see &lt;A href="https://regex101.com/r/1qW58r/2"&gt;https://regex101.com/r/1qW58r/2&lt;/A&gt;)&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;(GET|POST)\s(?&amp;lt;url&amp;gt;[^ ]*)\s(?&amp;lt;user&amp;gt;[^ ]*)\s([^ ]*\s){5}(?&amp;lt;domain&amp;gt;[^ ]*)
&lt;/CODE&gt;&lt;/PRE&gt;
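
&lt;P&gt;To check the corrected pattern outside Splunk, here is a minimal Python sketch run against the two sample events from the question. It is an illustration under assumptions, not part of the original answer: the helper name extract is hypothetical, numbered groups replace the named url, user and domain groups, and the five skipped fields are made non-capturing.&lt;/P&gt;

```python
import re

# Numbered groups stand in for the named url/user/domain groups of the
# search-time pattern; the five skipped fields are non-capturing.
PATTERN = re.compile(r"(GET|POST)\s([^ ]*)\s([^ ]*)\s(?:[^ ]*\s){5}([^ ]*)")

# The two sample events from the question. \x3c and \x3e are the angle
# brackets that surround the comma-separated field.
EVENT_1 = (
    "1489714117.601 56 27.1.11.11 TCP_REFRESH_HIT/200 54491 GET "
    "http://www.flatbed-scanner-review.org/inter-banner_flatbed.jpg "
    "bhussain@buttercupgames.com DIRECT/www.flatbed-scanner-review.org "
    "image/jpeg DEFAULT_CASE-DefaultGroup-Demo_Clients-NONE-NONE-DefaultRouting "
    "\x3cnc,ns,0,-,-,-,-,0,-,-,-,-,-,-,-,nc,-\x3e - "
    "http://www.flatbed-scanner-review.org/"
)
EVENT_2 = (
    "1489713615.376 809 211.166.11.101 TCP_MISS/200 147639 GET "
    "http://www.vindy.com/ myuan@buttercupgames.com DIRECT/www.vindy.com "
    "text/html DEFAULT_CASE-DefaultGroup-Demo_Clients-NONE-NONE-DefaultRouting "
    "\x3cIW_news,3.4,0,-,-,-,-,0,-,-,-,-,-,-,-,IW_news,-\x3e - -"
)


def extract(line):
    """Return (url, user, domain) for one event, or None if nothing matches."""
    match = PATTERN.search(line)
    if match is None:
        return None
    _method, url, user, domain = match.groups()
    return url, user, domain


for event in (EVENT_1, EVENT_2):
    print(extract(event))
```

&lt;P&gt;For the second event the last field is "-", so domain comes back as "-". The pattern itself matches; the event simply carries no value in that position.&lt;/P&gt;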

&lt;P&gt;Bye.&lt;BR /&gt;
Giuseppe&lt;/P&gt;</description>
      <pubDate>Fri, 17 Mar 2017 12:54:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359423#M106267</guid>
      <dc:creator>gcusello</dc:creator>
      <dc:date>2017-03-17T12:54:42Z</dc:date>
    </item>
    <item>
      <title>Re: Proper reg-ex to extracts cisco_ironport_web.log fields like - user, domain and url</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359424#M106268</link>
      <description>&lt;P&gt;I tried the regex101 link; it extracts the domain from the field at the very end of the event. That field is not always populated, so I extract the domain from the string right after "DIRECT/" instead. This would be my solution, but only if you do not need the field at the end.&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;(GET|POST)\s(?&amp;lt;url&amp;gt;[^ ]*)\s(?&amp;lt;user&amp;gt;[^ ]*)\s([^ ]){6}\/(?&amp;lt;domain&amp;gt;[^ ]*)
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Tue, 10 Apr 2018 13:09:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359424#M106268</guid>
      <dc:creator>cradeke_splunk</dc:creator>
      <dc:date>2018-04-10T13:09:17Z</dc:date>
    </item>
    <item>
      <title>Re: Proper reg-ex to extracts cisco_ironport_web.log fields like - user, domain and url</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359425#M106269</link>
      <description>&lt;P&gt;Instead of re-inventing the wheel, you could take some inspiration from the Splunk Add-on for Cisco WSA:&lt;BR /&gt;
&lt;A href="https://splunkbase.splunk.com/app/1747/"&gt;https://splunkbase.splunk.com/app/1747/&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Looking at the sample data and the props/transforms in that TA, it seems to support data very similar to yours. The regex in there does not match perfectly (I think the part between &lt;CODE&gt;&amp;lt;...&amp;gt;&lt;/CODE&gt; causes some issues), but it might be a good start.&lt;/P&gt;</description>
      <pubDate>Tue, 10 Apr 2018 14:01:22 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Proper-reg-ex-to-extracts-cisco-ironport-web-log-fields-like/m-p/359425#M106269</guid>
      <dc:creator>FrankVl</dc:creator>
      <dc:date>2018-04-10T14:01:22Z</dc:date>
    </item>
  </channel>
</rss>

