<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Scraping Start Times in Knowledge Management</title>
    <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403793#M6474</link>
    <description>&lt;P&gt;Oh the first thing after the search made no difference was to switch it to "_raw", just reverted it back for the comment.&lt;/P&gt;</description>
    <pubDate>Thu, 21 Feb 2019 17:58:57 GMT</pubDate>
    <dc:creator>Regleston</dc:creator>
    <dc:date>2019-02-21T17:58:57Z</dc:date>
    <item>
      <title>Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403783#M6464</link>
      <description>&lt;P&gt;I am trying to extract the time taken for a process to execute from  my logs.  This is they syntax of the log:&lt;/P&gt;

&lt;P&gt;Time taken: 1.422 seconds&lt;/P&gt;

&lt;P&gt;I have been through the message boards on here and tried dozens of different suggestions from similar issues but none of the rex/regex  combinations have worked and just given me the time taken. Can someone please help me out?&lt;/P&gt;</description>
      <pubDate>Wed, 20 Feb 2019 19:22:18 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403783#M6464</guid>
      <dc:creator>Regleston</dc:creator>
      <dc:date>2019-02-20T19:22:18Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403784#M6465</link>
      <description>&lt;P&gt;Do you want to extract the value of time (1.422) from the logs?&lt;/P&gt;</description>
      <pubDate>Wed, 20 Feb 2019 20:34:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403784#M6465</guid>
      <dc:creator>Vijeta</dc:creator>
      <dc:date>2019-02-20T20:34:19Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403785#M6466</link>
      <description>&lt;P&gt;Yes, I'm just trying to get what every time shows in that area.&lt;/P&gt;</description>
      <pubDate>Wed, 20 Feb 2019 20:45:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403785#M6466</guid>
      <dc:creator>Regleston</dc:creator>
      <dc:date>2019-02-20T20:45:42Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403786#M6467</link>
      <description>&lt;P&gt;Try this regex: &lt;/P&gt;

&lt;P&gt;Time\staken:\s(?[^\s]+)&lt;/P&gt;</description>
      <pubDate>Wed, 20 Feb 2019 20:49:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403786#M6467</guid>
      <dc:creator>pkeenan87</dc:creator>
      <dc:date>2019-02-20T20:49:59Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403787#M6468</link>
      <description>&lt;P&gt;try this anywhere:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;| makeresults count=1
| eval data = "Time taken: 1.422 seconds;;;Time taken: 1.712 seconds;;;Time taken: 1.333 seconds;;;Time taken: 1.290 seconds"
| makemv delim=";;;" data
| mvexpand data
| rename COMMENT as "the above generates data below is the solution" 
| rex field=data "taken:\s+(?&amp;lt;time_taken&amp;gt;[^\s]+)"
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;hope it helps&lt;/P&gt;</description>
      <pubDate>Wed, 20 Feb 2019 21:12:50 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403787#M6468</guid>
      <dc:creator>adonio</dc:creator>
      <dc:date>2019-02-20T21:12:50Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403788#M6469</link>
      <description>&lt;P&gt;It seems to work great with your sample data but for some reason when I add it to the end of my base search I get the same result as I would if I didn't add it.&lt;/P&gt;</description>
      <pubDate>Wed, 20 Feb 2019 21:47:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403788#M6469</guid>
      <dc:creator>Regleston</dc:creator>
      <dc:date>2019-02-20T21:47:17Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403789#M6470</link>
      <description>&lt;P&gt;share your base search and a sample full event&lt;/P&gt;</description>
      <pubDate>Wed, 20 Feb 2019 22:21:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403789#M6470</guid>
      <dc:creator>adonio</dc:creator>
      <dc:date>2019-02-20T22:21:32Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403790#M6471</link>
      <description>&lt;P&gt;index=app_hdfs host=FQHN source+/opt/hdp/dna-sqoop-ingest/log/mls-ingest-logs/ingest-mls-summary*.log | rex field=data "taken:\s+(?[^\s]+)"&lt;/P&gt;

&lt;P&gt;Time taken: 1.422 seconds&lt;BR /&gt;
host =  FQHN  source =  /opt/hdp/dna-sqoop-ingest/log/mls-ingest-logs/ingest-mls-summary.log sourcetype =   ingest-mls-summary&lt;/P&gt;</description>
      <pubDate>Thu, 21 Feb 2019 14:03:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403790#M6471</guid>
      <dc:creator>Regleston</dc:creator>
      <dc:date>2019-02-21T14:03:02Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403791#M6472</link>
      <description>&lt;P&gt;(Previous messages got removed for some reason)&lt;BR /&gt;
That works great with your sample data but with my base search does nothing.&lt;/P&gt;

&lt;P&gt;My search query:&lt;BR /&gt;
&lt;STRONG&gt;index= host= source=//mls-ingest-logs/ingest-mls-summary*.log| rex field=data "taken:\s+(?[^\s]+)"&lt;/STRONG&gt;&lt;/P&gt;

&lt;P&gt;Log Event:&lt;BR /&gt;
Time taken: 1.422 seconds&lt;BR /&gt;
host =  &lt;HOST&gt; source = /&lt;PATH&gt;/ingest-mls-summary.log sourcetype = ingest-mls-summary&lt;/PATH&gt;&lt;/HOST&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 21 Feb 2019 16:51:23 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403791#M6472</guid>
      <dc:creator>Regleston</dc:creator>
      <dc:date>2019-02-21T16:51:23Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403792#M6473</link>
      <description>&lt;P&gt;the field &lt;CODE&gt;data&lt;/CODE&gt; is in my example, you can remove it for your query &lt;BR /&gt;
&lt;CODE&gt;.... | rex field=_raw "taken:\s+(?&amp;lt;time_taken&amp;gt;[^\s]+)"&lt;/CODE&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 21 Feb 2019 17:54:51 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403792#M6473</guid>
      <dc:creator>adonio</dc:creator>
      <dc:date>2019-02-21T17:54:51Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403793#M6474</link>
      <description>&lt;P&gt;Oh the first thing after the search made no difference was to switch it to "_raw", just reverted it back for the comment.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Feb 2019 17:58:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403793#M6474</guid>
      <dc:creator>Regleston</dc:creator>
      <dc:date>2019-02-21T17:58:57Z</dc:date>
    </item>
    <item>
      <title>Re: Scraping Start Times</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403794#M6475</link>
      <description>&lt;P&gt;ok, the query i provided first is a dummy query that creates fake data and shows that the regex works&lt;/P&gt;

&lt;P&gt;try it top make sure its fine.&lt;BR /&gt;
on your real data, use this:&lt;BR /&gt;
&lt;CODE&gt;index=YOUR_INDEX_NAME_HERE host=YOUR_HOST_NAME_HERE source=//mls-ingest-logs/ingest-mls-summary*.log| rex field=_raw "taken:\s+(?[^\s]+)"&lt;/CODE&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 21 Feb 2019 18:12:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Scraping-Start-Times/m-p/403794#M6475</guid>
      <dc:creator>adonio</dc:creator>
      <dc:date>2019-02-21T18:12:54Z</dc:date>
    </item>
  </channel>
</rss>

