<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Difficulty in determining the query to extract a dataset in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Difficulty-in-determining-the-query-to-extract-a-dataset/m-p/699940#M115935</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/272346"&gt;@Cleanhearty&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;check if the used fields (amount, gender, and category) are in the lookup and the name is exactly the same (field names are case sensitive).&lt;/P&gt;&lt;P&gt;then check the amount field format.&lt;/P&gt;&lt;P&gt;ciao.&lt;/P&gt;&lt;P&gt;Giuseppe&lt;/P&gt;</description>
    <pubDate>Tue, 24 Sep 2024 13:29:52 GMT</pubDate>
    <dc:creator>gcusello</dc:creator>
    <dc:date>2024-09-24T13:29:52Z</dc:date>
    <item>
      <title>Difficulty in determining the query to extract a dataset</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Difficulty-in-determining-the-query-to-extract-a-dataset/m-p/699494#M115885</link>
      <description>&lt;P&gt;As a newbie I am currently working on a mini internship project which requires me to analyse a dataset using splunk. I have completed almost all but the last part of it which reads&amp;nbsp;&amp;nbsp;"gender that performed the most fraudulent activities and in what category". Basically im supposed to get the gender (F or M) that performed the most fraud in specifically in what category.&amp;nbsp;The dataset which consists of a column of&amp;nbsp; steps, customer, age,gender, Postcodeorigin, merchant, category,amount and fround from a file name fraud_report.csv . The file has already been uploaded to splunk.&amp;nbsp; I am just stuck at the query part.&lt;/P&gt;</description>
      <pubDate>Thu, 19 Sep 2024 06:06:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Difficulty-in-determining-the-query-to-extract-a-dataset/m-p/699494#M115885</guid>
      <dc:creator>Cleanhearty</dc:creator>
      <dc:date>2024-09-19T06:06:44Z</dc:date>
    </item>
    <item>
      <title>Re: Difficulty in determining the query to extract a dataset</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Difficulty-in-determining-the-query-to-extract-a-dataset/m-p/699497#M115886</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/272346"&gt;@Cleanhearty&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;I suppose that you already ingested the csv file in a lookup or in an index.&lt;/P&gt;&lt;P&gt;If in a lookup you can define what you mean with "&lt;SPAN&gt;gender that performed&amp;nbsp;the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;most fraudulent activities and in what category", I suppose that you mean most fraudolent by amount,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;so you could try something like this:&lt;/SPAN&gt;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;| inputlookup fraud_report.csv
| stats max(amount) AS amount BY gender category
| sort -amount 
| head 10&lt;/LI-CODE&gt;&lt;P&gt;in this way, you have the top 10 categories by gender that have the greatest amount.&lt;/P&gt;&lt;P&gt;My hint is also to follow the Splunk Search Tutorial (&lt;A href="https://docs.splunk.com/Documentation/SplunkCloud/latest/SearchTutorial/WelcometotheSearchTutorial" target="_blank"&gt;https://docs.splunk.com/Documentation/SplunkCloud/latest/SearchTutorial/WelcometotheSearchTutorial&lt;/A&gt;) to learn how to run similar searches.&lt;/P&gt;&lt;P&gt;Ciao.&lt;/P&gt;&lt;P&gt;Giuseppe&lt;/P&gt;</description>
      <pubDate>Thu, 19 Sep 2024 06:30:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Difficulty-in-determining-the-query-to-extract-a-dataset/m-p/699497#M115886</guid>
      <dc:creator>gcusello</dc:creator>
      <dc:date>2024-09-19T06:30:06Z</dc:date>
    </item>
    <item>
      <title>Re: Difficulty in determining the query to extract a dataset</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Difficulty-in-determining-the-query-to-extract-a-dataset/m-p/699939#M115934</link>
      <description>&lt;P&gt;Thanks for the help. Unfortunately it didnt return any results(statistics(0)). That's weird.&amp;nbsp;&lt;/P&gt;&lt;P&gt;PS: I replaced the file name with the origial.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Sep 2024 13:13:11 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Difficulty-in-determining-the-query-to-extract-a-dataset/m-p/699939#M115934</guid>
      <dc:creator>Cleanhearty</dc:creator>
      <dc:date>2024-09-24T13:13:11Z</dc:date>
    </item>
    <item>
      <title>Re: Difficulty in determining the query to extract a dataset</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Difficulty-in-determining-the-query-to-extract-a-dataset/m-p/699940#M115935</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/272346"&gt;@Cleanhearty&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;check if the used fields (amount, gender, and category) are in the lookup and the name is exactly the same (field names are case sensitive).&lt;/P&gt;&lt;P&gt;then check the amount field format.&lt;/P&gt;&lt;P&gt;ciao.&lt;/P&gt;&lt;P&gt;Giuseppe&lt;/P&gt;</description>
      <pubDate>Tue, 24 Sep 2024 13:29:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Difficulty-in-determining-the-query-to-extract-a-dataset/m-p/699940#M115935</guid>
      <dc:creator>gcusello</dc:creator>
      <dc:date>2024-09-24T13:29:52Z</dc:date>
    </item>
  </channel>
</rss>

