<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: data model storage and backups in Reporting</title>
    <link>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151243#M3356</link>
    <description>&lt;P&gt;Exclude the &lt;CODE&gt;datamodel_summary&lt;/CODE&gt; directories from backup.&lt;BR /&gt;
If you restore an index, Splunk recreates the accelerated data model (that is what is stored in &lt;CODE&gt;datamodel_summary&lt;/CODE&gt;) automatically.&lt;/P&gt;</description>
    <pubDate>Sun, 09 Aug 2015 22:35:45 GMT</pubDate>
    <dc:creator>helge</dc:creator>
    <dc:date>2015-08-09T22:35:45Z</dc:date>
    <item>
      <title>data model storage and backups</title>
      <link>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151238#M3351</link>
      <description>&lt;P&gt;Question from my backup guys and I couldn't find a good answer in the docs- I don't understand the structure of the data model data on the system. Indexes with a data model defined have a datamodel_summary directory:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[splunk@splunk3 splunk]$ ll ./firewall
total 60
drwx------.  37 splunk splunk  4096 May  2 13:26 colddb
drwx------. 340 splunk splunk 24576 May  2 13:35 datamodel_summary
drwx------. 306 splunk splunk 20480 May  3 10:06 db
drwx------.   2 splunk splunk  4096 Aug 17  2013 thaweddb
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;In the _internaldb index directory, I seem to have one of these and another "summary" directory that looks like it's associated somehow with the splunk deployment monitor:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[splunk@splunk3 splunk]$ ll _internaldb/
total 532
drwx------. 2216 splunk splunk 126976 May  3 09:47 colddb
drwx------. 2519 splunk splunk 184320 May  3 09:55 datamodel_summary
drwx------.  306 splunk splunk  28672 May  3 10:08 db
drwx------. 2519 splunk splunk 184320 May  3 09:50 summary
drwx------.    2 splunk splunk   4096 Aug 16  2013 thaweddb

[splunk@splunk3 splunk]$ ll _internaldb/summary/998_163BFC27-2C4C-4CDE-83CD-F8B48C29BA80/20D17CF6-2E61-47A1-B3A4-FF57509916DF/
total 596
drwx------. 2 splunk splunk 32768 Dec 23 05:10 splunk_deployment_monitor_nobody_1a56f43bf8d5bf20
drwx------. 2 splunk splunk 32768 Dec 23 05:10 splunk_deployment_monitor_nobody_26e747c470c62ba8
&amp;lt;snip several lines /&amp;gt;
drwx------. 2 splunk splunk 24576 Jan 11 14:08 splunk_deployment_monitor_nobody_NSd0dc3ea132443bbf
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;From the backup perspective, the backups are throwing a thousands of errors each night for non-existant files (were there when the drive was scanned, but not when it came time to back up). I'm fairly sure it's okay to tell them to exclude the datamodel_summary (and summary) directories entirely since they can be recreated after a restore, but for my own sanity I'd like to understand the structure a bit more.&lt;/P&gt;

&lt;OL&gt;
&lt;LI&gt;Can we exclude the data models from backup?&lt;/LI&gt;
&lt;LI&gt;What is that extra summary directory in _internaldb all about? Likewise, it can be excluded?&lt;/LI&gt;
&lt;/OL&gt;</description>
      <pubDate>Sat, 03 May 2014 15:30:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151238#M3351</guid>
      <dc:creator>jeff</dc:creator>
      <dc:date>2014-05-03T15:30:57Z</dc:date>
    </item>
    <item>
      <title>Re: data model storage and backups</title>
      <link>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151239#M3352</link>
      <description>&lt;P&gt;I think the summary directory is related to report acceleration turned on for a search owned by nobody in the &lt;CODE&gt;splunk_deployment_monitor&lt;/CODE&gt; app... I also think those two kinds of accerelations don't need to be backed up because they don't contain anything unique but rather only summaries of existing index data.&lt;/P&gt;</description>
      <pubDate>Sat, 03 May 2014 15:45:05 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151239#M3352</guid>
      <dc:creator>martin_mueller</dc:creator>
      <dc:date>2014-05-03T15:45:05Z</dc:date>
    </item>
    <item>
      <title>Re: data model storage and backups</title>
      <link>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151240#M3353</link>
      <description>&lt;P&gt;The summary directory you see are for summary databases , this one seems to be generated by the deployment monitor app. &lt;/P&gt;

&lt;P&gt;Your tsdix files should go in the data model_summary dir if you do not tell them otherwise (in / via indexes.conf , look for tsidx_homepath or similar)&lt;/P&gt;

&lt;P&gt;By default summary data should go to $SPLUNK_HOME/var/lib/splunk/database/summary&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 16:32:20 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151240#M3353</guid>
      <dc:creator>lmyrefelt</dc:creator>
      <dc:date>2020-09-28T16:32:20Z</dc:date>
    </item>
    <item>
      <title>Re: data model storage and backups</title>
      <link>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151241#M3354</link>
      <description>&lt;P&gt;indexes.conf - tstatsHomePath for datamodels&lt;BR /&gt;
indexes.conf - tsidxStatsHomePath for accelerations&lt;BR /&gt;
indexes.conf - summaryHomePath for summary data&lt;/P&gt;</description>
      <pubDate>Mon, 05 May 2014 20:29:33 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151241#M3354</guid>
      <dc:creator>lmyrefelt</dc:creator>
      <dc:date>2014-05-05T20:29:33Z</dc:date>
    </item>
    <item>
      <title>Re: data model storage and backups</title>
      <link>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151242#M3355</link>
      <description>&lt;OL&gt;
&lt;LI&gt;as backup, the data should be generated when you first run /using  the data models in the pivot if i don't remember wrong, so there should not be any point in making backups of them. If you create your own data models for your data, you should take a backup of the data model configuration.&lt;/LI&gt;
&lt;/OL&gt;</description>
      <pubDate>Mon, 05 May 2014 20:32:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151242#M3355</guid>
      <dc:creator>lmyrefelt</dc:creator>
      <dc:date>2014-05-05T20:32:01Z</dc:date>
    </item>
    <item>
      <title>Re: data model storage and backups</title>
      <link>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151243#M3356</link>
      <description>&lt;P&gt;Exclude the &lt;CODE&gt;datamodel_summary&lt;/CODE&gt; directories from backup.&lt;BR /&gt;
If you restore an index, Splunk recreates the accelerated data model (that is what is stored in &lt;CODE&gt;datamodel_summary&lt;/CODE&gt;) automatically.&lt;/P&gt;</description>
      <pubDate>Sun, 09 Aug 2015 22:35:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/data-model-storage-and-backups/m-p/151243#M3356</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2015-08-09T22:35:45Z</dc:date>
    </item>
  </channel>
</rss>

