Splunk Search

Does Splunk store data in ANSI? I.e., is len(_raw) an indicator of bytes stored?

wkupersa
Path Finder

Hi!

I am trying to determine how much certain events affect our license and storage. I am a user of our system, not an admin, so I am limited in what I can see and do to answer this question.

I am wondering how Splunk licenses and stores incoming events. Does Splunk store data in ANSI? I.e., is len(_raw) an indicator of bytes stored?

Thanks!

1 Solution

richgalloway
SplunkTrust

The len function returns the "character length" of a string, which does not necessarily equal its byte count because strings are UTF-8, not ANSI, and a single UTF-8 character can occupy more than one byte. To complicate matters further, Splunk compresses data in its indexes, so even an exact byte count would not reflect the actual storage space used, although it would indicate license use.
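To make the character-versus-byte distinction concrete, here is a minimal Python sketch (not Splunk code) that compares the two counts for a few strings. The sample strings are made up for illustration, and the only assumption is that the text is encoded as UTF-8, as described above.

```python
def char_and_byte_length(text):
    """Return (character count, UTF-8 byte count) for a string."""
    return len(text), len(text.encode("utf-8"))


# Made-up sample strings, not real Splunk events.
samples = [
    "EventCode=4624",  # plain ASCII: one byte per character
    "Benutzer=Jörg",   # 'ö' takes two bytes in UTF-8
    "user=田中",        # each of these CJK characters takes three bytes
]

for s in samples:
    chars, nbytes = char_and_byte_length(s)
    print(f"{s!r}: {chars} characters, {nbytes} bytes")
```

For pure ASCII text the two numbers agree, so a character count is only a reliable byte estimate when the events contain no multi-byte characters.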

License and storage use are really admin matters. What problem are you trying to solve that requires users to know about them?

---
If this reply helps you, Karma would be appreciated.

wkupersa
Path Finder

Thanks! I am looking at Windows event logs. As a consumer of the information, I want as many event types as possible. Management wants a cost-benefit analysis, and I can't estimate cost without knowing the impact on license and storage. Even if I send some hosts to a server where I have admin rights, is there a good way to determine the impact of certain event types without sending them to their own index? A command to see the size of a field (including _raw) would be useful.


richgalloway
SplunkTrust

If you have admin access to a server, send the new data to a separate index on that server. Then you can see exactly how much storage and license is used by that index. Using a separate index makes it simpler to see the effect of the new data and makes it easy to delete the data when you're done. It also avoids affecting any apps that read existing indexes.
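The separate index gives exact numbers. Where only a rough comparison between event types is wanted, one possible complement (an assumption for illustration, not part of the advice above) is to export a sample of events and total their UTF-8 byte counts per group offline. The Python sketch below does that; `bytes_by_group` and the sample data are hypothetical, and the result approximates raw ingested size only, not compressed storage on disk.

```python
from collections import defaultdict


def bytes_by_group(events):
    """Sum the UTF-8 byte counts of raw event text per group label.

    `events` is an iterable of (group, raw_text) pairs.
    """
    totals = defaultdict(int)
    for group, raw in events:
        totals[group] += len(raw.encode("utf-8"))
    return dict(totals)


# Hypothetical sample: (event code, raw text) pairs, not real log data.
sample = [
    ("4624", "An account was successfully logged on."),
    ("4624", "An account was successfully logged on."),
    ("4688", "A new process has been created."),
]

totals = bytes_by_group(sample)
grand_total = sum(totals.values())
for group, size in sorted(totals.items()):
    print(f"{group}: {size} bytes ({size / grand_total:.0%} of sample)")
```

The shares are only as representative as the sample, so a longer export window should give a steadier estimate.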
