All Apps and Add-ons

How to configure csv sourcetype "mscs:storage:blob:csv" for Splunk add-on for MS Cloud Services?

mlevsh
Builder

We are trying to configure the Azure Storage Blob modular input for the Splunk Add-on for Microsoft Cloud Services to ingest reports that come in CSV format. We created a props.conf file in /opt/splunk/etc/apps/Splunk_TA_microsoft-cloudservices/local with the following sourcetype stanza, but field extraction is still not working. Any advice?

[mscs:storage:blob:csv]
BREAK_ONLY_BEFORE_DATE =
DATETIME_CONFIG = CURRENT
INDEXED_EXTRACTIONS = csv
KV_MODE = none
LINE_BREAKER = ([\r\n]+)
NO_BINARY_CHECK = true
SHOULD_LINEMERGE = false

Thank you!


VatsalJagani
Champion

@mlevsh - As mentioned by @mbjerkeland_spl, INDEXED_EXTRACTIONS will not work with a modular input. Use a search-time CSV extraction instead:

# props.conf  ([data] is a placeholder; use your sourcetype's stanza name)
[data]
REPORT-headers = data_headers


# transforms.conf
[data_headers]
CLEAN_KEYS = 0
DELIMS = ","
FIELDS = <comma-separated-field-list>
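
For this thread's specific sourcetype, a minimal sketch might look like the one below. The field names are placeholders, not the actual report columns, and the transform stanza name is made up. Note that REPORT- extractions are applied at search time, so on a Splunk Cloud stack these settings must also reach the search tier, not only the heavy forwarder:

# props.conf
[mscs:storage:blob:csv]
KV_MODE = none
REPORT-csv_fields = mscs_csv_fields

# transforms.conf
[mscs_csv_fields]
DELIMS = ","
FIELDS = "column_a","column_b","column_c"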

 

I hope this helps!

mbjerkeland_spl
Splunk Employee
Splunk Employee

INDEXED_EXTRACTIONS = CSV is not supported by modular inputs according to https://docs.splunk.com/Documentation/Splunk/latest/Data/Extractfieldsfromfileswithstructureddata

Input types that the indexed field extraction feature supports

This feature works with the following input types:

  • File-based inputs only (such as monitoring files, directories, or archives.)
  • Inputs that use the oneshot input type (or through the "Upload" feature in Splunk Web.)

It does not work with modular inputs, network inputs, or any other type of input.

 

You should instead use delimited field extractions to achieve the same result. See: 

  1. https://docs.splunk.com/Documentation/SplunkCloud/8.2.2203/Knowledge/FXSelectMethodstep
  2. https://www.splunk.com/en_us/blog/tips-and-tricks/quick-n-dirty-delimited-data-sourcetypes-and-you.h...

 

 


richgalloway
SplunkTrust
SplunkTrust

It would help to see sample input (sanitized as necessary).

Where did you put the props.conf file? Did you restart Splunk after modifying the file? Does the data come into the indexers directly or via a heavy forwarder?

---
If this reply helps you, an upvote would be appreciated.

mlevsh
Builder

Hi @richgalloway 

Sorry for the delay. Hopefully you will see my reply.

1) Where did you put the props.conf file?
In the app's local directory: /opt/splunk/etc/apps/Splunk_TA_microsoft-cloudservices/local

2) Did you restart Splunk after modifying the file?
Yes, Splunk was restarted.

3) Does the data come into the indexers directly or via a heavy forwarder?
To the best of my knowledge, it goes to the heavy forwarder first.

The input is a CSV file with many columns (from A to BA); the first line is a header. When I onboard the same file with the same props.conf via a data input directly on the heavy forwarder, fields are extracted perfectly.

 


richgalloway
SplunkTrust
SplunkTrust

Here are a few things to verify.

  1. The props.conf file is installed on the heavy forwarder.
  2. In case data goes to the indexer directly rather than through the HF, install the props.conf on the indexer as well.
  3. The sourcetype is set correctly in inputs.conf.
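
On point 3, an inputs.conf stanza for the Azure Storage Blob input would look roughly like the sketch below. This is illustrative only: the stanza name and parameters vary by add-on version (check the add-on's inputs.conf.spec or configure the input in the UI), and the input, account, container, and index names here are made up.

# inputs.conf on the heavy forwarder
[mscs_storage_blob://blob_csv_reports]
account = my_storage_account
container_name = reports
sourcetype = mscs:storage:blob:csv
index = main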
---
If this reply helps you, an upvote would be appreciated.

mlevsh
Builder

@richgalloway Thank you for your reply! I will verify whether the data goes directly to the heavy forwarder or to the indexers. Our heavy forwarder is on-prem, but the search head and indexers are in Splunk Cloud. We installed and configured the Splunk Add-on for Microsoft Cloud Services on our on-prem heavy forwarder and assumed the Azure Storage Blob data is being pulled through it.
 

I also see the following error message in the internal logs:


Unable to find segmenter for conf=source::source_storage_blob_name.csv|host::our_heavy_forwarder|mscs:storage:blob:csv|remoteport::17575. Will attempt to use the default configuration
