Does the Splunk Python interpreter that ModInput uses properly read/parse files that are not UTF-8 encoded?
When parsing windows-1252 encoded XML files with the TA-dmarc app, I got the following error:
validate_xml: xml parse error for file with Unsupported encoding windows-1252, line 1, column 44
However, when I parsed the file with the same scripts contained in TA-dmarc, but using the system Python interpreter, it succeeded with no errors.
An example that causes the error is available on GitHub.
https://github.com/jorritfolmer/TA-dmarc/blob/master/bin/dmarc/test/data/aol_rua.xml