A field is named product. I want to produce a list of products in my source, which are not made up of only english alphanumeric characters (any length).
I.e. Products:
Dog Collar
18 inch Dog Collar
20-inch Dog Collar
Ƨhock collar
would yield only:
20-inch Dog Collar
Ƨhock collar
(Because of the latin character and the hyphen.)
I've seen plenty of uses of sed to remove/replace the non-alphanumerics, but I don't want to remove, just get a list of the outliers.
Try like this. First two lines are to generate sample data. You need to replace that with your search.
| gentimes start=-1 | eval Product="Dog Collar#18 inch Dog Collar#20-inch Dog Collar#Ƨhock collar" | table Product | makemv Product delim="#" | mvexpand Product
| regex Product!="^[A-z\s0-9]+$"
This will keep events which have Product with any non-alphanumeric character.
Try like this. First two lines are to generate sample data. You need to replace that with your search.
| gentimes start=-1 | eval Product="Dog Collar#18 inch Dog Collar#20-inch Dog Collar#Ƨhock collar" | table Product | makemv Product delim="#" | mvexpand Product
| regex Product!="^[A-z\s0-9]+$"
This will keep events which have Product with any non-alphanumeric character.