So I've searched and searched and can't find a regex that quite fits what I want to do. What I'd like to do is extract just the ".com", ".net", ".org", etc. from a URL.
My "domain" field shows values such as "http://cdn.springserve.com" or "https://www.allpennystocks.org".
I also get "www.familylifeins.com/Resources/Shared/scripts/widgets.js" sometimes in the domain field, and of course I want to drop everything but the ".com".
What I need is just the top-level domain (".com", ".net", ".org", etc.). I've tried several different regexes I found here, but they don't quite work the way I need them to.
Basically I want to create a list of all the TLDs my company uses in a 90-day period.
Try this regex string.
(?<TLD>\.\w+?)(?:$|\/)
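If you want to sanity-check the pattern outside Splunk first, here is a minimal Python sketch run against the three sample values from the question. The function name `extract_tld` is just made up for illustration, and note that Python's `re` module spells named groups as `(?P<TLD>...)` rather than `(?<TLD>...)`; otherwise the pattern is the same.

```python
import re

# Same pattern as above; only the named-group spelling differs for Python.
TLD_RE = re.compile(r"(?P<TLD>\.\w+?)(?:$|/)")

def extract_tld(domain):
    """Return the first '.xxx' that is directly followed by '/' or the end of the string, else None."""
    match = TLD_RE.search(domain)
    return match.group("TLD") if match else None

# Sample values taken from the question.
samples = [
    "http://cdn.springserve.com",
    "https://www.allpennystocks.org",
    "www.familylifeins.com/Resources/Shared/scripts/widgets.js",
]

for value in samples:
    print(value, "->", extract_tld(value))
```

One thing to watch: the pattern only accepts a "/" or the end of the string right after the TLD, so a value with a port such as "example.com:8080" would not match at all.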
Thank you everyone...VERY much!
Don't forget to click Accept on the best answer (for you) and upvote anything else that was helpful.
There are apps for that stuff:
URL Toolbox: https://splunkbase.splunk.com/app/2734/
URL Parser: https://splunkbase.splunk.com/app/1545/
URL Expander: https://splunkbase.splunk.com/app/3460/
URL Parser: https://splunkbase.splunk.com/app/3396/
Try this regex string.
(?<TLD>\.\w+?)(?:$|\/)
Great minds think alike 😉