I've found char count close enough - I can't remember if the indexing is done before or after conversion to utf8, so it may not 100% match the license count - but for determining volume of a particular source that is not in your metrics log I've always found it useful.
The metrics logs are the best way to measure, but depending on how many sources you have they may not be sampling the data you're after frequently enough.