Top-K: A Probabilistic Addition to RedisBloom

by itamarhaberon 7/2/2019, 6:31 PMwith 1 comments

by gkorlandon 7/4/2019, 1:53 PM

"In this benchmark, we extracted a list of the most common words in the book War and Peace, which contains over 500,000 words... Its accuracy was 100% in most cases, except for very high Ks where it ‘only’ achieved 99.9% accuracy"

Interesting how such probabilistic data structures can in most cases behave almost as good as deterministic.