Fuzzy Matching at Scale
Fuzzy matching is a useful and well-discussed tool. However, the popular methods based on edit distances, such as Levenshtein or Jaro-Winkler, have failed to keep up with increasing data sizes, because comparing every pair of records grows quadratically with the size of the dataset. This talk will walk you through modern methods based on character n-grams, vector space models, and approximate nearest neighbours for fuzzy matching at scale.
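
To give a flavour of the n-gram and vector space idea, here is a minimal sketch in plain Python. It is not the talk's implementation: the names `char_ngrams`, `vectorize`, and `NgramIndex` are hypothetical, the vectors use raw trigram counts rather than any particular weighting scheme, and the candidate search uses an exact inverted index as a stand-in for a real approximate nearest neighbour library. The point it illustrates is that each query only touches records sharing at least one n-gram with it, instead of computing an edit distance against every record.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n=3):
    """Return the overlapping character n-grams of a padded, lower-cased string."""
    padded = " " * (n - 1) + text.lower() + " " * (n - 1)
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def vectorize(text, n=3):
    """Map a string to an L2-normalised sparse vector of n-gram counts."""
    counts = Counter(char_ngrams(text, n))
    norm = math.sqrt(sum(c * c for c in counts.values()))
    return {gram: c / norm for gram, c in counts.items()}


class NgramIndex:
    """Hypothetical helper: exact cosine search over an inverted n-gram index.

    Assumption: a production system would swap the exact search in `query`
    for an approximate nearest neighbour index over the same vectors.
    """

    def __init__(self, names, n=3):
        self.n = n
        self.names = list(names)
        self.vectors = [vectorize(name, n) for name in self.names]
        # Inverted index: n-gram -> ids of the records that contain it.
        self.postings = defaultdict(set)
        for idx, vec in enumerate(self.vectors):
            for gram in vec:
                self.postings[gram].add(idx)

    def query(self, text, k=3):
        """Return up to k (name, cosine similarity) pairs, best match first."""
        qvec = vectorize(text, self.n)
        scores = defaultdict(float)
        # Accumulate dot products only for records sharing an n-gram with the query.
        for gram, weight in qvec.items():
            for idx in self.postings.get(gram, ()):
                scores[idx] += weight * self.vectors[idx][gram]
        ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
        return [(self.names[idx], score) for idx, score in ranked[:k]]


if __name__ == "__main__":
    index = NgramIndex(["Jonathan Smith", "Joanna Smythe", "Acme Corporation", "Acme Corp"])
    for name, score in index.query("Jonathon Smith", k=2):
        print(f"{name}: {score:.3f}")
```

Because the vectors are normalised, the accumulated dot product is the cosine similarity, so an identical string scores 1 and a string with no shared trigrams is never scored at all.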