build_corpus.RdCalculate word corpus for weighted jaccard matching
build_corpus(namelist1, namelist2)character vector of names from dataset 1
character vector of names from dataset 2
a data.table with columns for frequency, inverse frequency, and log inverse frequency for each word in the two strings.