X-Git-Url: https://git.xinqibao.xyz/Wordscapes.git/blobdiff_plain/9f38ff3246bf065726105d606d49dfbe54efe4af..0ab568a75c3963dfead0e525f5b19ecf79fc5f8f:/google-10000-english-master/LICENSE.md?ds=inline diff --git a/google-10000-english-master/LICENSE.md b/google-10000-english-master/LICENSE.md new file mode 100644 index 0000000..17eeda3 --- /dev/null +++ b/google-10000-english-master/LICENSE.md @@ -0,0 +1 @@ +Data files are derived from the *Google Web Trillion Word Corpus*, as described by [Thorsten Brants and Alex Franz](http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.html), and distributed by the [Linguistic Data Consortium](http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13). Subsets of this corpus distributed by [Peter Novig](http://norvig.com/ngrams/). Corpus editing and cleanup by Josh Kaufman. \ No newline at end of file