Xinqi Bao's Git

more dictionaries
diff --git a/google-10000-english-master/LICENSE.md b/google-10000-english-master/LICENSE.md
new file mode 100644
index 0000000..17eeda3
--- /dev/null
@@ -0,0 +1 @@
+Data files are derived from the *Google Web Trillion Word Corpus*, as described by [Thorsten Brants and Alex Franz](http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.html), and distributed by the [Linguistic Data Consortium](http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13). Subsets of this corpus are distributed by [Peter Norvig](http://norvig.com/ngrams/). Corpus editing and cleanup by Josh Kaufman.
\ No newline at end of file