From: Steven Tobin Date: Thu, 17 Dec 2020 22:48:38 +0000 (+0000) Subject: Merge pull request #116 from Laga/master X-Git-Url: http://git.99rst.org/?a=commitdiff_plain;h=22d81b9556a1fb63cde8245649b2e0d1a453f7b4;p=redacted-XKCD-password-generator.git Merge pull request #116 from Laga/master Add second, more extensive German wordlist --- 22d81b9556a1fb63cde8245649b2e0d1a453f7b4 diff --cc README.rst index 5b11f31,161caa6..4490fc6 --- a/README.rst +++ b/README.rst @@@ -134,9 -134,10 +134,10 @@@ Additional language - Spanish: a modifed version of archive.umich.edu in the `/linguistics` directory. It includes ~80k words. Less than 5 char. and latin-like words were deleted using regex. This list is public domain, see `here `_. - Finnish: a modified version of the Institute for the Languages of Finland `XML word list `_. Profanities and expressions containing spaces were removed using regex. The resulting list contains ~93k words. The list is published under GNU LGPL, EUPL 1.1 and CC-BY 3.0 licenses. - Italian: generated from dumps of the Italian-language Wikipedia, which is released under the Creative Commons Attribution-Share-Alike 3.0 licence. - - German: based on `this GPL v3 list `_. Single and double character words have been removed. + - German (ger-anlx): based on `this GPL v3 list `_. Single and double character words have been removed. + - German (eff_large_de.wordlist): based on `this public domain dictionary `_. Converted to UTF-8. - Norwegian: a modified version of `Norsk Ordbank in Norwegian Bokmål 2005 `_, 2018-06-28 update, which is released under the `CC-BY 4.0 license `_. Regex has been used to alter the list for cleanup and removal of words with impractical characters. The resulting list contains ~137k words. -- French: Cleaned version of `this list `_. Public domain. +- French: One cleaned version of `this list `_ (public domain), and one filtered to remove potentially offensive words. - Portuguese: Converted variant of the LibreOffice / Firefox poturguese dictionary (from `this link `_. GPL and BSD licenced. Additional language word lists are always welcome!