@oconnore
Created June 22, 2013 02:58
Twitter response
Hey Ryan,
I thought this would be the easiest way to answer you in 140+ characters.
My first point is that the entropy of a random string is not determined by the size of the character set alone; length matters just as much. If you were to pick a number uniformly at random between 1 and 100, and I randomly picked seven 1's or 0's, my string would have more entropy:
log2(100) ≈ 6.64 bits
and
2^7 > 2^6.64
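As a quick sanity check of that arithmetic, here is a minimal sketch in Python (nothing assumed beyond the numbers above):

import math

# One uniform pick from 1..100: log2 of the number of outcomes.
print(math.log2(100))    # ~6.64 bits

# Seven independent random bits (seven 1's or 0's): 7 full bits.
print(7 * math.log2(2))  # 7.0 bits, which beats 6.64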
---
The second point is that removing three characters from a character set does not cost 3 bits of entropy. For example, there are 94 printable characters on a typical keyboard:
len("ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz1234567890`~!@#$%^&*()-=_+[]\\{}|;':\",./<>?") == 94
if we remove @, :, and !, it becomes
len("ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz1234567890`~#$%^&*()-=_+[]\\{}|;'\",./<>?") == 91
and
log2(94) ≈ 6.555 bits per character
log2(91) ≈ 6.508 bits per character
so we lose about 0.05 bits per character, not 3 bits.
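Checking the difference directly (a small sketch; the 12-character length is just an illustrative assumption):

import math

per_char_loss = math.log2(94) - math.log2(91)
print(per_char_loss)       # ~0.047 bits per character

# Even across a 12-character random password, the total cost is tiny:
print(12 * per_char_loss)  # ~0.56 bits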
---
The last point is that people generally don't randomly generate their passwords; they use strings that are easy to remember. A user-chosen password has very little entropy per character, and most users wouldn't pick a password containing @, :, or ! anyway. This presentation [1] explains how to estimate the entropy of a user-chosen password (it's low, on the order of 1-2 bits per character).
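For a rough idea of what that estimate looks like, here is a sketch of the per-character schedule described in [1] (the exact figures and composition bonuses vary between NIST drafts, so treat this as illustrative, not authoritative):

def nist_entropy_estimate(password):
    # Rough per-character schedule from the NIST scheme in [1]:
    # 4 bits for the first character, 2 bits each for characters 2-8,
    # 1.5 bits each for characters 9-20, 1 bit each beyond that.
    bits = 0.0
    for i in range(len(password)):
        if i == 0:
            bits += 4.0
        elif i < 8:
            bits += 2.0
        elif i < 20:
            bits += 1.5
        else:
            bits += 1.0
    # Bonus for a composition rule forcing upper case plus non-alphabetic
    # characters (6 bits is the commonly cited figure).
    if any(c.isupper() for c in password) and any(not c.isalpha() for c in password):
        bits += 6.0
    return bits

print(nist_entropy_estimate("correcthorsebatterystaple"))  # 41.0 bits, ~1.6 bits/char

Note that the 25-character passphrase from [3] scores about 1.6 bits per character under this scheme, consistent with the 1-2 bits/character figure above.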
If you are actually generating passwords randomly [2], either ignore the 0.05 bits/char that you lost, or add another character.
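For completeness, a minimal sketch of random generation over the full printable set (this assumes Python 3.6+ for the secrets module; the 16-character length is arbitrary):

import secrets
import string

# The full 94-character printable set: 52 letters + 10 digits + 32 symbols.
ALPHABET = string.ascii_letters + string.digits + string.punctuation

def generate(length=16):
    # secrets.choice draws from the OS CSPRNG, which is what you want for
    # passwords; adding one extra character more than repays the ~0.05
    # bits/char lost to a slightly smaller set.
    return "".join(secrets.choice(ALPHABET) for _ in range(length))

print(generate())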
[1]: http://csrc.nist.gov/archive/pki-twg/y2003/presentations/twg-03-05.pdf
[2]: https://github.com/oconnore/diceware
[3]: http://www.xkcd.com/936/
@rfreebern

Having thought about this a bit, I totally see what you're saying, but I still think it's bad practice to limit the charset. Not only does it help attackers (even just a little), but it also creates a poor user experience for people who run into it, like me.

I'm a developer, and I've written authentication code a time or two, so I know what's involved. When a system rejects the password I choose, for any reason, my immediate thought is: what other compromises have they made that make development simpler but potentially reduce security? It leaves a bad taste in my mouth. I was unclear in our discussion of this on Twitter: the reduction in entropy wasn't my major concern; the reduction in trust was.

Good passwords are behavioral, like you said. Trying to get people to choose long passwords that can't easily be guessed is difficult; putting an unexpected and unstated rule in their way is just going to lead to frustration, and frustration leads to poor behavior.

In http://arstechnica.com/security/2013/05/how-crackers-make-minced-meat-out-of-your-passwords/ one of the crackers they interviewed explicitly said, "If I knew the site, I would go there and find out what the requirements are." Processing power is cheap, and cracking tools are powerful. Seeing anything that doesn't follow best practices, especially on a site targeted at developers (who really should know better), worries me.

Thanks for helping me understand the math behind entropy calculation better, though. I appreciate the time you took to write this up and send me pointers.
