id	summary	reporter	owner	description	type	status	priority	milestone	component	version	resolution	keywords	cc
2846	Convert common unicode punctuation to ascii.	bmfrosty	john	"It appears that some ebook reader hardware has trouble with some Unicode punctuation symbols.  Particuarly left and right - single and double quotes.  There are other marks that may also be present that it would be logical to convert.

Cursory research found the following:

Smart Quotes:

E2 80 9C 
E2 80 9D

Apostrophes and single quotes as:

E2 80 98
E2 80 99 

In ascii, quotes are 22, and apostrophes are 27.  A search and replace in the displayed text areas might be enough.  It could possibly even be done globally, assuming that neither the HTML or CSS portions would be using any of those unicode sequences.

It looks like a pretty good list can be found here:

http://www.utf8-chartable.de/unicode-utf8-table.pl?start=8192&number=128

Of additional note there are E2 80 9B and E2 80 9F which are high-reversed-9 quotation marks - single and double respectively, E2 80 90 through E2 80 95 which are various types of hyphens.  There are also various spaces in that chart, but I'm unsure if they're used much or at all.

A little bit of further googling found this article:

http://pivotallabs.com/users/cheister/blog/articles/603-unicode-transliteration-to-ascii

which linked to a project called 'unidecode' which can be found here:

http://rubyforge.org/projects/unidecode/"	enhancement	closed	minor		EPUB Output	trunk	fixed	conversion punctuation	
