Things:
- Loading a ISO-8559-1 encoded dump into a UTF-8 database breaks, fields are truncated at the first non-valid character. Not terribly surprising, but MySQL is silent about the breakage.
- Content which looks like, and is declared, ISO-8559-1 might actually get rendered by browsers as CP1252 (aka MS-ANSI WINDOWS-1252)
- The difference between these two are that 8859 doesn’t use 0x7f to 0x9f, but Windows does – for long hyphens, ellipsis etc
iconv -f cp1252 -t utf-8
IYF
Links:
Leave a Reply
Recent articles
- Docker, SELinux, Consul, Registrator
(Wednesday, 04. 29. 2015 – No Comments) - ZFS performance on FreeBSD
(Tuesday, 09. 16. 2014 – No Comments) - Controlling Exim SMTP behaviour from Dovecot password data
(Wednesday, 09. 3. 2014 – No Comments) - Heartbleed OpenSSL vulnerability
(Tuesday, 04. 8. 2014 – No Comments)
Archives
- April 2015
- September 2014
- April 2014
- September 2013
- August 2013
- March 2013
- April 2012
- March 2012
- September 2011
- June 2011
- February 2011
- January 2011
- October 2010
- September 2010
- February 2010
- September 2009
- August 2009
- January 2009
- September 2008
- August 2008
- July 2008
- May 2008
- April 2008
- February 2008
- January 2008
- November 2007
- October 2007
- September 2007
- August 2007
- December 2006
- November 2006
- August 2006
- June 2006
- May 2006
- March 2006
- February 2006
- January 2006
- December 2005
- November 2005
- October 2005