Forums > Animenano.com Issues and Requests > Unicode not displayed properly

ayyo
Question
Lvl: 2
Posts: 4
06/08/2006 12:47 AM EDT

Not sure if it's only my computer, but jeff lawson's current post "Dekkai GOOD IDEA" appears as gibberish, yet jpmeyer's blog title appears fine.

digiwombat
Digiwombat
Lvl: 4
Posts: 39
06/08/2006 02:11 AM EDT

I think the issue is more the blog software people are using than Animenano. Some of them use plain Shift-JIS, but WordPress outputs katakana as UTF-8 (or possibly whatever encoding you set your WP install to default to).

Or it could be Shift-JIS or EUC-JP not working in people's RSS feeds. I'm not entirely sure how Hung is doing the back end on this thing.

Although, it could be the opposite problem, and Animenano isn't converting UTF-8 alt codes into the proper characters. I've noticed this happen with "smart quotes" in the post titles. Either way, hopefully that sheds some light for people in one direction or another.
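To make that last possibility concrete, here is a minimal Python sketch of how UTF-8 text turns into gibberish when its bytes are read as a single-byte charset, which is the usual cause of mangled "smart quotes". It is only an illustration, not Animenano's actual code; the function names and sample strings are made up for the example.

```python
def misread_utf8(text: str, wrong_charset: str = "latin-1") -> str:
    """Encode text as UTF-8, then decode the bytes with the wrong charset."""
    return text.encode("utf-8").decode(wrong_charset)


def repair(garbled: str, wrong_charset: str = "latin-1") -> str:
    """Undo misread_utf8 by reversing the wrong decode step."""
    return garbled.encode(wrong_charset).decode("utf-8")


if __name__ == "__main__":
    smart_quote = "\u201c"  # left "smart" double quote
    # One character comes out as three junk characters.
    print(misread_utf8(smart_quote, "cp1252"))

    katakana = "デッカイ"  # sample text, an assumption about the post title
    garbled = misread_utf8(katakana)
    print(len(katakana), len(garbled))  # character count before and after
    print(repair(garbled) == katakana)  # the damage is reversible here
```

The repair step only works when the wrong decode lost no bytes, which is why latin-1 is the default here: it maps every byte to some character.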

hung
Hung
Lvl: 12
Posts: 462
06/08/2006 02:18 AM EDT

Well, I've been reading up on it. It appears that Ruby doesn't understand UTF-8 encoding. Odd, since Ruby comes from Japan... There are workarounds for this, and incidentally, I was working on a fix when I happened to start killing the db. So hopefully I can get back to that.
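For anyone wondering what "doesn't understand UTF-8" means in practice: a language that treats strings as raw bytes counts and cuts by byte, not by character. A rough sketch of the effect, written in Python with a bytes object standing in for a byte-oriented string type. The sample title and helper names are assumptions for illustration, and this is not the site's parser.

```python
title = "デッカイ GOOD IDEA"  # sample title, an assumption
raw = title.encode("utf-8")   # what a byte-oriented string actually holds


def cut_by_bytes(data: bytes, n: int) -> str:
    """Keep the first n bytes, as byte-oriented code would."""
    # errors="replace" shows a broken character instead of raising.
    return data[:n].decode("utf-8", errors="replace")


def cut_by_chars(text: str, n: int) -> str:
    """Keep the first n characters, as Unicode-aware code would."""
    return text[:n]


if __name__ == "__main__":
    print(len(title), len(raw))    # character count vs. byte count
    print(cut_by_chars(title, 4))  # four whole katakana
    print(cut_by_bytes(raw, 4))    # one katakana plus a broken fragment
```

Each katakana takes three bytes in UTF-8, so a cut at byte 4 lands in the middle of the second character.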

hung
Hung
Lvl: 12
Posts: 462
06/08/2006 02:54 AM EDT

Hmmm. So messing around with the db, it seems like I need to make the storage UTF-8. After that, I'm still not sure if it'll fix everything. I changed that Dekkai post, and it displays fine, so it could be a problem with my parser. Too tired to think about it right now.

By the way, happy birthday to me.

psgels
Psgels
Lvl: 4
Posts: 41
06/08/2006 03:15 AM EDT

Hehe, congrats on your birthday. ^_^

Tchyo
Tchyo
Lvl: 2
Posts: 5
06/08/2006 04:22 AM EDT

Actually, UTF-8 used to be rather unpopular in CJK-speaking countries, since it's heavier than the legacy charsets (typically three bytes per character instead of two). Maybe that has changed; I'm not all that informed about computing in Japan. Ruby's lack of awareness of multi-byte encodings is indeed weird, considering Shift_JIS and EUC-JP are also multi-byte. But well, PHP isn't multi-byte aware either, and I wrote Unicode programs with it.

English-speaking people sure have it easy, since UTF-8 is backward-compatible with ASCII ;)
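A quick way to see both points (the size difference and the ASCII compatibility) is to encode the same text in each charset and count bytes. A small Python sketch; the sample text and function names are just for illustration.

```python
SAMPLE = "日本語"  # three kanji, sample text chosen for the example


def encoded_sizes(text: str, charsets=("utf-8", "shift_jis", "euc_jp")) -> dict:
    """Return the number of bytes the text takes in each charset."""
    return {name: len(text.encode(name)) for name in charsets}


def same_bytes_as_ascii(text: str) -> bool:
    """True if the UTF-8 bytes are identical to the ASCII bytes."""
    return text.encode("utf-8") == text.encode("ascii")


if __name__ == "__main__":
    print(encoded_sizes(SAMPLE))             # UTF-8 needs more bytes here
    print(same_bytes_as_ascii("GOOD IDEA"))  # pure ASCII text is unchanged
```

Note that same_bytes_as_ascii raises UnicodeEncodeError for non-ASCII input, since such text has no ASCII form to compare against.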

jpmeyer
Jpmeyer
Lvl: 4
Posts: 32
06/08/2006 10:45 AM EDT

Oh also, I went to check my settings to see if everything was still the same after the db got fux0red and I noticed that the little bit of Japanese text in my profile was now garbled, but the title of my blog (writes: ai to yuuki no otogibanashi) still showed up correctly.  Shrug.

hung
Hung
Lvl: 12
Posts: 462
06/08/2006 01:54 PM EDT

Yeah, that was an experiment on my part. While I was trying to get Unicode to work, it killed the title and description of your blog that were in Japanese. So I fixed the title. Forgot about the text. Lemme see if I can fix that.

hung
Hung
Lvl: 12
Posts: 462
06/12/2006 01:04 PM EDT

I think I may have finally fixed that Unicode bug. We'll have to wait for someone to post something with Unicode, though. Just go about your usual posting and I'm sure a Unicode post will show up from riuva or orz or someone.

hung
Hung
Lvl: 12
Posts: 462
06/14/2006 05:09 AM EDT

Er... NOW it's fixed!

Oh joyous day, Caloo Calay!

lolikitsune
Lolikitsune
Lvl: 5
Posts: 100
06/14/2006 07:46 AM EDT

Frabjous, Hung, simply frabjous.

hung
Hung
Lvl: 12
Posts: 462
06/14/2006 12:01 PM EDT

At least, I thought it was last night. Looks like it's still choking on some stuff. Oh well.
