Skip to main content
0 online

Argh... Unicode in HTML problems

iwz by iwz · May 9, 2005 · 66 views

We are trying to make our webapp more international, and currently support English, Spanish, Portugeuse, German, French, and a bunch of other languages. ISO8859-1 seems to be a good enough code set for these languages.

We will soon have to support Greek. o_O It's a completely different code set, ISO8859-7.

So, my question is, should we change code sets based on user locale, or just go with UTF-8? We're running all Java, so all Strings are internally Unicode.

If I try to switch to UTF-8, it seems like lots of characters change from being renderable into a square character, or a question mark. Is this just because my machine can't display those characters? Do I have to convert all Strings into HTML entities like so? ` Or can I just display them normally like so? –

So confusing...

To contribute to the discussion, please log in.

10 Comments

yay #4 yayOG 2004

hey i read this book, it had a section on unicode, not just technical details but over view, it might help you answer some questions you can conclude yourself
http://www.joelonsoftware.com/articles/Unicode.html

so far ive never had to deal with it, the funny thing is, i actually still deal with IBM EBCDIC here, the less famous father of ASCII

yay #3 yayOG 2004

I know that if you don't have the proper character sets on your system then yes you would see a lot of [] [] [] []

iwz #3.1 iwz

hm so, is that good then, when you see those squares? maybe...

yay #3.1.1 yayOG 2004

if it's UTF-8, and you're viewing in English, the change should be transparent

IIRC the reason for UTF-8 was to allow the extended encodings while allowing English to still line up ASCII style

flomojopoanode #2 flomojopoanodeFounder

ask the brothers who code www.watchtower.org. They're super international. Even intergalactic planetary.

iwz #2.1 iwz

nice, they made something called MEPS-16 which should work great! thanks

:(

flomojopoanode #2.1.1 flomojopoanodeFounder

but how do they translate the WEBSITE, (i'm not talking about the Mags and Literature)

yay #2.1.2 yayOG 2004

LOL

thefunkyfresh #2.1.3 thefunkyfreshFounder

hahaha

yay #1 yayOG 2004

yikes thank goodness i haven't had to deal with this yet, sorry bud

Welcome Back to eZabel

It's been a while. Here's what's new.

eZabel Lore

A complete history of our community — stats, Hall of Fame, legendary threads, and more.

View the Lore →

Curator Commentary

Look for the blue speech bubbles on threads, profiles, and news — notes and context from iwz.

Everything Preserved

All 225,969 pieces of content from 2000–2014 are here — forums, messages, journals, photos, polls, and events.