https://www.duolingo.com/wordgeek416

Vietnamese encoding fail

wordgeek416
  • 25
  • 25
  • 25
  • 25
  • 25
  • 25
  • 25
  • 25
  • 25
  • 23
  • 19
  • 18
  • 17
  • 10
  • 6
  • 4
  • 931

Victoria Park subway station in Toronto, Ontario, Canada includes a public art installation called "Roots". Part of the installation is a series of images of the globe with the word for "community" written in various languages. Each image has three languages.

(The above image is from Wikipedia. Below is a close-up view.)

If you look closely at the circle at the bottom left, you'll see the words "Coäng ñoàng" which at first glance looks vaguely Vietnamese but the letters don't look right (in particular, Vietnamese doesn't use ñ and doesn't have umlauts).

Searching for "coäng ñoàng" turns up thousands of pages that claim to be in Vietnamese, for example this page about Vietnamese immigrants in London, Ontario

Before Unicode came along, there were several encoding standards for Vietnamese. I found a conversion table with various legacy character encoding schemes and figured out that the original text was "cộng đồng" but encoded in VNI. Some single glyphs in Unicode are represented by multiple byte characters in VNI.

  • ộ Unicode U+1ED9, VNI oä LATIN SMALL LETTER O WITH CIRCUMFLEX AND DOT BELOW
  • đ Unicode U+0111, VNI ñ LATIN SMALL LETTER D WITH STROKE
  • ồ Unicode U+1ED3, VNI oà LATIN SMALL LETTER O WITH CIRCUMFLEX AND GRAVE

So it looks like the artist who created the work had a VNI-encoded file from whoever created the Vietnamese text, but was reading it on a Unicode-based system so the letters came out looking different from what was originally intended.

2 years ago

3 Comments


https://www.duolingo.com/wordgeek416
wordgeek416
  • 25
  • 25
  • 25
  • 25
  • 25
  • 25
  • 25
  • 25
  • 25
  • 23
  • 19
  • 18
  • 17
  • 10
  • 6
  • 4
  • 931

Found another one outside the station:

This is supposed to say "cội rễ" ("roots")

2 years ago

https://www.duolingo.com/AnCatDubh
AnCatDubh
  • 18
  • 15
  • 14
  • 13
  • 13
  • 13
  • 12
  • 12
  • 12
  • 12
  • 11
  • 11
  • 11
  • 11
  • 11
  • 10
  • 10
  • 9
  • 9
  • 9
  • 9
  • 8
  • 8
  • 7
  • 6
  • 6
  • 5
  • 4
  • 3
  • 2
  • 1471

And the Hebrew is wrong, too...

2 years ago

https://www.duolingo.com/LiKenun
LiKenun
  • 20
  • 17
  • 13
  • 13
  • 13
  • 12
  • 11
  • 10
  • 8
  • 6
  • 6
  • 6
  • 3
  • 2
  • 2

You’d think that since these art installations are being put up in communities that have these diverse populations, they would actually look for someone competent in those languages to do some basic quality checks.

1 year ago
Learn Vietnamese in just 5 minutes a day. For free.