Jump to content


  • Please log in to reply

Densetsu's Translation Toolbox

, Recommended Resources for ROM Translation Projects (Go to first unread post)
Densetsu Post #16 Posted 20 October 2011 - 01:32 AM

    GBAtemp Ninja


  • Group: Moderators
  • Posts: 2,676
  • Member No.: 116,738
  • Joined: 02-February 08
  • Location: 臨兵闘者皆陳列在前

  •  

View Postthe_randomizer, on 19 October 2011 - 02:15 AM, said:

Update: Right now, I have to take screenshots of each individual text box since there are no up-to-date and/or working tools that will assist me with extracting the necessary scripts. Crap. I was afraid this would happen. None of the tools I found do what I need to do (neither does Geiger's Snes9x). Any suggestions?
I wish I could help, but I have no idea. I can only give advice specifically related to translating.

If you ever need help figuring out certain kanji in some of the screenshots, start a new topic in the ROM Hacking and Translations subforum and post the screens there. I'd be happy to take a look at them for you. Good luck!



the_randomizer Post #17 Posted 20 October 2011 - 02:50 AM

    GBAtemp Advance Fan

  • PipPipPipPipPip

  • Group: Members
  • Posts: 971
  • Member No.: 278,342
  • Joined: 29-April 11
  • Location: In front of the PC monitor

  •  

The funny thing is, Breath of Fire strictly uses the Kana system (like Mother 2, but this uses spacing between particles). I've seen a program called "Cartographer" that supposedly dumps text, but the author isn't clear on how to use the commands. Oh well. I'll post something on the Romhacking forums and go from there. So I'll see what they say about it.

Too bad no one made a program like FF3usME which allows extraction and insertion of dialogue. But, that's what I get for wanting to do something like this.


Densetsu Post #18 Posted 20 November 2011 - 12:56 AM

    GBAtemp Ninja


  • Group: Moderators
  • Posts: 2,676
  • Member No.: 116,738
  • Joined: 02-February 08
  • Location: 臨兵闘者皆陳列在前

  •  

Jeremy Blaustein, a professional Japanese-English video game translator, has single-handedly translated a lot of works including Metal Gear Solid, three Silent Hill titles, a Phoenix Wright game, etc.

In this 12-part interview, he talks about what it's like to translate games professionally as well as the difficulties that he encounters with translating. It's very engaging and worth a listen, whether you're interested in translating games or not. I enjoyed how candid (uncensored) it was and it has some random funny stuff throughout.

Part 1
Part 2
Part 3
Part 4
Part 5
Part 6
Part 7
Part 8
Part 9
Part 10
Part 11
Part 12

The interview isn't exclusively about translating games, which is why I decided not to link these in the OP.


Densetsu Post #19 Posted 27 January 2012 - 10:44 AM

    GBAtemp Ninja


  • Group: Moderators
  • Posts: 2,676
  • Member No.: 116,738
  • Joined: 02-February 08
  • Location: 臨兵闘者皆陳列在前

  •  

I saw this post elsewhere, but I didn't want to post in that thread because it would've been off-topic, so I took it here:
@Sora de Eclaune

View PostSora de Eclaune, on 27 January 2012 - 06:22 AM, said:

Also, to those who said google english isn't proper and can't be... It can be proper if you do a word-by-word translation and make a guesstimation as to the best possible translation of the phrase.
Wrong.

Try translating these using Google. I want to see what meanings you can come up with:
Warning! Spoiler inside. 
Think those are easy? Ok, then try these:
Warning! Spoiler inside. 

Tell me what you think those mean and then I'll give you the correct answers. Mind you, the ones in the second spoiler are lifted straight from games that are currently being translated. I didn't make them up just to prove an extreme point. This is what you can typically expect to encounter when translating games.


Sora de Eclaune Post #20 Posted 27 January 2012 - 11:05 PM

    General Moophington


  • Group: Members
  • Posts: 2,257
  • Member No.: 273,879
  • Joined: 15-February 11
  • Location: Derpinschmoozer

  •  

Warning! Spoiler inside. 


Warning! Spoiler inside. 


I tried my best with that.


Phoenix Goddess Post #21 Posted 27 January 2012 - 11:16 PM

    The Ninja's Protégéé


  • Group: Members
  • Posts: 3,571
  • Member No.: 175,436
  • Joined: 25-April 09
  • Location: Away from civilization.

  •  

View PostSora de Eclaune, on 27 January 2012 - 11:05 PM, said:

Warning! Spoiler inside. 


Warning! Spoiler inside. 


I tried my best with that.


And this is why google translate(and machine translators in general) is always a bad idea.
Not to mention that machine translators can't help you with verbs, particles, direct objects, or hardly anything else to make the sentences sound less like gibberish and more like accurate sentences. They will almost always get them wrong.


Densetsu Post #22 Posted 28 January 2012 - 04:48 AM

    GBAtemp Ninja


  • Group: Moderators
  • Posts: 2,676
  • Member No.: 116,738
  • Joined: 02-February 08
  • Location: 臨兵闘者皆陳列在前

  •  

View PostSora de Eclaune, on 27 January 2012 - 11:05 PM, said:

I tried my best with that.

Correct Translations 
So if I were being generous, I'd say you got 2 out of 3, or a 67% translation accuracy rate, doing your best. And those sentences weren't even that hard. Now for the fun stuff:
Correct Translations 
So you see, Google Translate is utter garbage when it comes to translating. You can get barely acceptable translations of textbook Japanese, but when the Japanese involves puns and vernacular spoken language (which is more common than textbook Japanese), you're going to get it wrong if you rely on Google.

And these are just a few examples. Imagine doing this for an entire game, which may contain anywhere from hundreds to thousands of lines of Japanese text.

As a related aside, I also get annoyed when I see "translation" projects where someone is listed as a "translator" in the credits. Unless you can actually read Japanese, understand it, and put it into another language in a way that is meaningful to the target audience, you didn't "translate" anything. Google did. And poorly, I might add. You should list yourself in the credits as a "Google translation editor," or even "Google translation guesstimator." The title of "translator" should be reserved only for translators.

/rant


FAST6191 Post #23 Posted 01 February 2012 - 02:39 AM

    Techromancer


  • Group: Reporters
  • Posts: 10,960
  • Member No.: 32,303
  • Joined: 21-November 05

  •  

Hopefully the following will not be too much rambling and something nice will come of it.
I already went and although I could bash machine translation (I certainly throw my hat in with those that consider it all but useless) it would serve little purpose at this point but I have been thinking about writing a post or something about what level of Japanese a rom hacker should know if they intend to work with Japanese roms and while I can do the hacking side of things I thought it best to also ask those that do translating. I am not aiming for the the bare minimum (which is arguably next to nothing) but something that allows people to be reasonably functional.

To this end things one should appreciate about the Japanese language when playing rom hacker (hacker side of the fence)
The types of Japanese characters and how they work-
Hiragana and Katakana which are collectively known as the kana. They are the basic constructs of the language with katakana usually being used for loanwords and foreign words and being somewhat more angular than the more freeform hiragana which are used for native words. There are some fairly accepted ordering/sorting methods (I believe gojuon order is the name of the most popular- stuff like http://www.romhackin...t/utilities/55/ should add it) and few games deviate here but said ordering might well leave out some of less common, possibly obsolete, ones the script writer might use or that the game might add entire characters for characters with punctuation (see Dakuten and Handakuten) so be aware of this when constructing tables.

Kanji- the elaborate symbols originally coming out of China of which there are many and most consider to be the harder part of the language to the point where translators tend to know a selection (there are various levels but there is not a set upper limit of kanji) and some works/fields are known for downplaying their use (most notably in the game/anime world for the likes of shounen manga and anime which is nice as games aimed at those audiences are some of the most popular targets for translation). There is no universally accepted ordering of kanji which makes relative searching (one of the most powerful yet simple tools available to the rom hacker dealing with an unknown text encoding) tricky at best.
However there are lists of selected Kanji that are taught to people but unless you also find yourself interested in Japanese education/orthographic history you probably do not need to delve into it as a rom hacker (if I am not overstepping my mark then as a translator it might help as games tend to be written by modern writers for a modern audience and knowing the way the modern language is taught/learned/graded can be a good thing*) and more modern games do often share orderings or parts thereof between games so do give it a look to other games (especially from the same developer/publisher). Although the lists, the distinction between kokuji, kokkun and categories/moji (such that might still be considered to exist) might not be that important you should know the terms radical and stroke ordering of which radical refers to the base stroke/component it derives from and stroke is quite literally how many strokes are needed to finish the deal and is about as close as you might get to a proper ordering system (everything else is so much good luck if you happen to encounter it).
I do not know of any rom hacking tools (or really much in the way of tools in general) that can add these lists like you might be able to add kana or Roman alphabets.
Equally know that games can and have picked and chosen only the ones they use (although they can be in say the order of the script/use in the game/use in the font) or a subset of common ones and added any extras to them.

*for instance a game writer will probably not be calling on some ultra obscure kanji in general text (if in a game as a "magic symbol" or decoration however then all bets are off) so there is probably no need to bust our your copy of Dai Kan-Wa Jiten for the help screen of the latest shounen tie in game.

All three can be intermingled in the text effectively at random (obviously there are situations in the language where things are expected to follow others but as far as your basic regex searches are concerned it is random). If however you want to change a bit of the font to allow for some basic translation and keep some Japanese intact it is probably better to axe a few kanji.

Furigana- ostensibly an optional pronunciation key for Kanji it can go well beyond that depending on the author but it is designed for those that might not know Kanji that well to read things in it (in Japan it is supposed to be for younger people but it often helps those with less familiarity with the language too). Uncommon back on 16 bit and earlier systems the DS however (especially with the touch screen) has seen a fair bit of it (the first Zelda I believe is a good example). This can often be the reason a sentence might be longer than it should be at first glance or pointers might be odd.

Japanese characters in general are fixed width unlike the Roman alphabet (think ijlt vs WQMK and such to say nothing of punctuation) and tend to sit within lines (think jypqf) unlike Roman character sporting languages leading to one of the harder aspects of text hacking in variable width fonts (not necessary for Japanese so they tend not to be added in to the game code) and if you are really flash true line handling code. This cuts the other way as well but I am not sure this is the venue to deal with half width and full width characters/encodings not to mention I will go so far as to accuse it of largely being a historical quirk (indeed one might argue some of the insane stuff the older consoles did was to work around these sorts of issues) and not something that warrants more than a passing mention for most hacking work.

Japanese text by virtue of Kanji but often in general tends to be somewhat shorter in terms of actual characters (the debate still rages when it comes to speech) so expect issues when it comes to fixed width menus (if you have hacked the font then you might consider adding two or more characters into a single tile as a crude but effective workaround) and to have to deal with pointers a lot.

Japanese technically does not have spaces between words with any you see being more aesthetic thanservingavitalfunctioninthelanguage so some of the more cute sentence structure hacking methods (When playing with a relative search tool on an English game I quite often assume there will be a sentence using the word the with a space either side and it often gives me an entire basic table with just that or assume the most common character in text is space and work from there) or pointer inferences (if you are unfortunate enough to be dealing with word level pointers so be it but thinking more a variation on the space being the most common/coming in given locations) but more importantly you should consider this when making a script dump if you are heading down that path and are using such things as a boundary and at least be prepared to be a bit flexible for your translator (I said aesthetic but fitting things in a text box can still be problem for those using Japanese).

However Japanese is not my forte so to the translators short of true language skills what would you like any hackers you work with to have an appreciation of? If you can tie it into a hacking concept like I did for some above even better but that is certainly not necessary. Equally some of my things to know might be given too much emphasis (I can not think of any real occasions where furigana has troubled me as a rom hacker and I mainly mentioned it as it can be a nice place to stick a note or two) or too little (I can not say I have examined custom or standard encodings to see if there is much in the way of kanji ordering vis a vis the ?-moji or schoolboy versions of kyoiku/gakushu vs joyo and if it would help to at least scan over such when dealing with custom kanji encoding setups then yeah). The only things I think I would want to add/expand upon is something on the various Japanese encoding methods (shiftJIS, EUC-JP and such) and how they might fall short but I have yet to be bored enough to vet the encodings against the language (did it once for Arabic and some of the lesser encoding systems there and pretty much vowed never again), something on the vertical vs horizontal writings but intro sequences and graphics aside I am not sure I have seen a game use tategaki (vertical) and maybe something on romaji but that is more of an IME problem than something that worries hackers (if truly necessary I would imagine it would be kicked to OS level, a nice library or to a programmer that knows Japanese).

On a different note entirely it has come up once or twice in recent months in various conversations so I figured it might be worth mentioning- translators official and otherwise have on occasion not shown the Japanese language much reverence (one of the many reasons several games have got retranslations over the years) and it does cut the other way so should something come into a Japanese game from another source (quite often Chinese and European history but certainly not limited to it) and have got butchered along the way then you can "revert" it to how it is/"should be". A less than brilliant example might be some of the names of creatures/concepts in Final Fantasy but if you prefer an analogy it is a bit like treating your DVD or some such as a perfect source for your video- the people charged with the initial encoding might still have hosed it up/phoned it in.

Also as for nice links I linked it up in the past but it was not here so http://www.loekaliza...m/mistakes.html might be worth a scan through and to save me digging up the thread I also linked up http://www.joelonsof...es/Unicode.html .


DS1 Post #24 Posted 01 February 2012 - 02:54 AM

    伝説の雀士

  • PipPipPipPipPip

  • Group: Members
  • Posts: 863
  • Member No.: 158,512
  • Joined: 18-February 09
  • Location: Yes!

  •  

I just read some random stuff on loekalization, I think everybody needs to read Biggest Mistakes, Urban Myths, and the Project From Hell (at least the too much leniency and computer-assisted translation parts)


Densetsu Post #25 Posted 01 February 2012 - 03:53 AM

    GBAtemp Ninja


  • Group: Moderators
  • Posts: 2,676
  • Member No.: 116,738
  • Joined: 02-February 08
  • Location: 臨兵闘者皆陳列在前

  •  

View PostFAST6191, on 01 February 2012 - 02:39 AM, said:

...short of true language skills what would you like any hackers you work with to have an appreciation of?
This topic may contain more information relevant to your question:
Japanese Programming Madness


Densetsu Post #26 Posted 06 May 2012 - 09:05 AM

    GBAtemp Ninja


  • Group: Moderators
  • Posts: 2,676
  • Member No.: 116,738
  • Joined: 02-February 08
  • Location: 臨兵闘者皆陳列在前

  •  

I'm just posting here because I had a random thought regarding translation and I wanted to get it out in writing.

I hypothesize that the name of the character Palutena (of Kid Icarus fame) was mis-translated from the original Japanese.

In Japanese, her name is パルテナ (refer to the Nintendo Wiki link above). The direct transliteration of パルテナ is PA-RU-TE-NA.

But the name パルテナ is taken from the Parthenon in Greece. The Japanese word for "Parthenon" is パルテノン (transliterated as PA-RU-TE-NO-N).

パルテノン = Parutenon = Parthenon
パルテ = Parutena = Should have been translated as Parthena, not Palutena.


My point in posting this to my Translation Toolbox is to get across the idea that when you translate, you really need to consider the source material from which a Japanese word or name is derived, rather than just directly transliterating because it's easy. I just so happened to know of the Parthenon, and I also happened to know that it was called パルテノン神殿 in Japanese, so I made the connection to Palutena (パルテナ).

Perhaps the team responsible for localizing Kid Icarus (which, incidentally, is called 光神話 パルテナの鏡, lit. "Light Mythology: Palutena's Mirror" in Japan) didn't know about the Parthenon. Maybe Palutena would have come to be known as Parthena had someone on the localization team been more familiar with Greek history. Or maybe they did know, but decided to go with "Palutena" anyway.

This issue came up once when someone pointed out the way we handled a translation in Blood of Bahamut:
Warning! Spoiler inside. 


We arrived at this translation by holding a poll:
Warning! Spoiler inside. 


The result prompted the following conversation:
Warning! Spoiler inside. 
Aganar is right about the words "Bahamut" and "Behemoth" coming from the same Semitic root, but our translation "error" was propagated from SE's initial error of turning the two words into two separate things.

We simply did not want to render the Japanese word kyojuu into "giant beast" because that would have sounded generic, so we had to come up with some candidate words to use instead. SE fans expected us to stick to terminology they're used to seeing in an SE game, so that's what we did. And correct though Aganar may be, the fact of the matter is that the vast majority of people wouldn't have cared about staying true to Semitic etymology.

Which brings me back to Palutena. Although the etymologically correct translation of パルテナ would probably have been "Parthena," maybe the translators had their reasons for localizing it to Palutena instead. This is just one instance of the phrase "translation is an art."

Go figure.


Masquerade-Q DELETED Posted 06 May 2012 - 10:36 AM

    Member

  • Pip

  • Group: Newcomers
  • Posts: 10
  • Member No.: 302,661
  • Joined: 04-May 12

  •  

Post Deleted By a Moderator On 06 May 2012 - 08:29 PM.
Reason: No. ~Den
 

FAST6191 Post #27 Posted 06 May 2012 - 12:17 PM

    Techromancer


  • Group: Reporters
  • Posts: 10,960
  • Member No.: 32,303
  • Joined: 21-November 05

  •  

A nice example and I get the reasoning but where they might derive from the same work and maybe even word to my reading at best it is a Bahamut and Neo Bahamut situation.

"Catastrophis - God of Catastrophes"

Depending upon your translation more than a few Abrahmic, Greek, Roman, Norse...... mythological figures lent their names to or had their names derived from phenomena or at least derived fairly recognised synonyms (possibly also antonyms and words that are related but take some explaining- Vulcanisation is the process of adding sulphur to plastics for instance). I agree it can look a bit odd but "Final chapter" tends to mean (whether it is a trope I will leave for a later debate) boss of all bosses give or take bonus content throughout fiction and even more so in games.

Equally it might not have as much recognition as say samurai, ninja, shuriken or katana... but kyojuu/Kaiju (pushing it a bit perhaps) has never the less become something of a loanword in English (more or less referring to the monsters as seen in Godzilla films) with other connotations which may well be different to Japanese (which seems to roughly translate/have similar range as beast- beast of burden, a mythological beast, "what a beast"/is that your beast?) but exist none the less.

Anyway the talk of continent/world bearing monsters has now got me thinking of Discworld so before I start pondering whether modern interpretations of mythology (it it a bit of a longer range but see the evolution and variations on elves) or indeed the mythology of modern fiction (certainly they have entered the lexicon- see superman) and have to beat down the meeja studies/philosophy part of me once more I think I will leave with if you ever needed another reason why machine translation is not something you should ever use this would be it- I can see machine translation getting somewhere even to the point where it can use context, dialect and maybe a massive database but this sort of thing is true or AI or not at all.


Densetsu Post #28 Posted 06 May 2012 - 10:28 PM

    GBAtemp Ninja


  • Group: Moderators
  • Posts: 2,676
  • Member No.: 116,738
  • Joined: 02-February 08
  • Location: 臨兵闘者皆陳列在前

  •  

View PostFAST6191, on 06 May 2012 - 12:17 PM, said:

Equally it might not have as much recognition as say samurai, ninja, shuriken or katana... but kyojuu/Kaiju (pushing it a bit perhaps) has never the less become something of a loanword in English (more or less referring to the monsters as seen in Godzilla films) with other connotations which may well be different to Japanese (which seems to roughly translate/have similar range as beast- beast of burden, a mythological beast, "what a beast"/is that your beast?) but exist none the less.
Agreed. After all, we don't translate words like "sushi." Sushi is sushi is sushi. The same is true for all the other words you mentioned. In my last post I forgot to mention that we probably could have just rendered 巨獣 as simply "kyojuu" and left it at that.
Warning! Spoiler inside. 



FAST6191 Post #29 Posted 06 May 2012 - 11:15 PM

    Techromancer


  • Group: Reporters
  • Posts: 10,960
  • Member No.: 32,303
  • Joined: 21-November 05

  •  

English suffix or portmanteau of something like shogun and senate? I guess it matters little in the end but I figured it should be pondered at least.

Also it might just be a bit too much total war back when but shogun to me translated more closely to a lord in the classical English sense as opposed to the rather more narrow terms general and warlord in that they were (generals- not terribly useful in peacetime other than readiness but lords tend to do the actually running of stuff too) or am I conflating it with daimyo?. Now this might be falling back to the samurai as more than a warrior thing but I sense this is getting off topic.

Speaking of translating sushi though ignoring that I can (whether I should though...) get it from street vendors outside asiatown if I want and it is not surprising to see it in the chiller cabinet of a basic city centre supermarket ( http://moblog.net/me...ravel-snack.jpg ) it has some implications of the exotic and moreover it is not quite translated precisely (how many people will still call it raw fish as opposed to the general method of preparation that it is). This applies even more so for rice balls and some milk drinks so there been occasions in translations where even sushi will get axed in favour of some random generic foodstuff even to the point where it cuts the other way and it will end up being called something like scampi (I just checked and it too has similar "mistranslations" to sushi or regional variations).

Equally I also failed to acknowledge properly that I agreed entirely with the decision and that as it has become part of the S.E. mythology (both as part of the translated works and in general) to differ here would be a cause to raise an eyebrow.







Users browsing this topic

1 user(s) are reading this topic

0 members, 1 guests, 0 anonymous users