Romanization guidelines

Japanese Romanization

Using Machine Translation tools like DeepL or Google Translate is expressly forbidden. Do not Romanize or translate if you do not know the language well enough.

This guide assumes the reader has basic knowledge of Japanese grammar to be able to correctly Romanize. If you wish to learn more about Japanese grammar, visit http://www.guidetojapanese.org. See also the wiki page for Romanization walkthrough.

  • If you are unsure about the Romanization, leave a comment on the entry. You can use @(username) to notify the editor who added the Romanization.
  • Commonly accepted Romanizations will be allowed, even if they’re technically incorrect and are generally exempt from the guidelines. More accurate Romanizations can always be added as aliases.
  • If the Romanization is provided by the producer on an official page, or by another official source such as the record label, use that instead.

Voice synth-related Japanese slang and terminology is collected in this Google Spreadsheet.

Basic Rules

Use the rules of Modified Hepburn Romanization unless an exception has been indicated below.

  • Capitalization: Capitalize all words that are NOT single kana particles(は、が、を、etc)
    • E.g. 歌に形はないけれど would be Romanized as ‘Uta ni Katachi wa Nai Keredo’
  • Separate all words and particles with spaces.
  • When adding the P to a producer’s name, do not use a dash.
    • E.g. めざめP would be romanized as ‘MezameP’ not ‘Mezame-P’
  • If ん is followed by a vowel(including や・ゆ・よ), Romanize it as n’
    • E.g. してやんよ would be Romanized as “Shiteyan’yo”
  • Use the same order as the Japanese name. Usually this is Lastname Firstname. Providing the alternate order (usually the western order Firstname Lastname) is usually not necessary. Note that the rule stated above still applies: if an officially accepted Romanization already exists, use that.

Exceptions

  • Loanwords are Romanized into their original language. コーヒー would be Romanized as ‘coffee’.
    • However, if the loanword is complex scientific latin nomenclature, the common name for said article is preferred. For an example, see http://vocadb.net/S/25395 *If the entire name is composed of loanwords, leave the Romanized name field empty. Use the translated name field instead.
  • づ is Romanized as ‘dzu’
  • Do not use macrons for long vowels. こおり is Romanized as ‘koori’, not ‘kōri’
  • Use a dash when appending honorific suffixes to names unless the suffix is part of the name.
    • お姉さん would be Romanized as ‘Onee-san’, but 西沢さんP would be Romanized as ‘NishizawasanP’ because the –san is part of the producer’s name.

Particles

Wikipedia: http://en.wikipedia.org/wiki/Japanese_particles

  • は is Romanized as ‘wa’ if it is used after a noun as a topic marker:
    • E.g. 恋は戦争 would be Romanized as ‘Koi wa Sensou’, the は is not part of a word and functions as a particle
  • へ is Romanized as ‘e’ if it is used after the name of a place to indicate direction as a particle.
    • E.g. 駆ける、星空の彼方へ would be Romanized as ‘Kakeru, Hoshizora no Kanata e’, the へ is not part of a word and functions as a particle
  • を is Romanized as ‘o’ if it is used after a noun as a direct object particle (90% of the time).
    • E.g. 地球最後の告白を would be Romanized as ‘Chikyuu Saigo no Kokuhaku o’

Chinese Romanization

  • Use the Pinyin system.
  • The use of tonal marks is encouraged, but not necessary.
  • Capitalize every romanized word.
  • Separate the romanization of each Chinese character with a space, but special names and proper nouns are kept together.
    • Example: 北京 would be romanized as “Běijīng” or simply “Beijing”
    • For names of people, separate the first and last name with a space.
      • Example: 毛泽东 would be romanized as “Máo Zédōng” or “Mao Zedong""

Transcription

Many older songs (and some newer songs) do not provide lyrics in their descriptions or elsewhere, and their lyrics can only be found in the PV itself. A decent amount of other songs do not have lyrics in the PV either, and therefore their lyrics have to be transcribed by hearing.

Visual Transcription

Visual transcriptions are almost always easier than auditory ones, as you can rely on both seeing the characters themselves and having the readings to go with them from the song’s audio.

  • If you transcribe from a PV, make sure to link the PV as the source and add (Transcribed from PV) next to it. For example, “YouTube (Transcribed from PV)“
  • The use of OCR (Optical Character Recognition), such as Google Lens and other AI-assisted tools, is allowed, but they should not be blindly trusted. If you use them, make sure to confirm that they are accurate before adding them to an entry. Keep in mind that OCR is often quite unreliable on stylized or low-quality text.

Audio Transcription

Audio transcription is much more difficult than visual transcription, duing to the wide variety of characters that share similar or identical readings. Transcribers need to not only know these characters, but also decide which one fits the given context best. For this reason, it is generally discouraged to attempt audio transcriptions unless you are advanced enough in the languge.

  • Ai-assisted tools are not allowed for audio transcription, as they are extremely inaccurate with languages like Japanese and Chinese and even more so with Vocaloid.
Terms of service Privacy & cookie policy License About VocaDB Staff Contact