UTAU Glossary

A list of words commonly mentioned when talking about the UTAU/Vocaloid software and fandom

  1. MystSaphyr

    Below is a list of words commonly mentioned when talking about the UTAU/Vocaloid software and fandom. This list is continually being updated. To find a term, you can use ctrl+f on PC or cmd+f on Mac to search the list.


    Ameya/Ayame - The programmer of UTAU and the maintainer of the UTAU website/blog.

    AppLocale - An alternative way of running Japanese software on an English-language machine if the East Asian language packs are not installed. However, entering hiragana in UTAU under AppLocale causes crashes, so only romaji may be used. See also hiragana, katakana, romaji

    Audacity - Free recording software that can be used to record wav samples for a voicebank. See also OREMO, Reaper, mixing, wav

    CV - "Consonant-Vowel" The traditional recording system of UTAU. Each sound is either a single vowel (a, i, e, o, u, with the lone "n" also counted here) or a consonant-vowel pair (ka, ji, no, etc.). See also triphones/triphonics, diphones/diphonics, monophones/monophonics, VCV, diphthong, phoneme
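
As a rough illustration of the CV idea, the sketch below splits a romaji word into single-vowel and consonant-vowel units. The function name `split_cv` and the regular expression are assumptions made for this example, not part of UTAU, and the pattern only covers simple romaji (no long-vowel marks, no special handling of doubled consonants).

```python
import re

# A lone "n" is its own unit only when no vowel (or "y") follows it;
# otherwise it begins a consonant-vowel unit such as "na" or "nya".
CV_UNIT = re.compile(r"n(?![aiueoy])|[bcdfghjklmnpqrstvwxyz]*[aiueo]")


def split_cv(word):
    """Split a simple romaji word into V and CV units (hypothetical helper)."""
    word = word.lower()
    units = CV_UNIT.findall(word)
    # If the pieces do not rebuild the word, something was not valid CV romaji.
    if "".join(units) != word:
        raise ValueError(f"cannot split {word!r} into CV units")
    return units


print(split_cv("arigato"))     # ['a', 'ri', 'ga', 'to']
print(split_cv("konnichiwa"))  # ['ko', 'n', 'ni', 'chi', 'wa']
```

Each unit in the output corresponds to one sample a CV voicebank would need.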

    Defoko/Utane Uta (or "Dehuo") - The default voice of UTAU, created from an AquesTalk text-to-speech voice. Defoko, or "default child," was her original name, but it was considered offensive, hence the later name Utane Uta (song-sound song). Dehuo is merely an early mistranslation of "Defoko". See also Kasane Teto

    diphones/diphonics - A unit of sound consisting of two more basic sound units. Examples would be consonant-vowel combinations such as "ka" or vowel-vowel diphthongs such as "ai". See also triphones/triphonics, monophones/monophonics, CV, VCV, diphthong, phoneme

    diphthong - A two-vowel combination that is pronounced as a single unit. For example, "ai" is not pronounced as a separate "ah-ee" but as "I". See also triphones/triphonics, diphones/diphonics, monophones/monophonics, CV, VCV, phoneme

    Fanloid/Fanmade - Any Vocaloid-like or UTAU-like fanmade character that either has a voice derived from an existing voicebank, or no actual synth voice at all.

    flags - An editing feature in UTAU that helps change the tonal quality of notes. Flags include Y and H (clarity of sound), bre (breathiness), and g (gender: + for lower, - for higher). See also genderbent/genderbend
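
To make the flag notation concrete, here is a minimal sketch that reads a flag string into name/value pairs. It assumes flags are written as a run of letters followed by a signed whole number (for example "g-5Y0H0"); the function name `parse_flags` and that layout are assumptions for illustration, and real flag strings may contain forms this pattern does not handle.

```python
import re

# Assumed layout: letters (the flag name) followed by an optionally signed integer.
FLAG = re.compile(r"([A-Za-z]+)([+-]?\d+)")


def parse_flags(flag_string):
    """Return a dict of flag name -> integer value (hypothetical helper)."""
    return {name: int(value) for name, value in FLAG.findall(flag_string)}


flags = parse_flags("g+10Y0H0bre30")
print(flags)  # {'g': 10, 'Y': 0, 'H': 0, 'bre': 30}

# Per the entry above, a positive g value lowers the voice and a negative one raises it.
direction = "lower" if flags["g"] > 0 else "higher" if flags["g"] < 0 else "unchanged"
print(direction)  # lower
```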

    genderbent/genderbend - The term for a voice that has been edited to sound like the opposite gender. While many people simply shift the pitch of a song higher or lower, the correct way to genderbend a Vocaloid or UTAU is to change the g flag or the gender settings in the software itself. See also flags

    hiragana - One of the two "kana" syllabaries (phonetic scripts) of the Japanese language. Hiragana are the characters typically used in UTAU. See also AppLocale, katakana, romaji

    JOKAloid - A term used for the UTAU PSS character Donka Fjord. Fjord was an April Fool's Day joke. His demo featured a very high-quality voicebank, but the released bank consists only of random noises and comes through UTAU as mechanical gurgling. Fjord's genderbend, Donka Dasha, also falls into this category. See also VIPPAloid

    Kasane Teto - A fake Vocaloid that was the second voice usable in the UTAU software, after Defoko. Teto was an April Fool's joke created on the VIP board of the Japanese message board 2ch. She is not an official Vocaloid, although many fans believe she is because she resembles the Crypton style and has a high-quality voicebank. The name of her voice source is a parody of the name of Doraemon's seiyuu (voice actor). See also Defoko/Dehuo/Utane Uta

    katakana - One of the two "kana" syllabaries (phonetic scripts) of the Japanese language. See also AppLocale, hiragana, romaji

    MIDI - "Musical Instrument Digital Interface" MIDI files store music as sequences of notes that the computer plays back with software instruments, rather than as recorded audio. This makes them much smaller than mp3s and similar file types, though they sound more computerized. MIDI tracks can be imported into UTAU, where they appear as a series of notes sung with "ah" by the UTAU voice. See also VSQ, UST

    monophones/monophonics - Phonemes consisting of a single sound. These would be the most basic consonants and vowels that make up all languages. See also triphones/triphonics, diphones/diphonics, VCV, CV, diphthong, phoneme

    OREMO - Audio recording software developed especially for UTAU. It lets the user record against a guide tone so that all samples are on the same note, and it works from lists of sounds (reclists), which makes sorting recordings easier. See also Audacity, wav, reclist

    phoneme - A base unit of sound that is combined with other phonemes to make up words in a language. See also triphones/triphonics, diphones/diphonics, monophones/monophonics, VCV, CV, diphthong

    Piapro - A Crypton-sponsored online Vocaloid community. Vocaloid users and artists can upload their songs and artwork for others' enjoyment. It is a hub of many prominent Vocaloid producers. However, posting UTAU music is NOT allowed. See also "P-name"

    "P-name" - An honorific, "-P" added to the end of a name, used mainly for Vocaloid users and some UTAU users to designate them as a "producer." It is a polite way of showing one's appreciation and respect for a user's music, art or MMD work. The "P" suffix on Piapro, however, can only be obtained by producing many quality Vocaloid works. P-names CANNOT be given to oneself, and DO NOT necessarily mean one has created original music. The topic of the P-name is often one of heavy debate. See also Piapro

    Reaper - Audio editing software. Reaper is professional-level software with a fully functional evaluation period; it keeps working after the trial ends but prompts for purchase of a license, much as WinZip does. Reaper offers far more editing power than Audacity. See also Audacity, wav, mixing

    reclist - A list of syllables to be recorded, used in OREMO. See also OREMO

    romaji - The romanized (Latin-alphabet) spelling of Japanese kana. Can be used in UTAU, and if UTAU is run through AppLocale, it is the ONLY way of entering lyrics. See also AppLocale, hiragana, katakana
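
The relationship between romaji and hiragana can be sketched as a simple lookup. The table below is deliberately partial and the function name `to_hiragana` is hypothetical; it only shows the idea that each romaji syllable stands for one kana character.

```python
# Deliberately partial table: just enough kana for the example below.
ROMAJI_TO_HIRAGANA = {
    "a": "あ", "i": "い", "u": "う", "e": "え", "o": "お",
    "ka": "か", "ki": "き", "ku": "く", "ke": "け", "ko": "こ",
    "ga": "が", "ri": "り", "to": "と", "n": "ん",
}


def to_hiragana(syllables):
    """Convert a list of romaji syllables to hiragana (hypothetical helper).

    Syllables missing from the partial table are passed through unchanged.
    """
    return "".join(ROMAJI_TO_HIRAGANA.get(s, s) for s in syllables)


print(to_hiragana(["a", "ri", "ga", "to"]))  # ありがと
```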

    triphones/triphonics - A sound consisting of three separate, basic sounds. Used in UTAU to describe the recording methods used by Ritsu, Teto, and several "new" UTAU; however, the correct term is VCV. See also VCV, CV, diphones/diphonics, monophones/monophonics, diphthong, phoneme

    UST - "UTAU Sequence Text" UST is the native filetype of UTAU and is the only one that the software reads without requiring import or special editing. See also VSQ, MIDI

    UTAU - Relates to either the UTAU software or a character avatar and/or voicebank. To be considered an UTAU, a character MUST have a usable voicebank for the UTAU software. See also UTAUloid

    UTAUloid - Fanmade term, derived from "Vocaloid," for an UTAU character avatar. To be considered an UTAUloid, a character MUST have a usable voicebank for the UTAU software.

    VCV - "Vowel-Consonant-Vowel" A recording technique used to make an UTAU sound more natural, also called "triphones" or "triphonics." By recording strings of syllables and using oto (oto.ini) entries to split them up, one can carry the preceding vowel into each consonant-vowel unit for sound that flows more naturally. Example: "aRiGaTo" becomes "ari-iga-ato." See also triphones/triphonics, diphones/diphonics, monophones/monophonics, CV, diphthong, phoneme
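
The "aRiGaTo" example can be reproduced with a short sketch that prefixes each syllable with the sound the previous syllable ends on. The names `trailing_sound` and `to_vcv` are hypothetical, and the output uses this glossary's notation rather than the alias format of any particular voicebank.

```python
VOWELS = set("aiueo")


def trailing_sound(syllable):
    """Return the sound a romaji syllable ends on: its final vowel, or "n"."""
    if syllable == "n":
        return "n"
    last = syllable[-1]
    if last not in VOWELS:
        raise ValueError(f"{syllable!r} does not end in a vowel")
    return last


def to_vcv(syllables):
    """Pair every syllable after the first with the preceding vowel."""
    return [trailing_sound(prev) + cur for prev, cur in zip(syllables, syllables[1:])]


print("-".join(to_vcv(["a", "ri", "ga", "to"])))  # ari-iga-ato
```

Because each unit begins with the vowel that ended the previous syllable, neighbouring units overlap on that vowel, which is what gives the smoother transitions.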

    VIPPAloid - A prank-Vocaloid-turned-UTAU created by members of the VIP board on 2ch. VIPPAloids include Kasane Teto, Namine Ritsu, and Yokune Ruko. See also JOKAloid

    Vocaloid - Yamaha's professional vocal synth software on which UTAU is based. Popular Vocaloids include Hatsune Miku, Kagamine Rin and Len, Gumi/Megpoid, etc. Yamaha does not support or endorse the UTAU software but Crypton has shown support and enthusiasm for it.

    VSQ - Vocaloid SeQuence file. Consists of notes and syllables/lyrics, and can be imported into the UTAU software. See also UST, MIDI

    wav - The sound file used in UTAU to build voicebanks. Sounds must be exported from recording software in .wav format for UTAU to recognize them. See also OREMO, Audacity, Reaper

Recent Reviews

  1. kimchi-tan
    Somewhat outdated but would still be very useful for newcomers.