• If you do not recieve your confirmation email within a few hours, please email haloutau@gmail.com with your username for manual validation. Your account should be activated within 24 hours.
    You may also reach out via any other listed contact on Admin Halo's about page: https://utaforum.net/members/halo.194/#about

wtf is wrong with my voicebank

Row 4

Teto's Territory
Can you describe the issue?
I appreciate the download link but we need to know what the problem is before we can fix it~!
 

bingos_baby

Momo's Minion
Could you post a sample about what exactly is the issue? Like audio or a screenshot? I used a couple sample USTs and it sounds fine. I used a different CV UTAU in each test to compare/contrast, and honestly I'm not seeing a big, glaring problem. The Shinkai Shoujo UST I used is untuned so I could see a baseline of how the VB is working, and the only issues I saw was the oto might be a bit choppy. Give it another pass, tweak some stuff, and you'll be fine. And in the demo snippet I tested, the enunciation might be a bit hard to understand, but that's on the quality of the recording samples. There may be a couple you'll want to rerecord, but that's your call. None of these are huge issues and can be easily fixed.

Demo (tuned): https://files.catbox.moe/zqvaic.wav (not choppy, your VB is the first two verses)
Shinkai Shoujo (untuned) : https://files.catbox.moe/gek82w.wav (your VB comes in first, it's a lil choppy)
 

heynotloid

Momo's Minion
Thread starter
Could you post a sample about what exactly is the issue? Like audio or a screenshot? I used a couple sample USTs and it sounds fine. I used a different CV UTAU in each test to compare/contrast, and honestly I'm not seeing a big, glaring problem. The Shinkai Shoujo UST I used is untuned so I could see a baseline of how the VB is working, and the only issues I saw was the oto might be a bit choppy. Give it another pass, tweak some stuff, and you'll be fine. And in the demo snippet I tested, the enunciation might be a bit hard to understand, but that's on the quality of the recording samples. There may be a couple you'll want to rerecord, but that's your call. None of these are huge issues and can be easily fixed.

Demo (tuned): https://files.catbox.moe/zqvaic.wav (not choppy, your VB is the first two verses)
Shinkai Shoujo (untuned) : https://files.catbox.moe/gek82w.wav (your VB comes in first, it's a lil choppy)
would I have to re-record?
 

bingos_baby

Momo's Minion
would I have to re-record?
If you want to. A lot of people don't worry too much about the quality of voice samples and focus more on the sound, while others may focus on "How well can I understand what the VB is saying?" I mean, I can't understand a word some VOCALOIDs are saying, so I don't think you should worry about it too much. I brought it up in case that was what your initial worry was. I think of it more as something to keep in mind if you record future UTAUs. Mic quality is also something to consider, but that's an investment you should look into if you record more VBs. Sorry if that's not the clearest answer, it truly does boil down to personal preference.
 

Halo

Icon by Wanpuccino @ DA
Administrator
Defender of Defoko
Your recordings are fine.
This is a partially configured CVVC voice bank. Was that intentional? Are you using OpenUtau, autoCVVC or presamp? If not, that may be part of the issue. if you finished the aliases and config it would sound okay. many of your oto entries are a bit strange but I can try to find time to come back and share some corrections that should help.
EDIT: Also, this appears to be a wastelandutau reclist-- those come with base oto.inis you can start with. Did you use that?
 
Last edited:
  • Like
Reactions: Kiyoteru

heynotloid

Momo's Minion
Thread starter
Your recordings are fine.
This is a partially configured CVVC voice bank. Was that intentional? Are you using OpenUtau, autoCVVC or presamp? If not, that may be part of the issue. if you finished the aliases and config it would sound okay. many of your oto entries are a bit strange but I can try to find time to come back and share some corrections that should help.
EDIT: Also, this appears to be a wastelandutau reclist-- those come with base oto.inis you can start with. Did you use that?
i am using UTAU and otoining by hand
 

Halo

Icon by Wanpuccino @ DA
Administrator
Defender of Defoko
Okay, so to use your voicebank you need to run it through either presamp or autocvvc, for which you may need to create a presamp.ini first. CVVC in general is better off used in OpenUtau or using the PhonemizeronUTAU plug in just for sanity's sake but it's up to you.

But also it not being complete will not help of course haha. IMHO using the base oto is a better idea bc it means you dont have to manually alias which is very taxing for a voicebank type that's already hard to test unless you're already quite familiar with UTAU or are using OpenUtau.

Using the base oto, OpenUtau, and vlabeler I got a "complete" and usable alias situation so I could test:

1742009628393.png
Only oto'd these lines in full as a test to see if everything was fine, it is, no overlap blips... So you just gotta keep going, and make sure you're using something that lets you actually use a CVVC bank.
A basic visual in case you wish to continue manually aliasing, before and after of your VCs:
1742011461161.png1742011517933.png
As you can see, the preutterance needs to be at the start of the consonant rather than the middle or end to prevent early starts or strange overlapping. The overlap should be just before the previous vowel begins to taper off in the waveform, or spectrogram (my screenshots are in vlabeler, but you can get a spectrogram view in any otoing program).

Additionally, VCs will ALWAYS be named "n f" and not "ん f" as all solutions for VCCV use a standardized "romaji prior vowel, next consonant" format to match VCVs "romaji prior vowel, hiragana CV" format. You do, though, need hiragana CVs. Your own aliases also miss out on the beginning (- CV) sounds, which are part of why you say the syllable twice-- That way you have - ka for the start of phrases, as well as ka for after a VC. They have different needs, so it's important and possibly part of why you're encountering issues.

Another example, this time how i would recommend entries for your go-go file be done.
1742012262629.png1742012306664.png1742012330624.png

Some reading for ya that should help with some basics that seem to be throwing off your V C otos in particular, though your CVs also need some brushup (preutterance always directly at the end of the consonant and start of the vowel btw):
https://yinsototutorial.weebly.com/9-cvvc-japanese.html
https://utaforum.net/resources/anatomy-of-the-oto.321/
https://github.com/stakira/OpenUtau/wiki/Voicebank-development

I'm blunt, but I hope this doesn't come across as harsh. It's hard to tell what you need and how best to convey it when you're giving this little information so i just have to generalise.
 
  • Like
Reactions: Row 4 and Kiyoteru

Similar threads