When it comes to vocal synths, we should strive to remember to just have fun! This article will go in to detail about handling of syllables like "hya" and "kya" in Japanese. If you are happy with how your UTAU or Diffsinger model sings in Japanese, then please ignore this article.
Multi-pitch CVVC banks do not work properly with the shareware A for automatic button!! Any articles where I complain about CVVC banks being broken is my own fault for not figuring it out sooner!!
Monday, May 19, 2025
Thursday, February 27, 2025
Trying To Make a Diffsinger G2P Phonemizer Plugin
So, hey guys! I'll just be real with you... I haven't actually gotten this to work all the way to the end. But it was really painful to try and get this all to work so... I'll share my current knowledge with you.
Diffsinger does not need a traditional phonemizer. The timing is all taken care of by the duration model. Technically with a good enough dsdict, you'll be happy with just using the default DIFFS phonemizer. Heck, you can just type phoneme hints and be happy.
But by creating a phonemizer, you are able to make it so that people can type in any word (or nonsense) and get results... Whereas using a dsdict method means that if a word isn't in the dsdict, it just doesn't know what the heck is going on.
There are three ways to make your phonemizer usable as far as I know. The first is by handing it over to the people in charge of OpenUtau for them to put into the newest release. You really wanna make sure it's good before you do that! The second is by recompiling OpenUtau after creating the phonemizer. This would be great for testing, but very annoying to share. The final method is releasing it as a plugin. I feel a little bad I'm writing this before I actually complete the project, but I just keep running into problems that I'm not equipped to handle given I just woke up.
So, if you're interested in learning how to do this, keep reading!
As a precaution - Anaconda and Visual Studio are pretty hefty. If you have a computer with little hard drive space, you may want to sit this one out because it takes up a lot of space. That's why I've been doing this all on my laptop and not my tablet.
I am a Colab girl. I love Google Colab. However, I was unable to get the Colab that had been provided to work at all... Though shout out to the person behind it because I wouldn't have been able to figure out the correct requirements without it.
Credits:
OpenUtau (and g2p code): Stakira
Google Colab (that I used to figure out the correct requirements): LotteV
Repo to create phonemizer plugin: Tyler (spicytigermeat)
Update 2025/03/05 -
Sunday, January 5, 2025
Art Tutorial - Easy Painterly Anime Skin
I got into art again.
Something that really annoyed me was that I just couldn't figure out how artists got the skin to look the way that they do.
I don't have all of the answers! But I do have a workflow that can get you some pretty nice results!
Credit to Fungzau on YouTube for making the videos that taught me this.
Wednesday, December 11, 2024
Using English C+V As a Voice Copier
I never thought I'd be C+V anything!
Using English C+V As a Voice Copier
Tuesday, October 1, 2024
[Art Tutorial] Heads Up! The Biggest Part of Drawing Bodies
This resource doesn't belong on this blog, but why not put it here?!
What are heads?
Realism
![]() |
Image from Pexels |
Anime Style
![]() |
I think this is the official Miku box art pose |
Princess Peach
Any Style You Want!
Saturday, March 16, 2024
The Easiest Way to Make Realistic UST Files Using Praat!
I should have explained this sooner!
The Easiest Way to Make Realistic UST Files Using Praat!
Wednesday, February 15, 2023
Mandarin Chinese Phonemes for English Speakers
Here is the pinyin list! The files must be saved with the pinyin names to work with the base oto. Some of the phonemes might actually break filenames, but if you figure out a way to encode the filenames in phonemes and fix the base oto to reflect that, I will happily link to that as an option!
Here is the phoneme list. You might be able to set up OREMO to have it save the names from the pinyin list with the phoneme list as comments, but I was never able to get that to work with my English lists before I found the magic of using words. (I still couldn't get the comments to work, they just weren't needed anymore!)
(Note: I have no idea if I got the syllables like bo, po, mo, fo correct. They're in their own section under 'o', but have the same phonemes as those under -uo.)
I need to make a few edits to the base OTO. Not many, just a few. But I'm waiting until some more people record this list! Why? Because I'm doing this on my phone, but I need to use my computer to access the base oto :P
Saturday, May 14, 2022
Use My Voice to Make Your UTAU Sing Cantarella Realistically!
Can you use Praat instead of VocalShifter?! How can you make a CV UTAU voicebank realistic? I'm actually using SEO tricks I learned at LearnMMD!
Use My Voice to Make Your UTAU Sing Cantarella Realistically!
Friday, July 2, 2021
I was a big dummy - Multi-pitch CVVC banks are broken by the automatic button.
Hi there! I'm on mobile at the moment, but I felt it was important to make this announcement.
Almost every single time I complained about a Multi-pitch CVVC bank being broken, it wasn't the bank's fault. I had left the automatic button in shareware utau on at all times as that button automatically converts CV to VCV and shows exactly what samples is playing. Really useful!
However, it breaks banks that include VC samples. In a single pitch bank, you don't notice as what it does as it just gives up and refuse to apply a suffix. With Multi-pitch, especially when every sample has a suffix, that means stuff just doesn't play.
With the number of articles on the blog, I have no idea how many places I've made this mistake. If I find an article where I messed up, I will edit it to point out the mistake. However, like I said, I have no idea what articles have the issue. Articles posted after this may have the issue, as I have articles written and scheduled until 2022.
I will try to do better in the future!
Wednesday, December 2, 2020
Hopefully this isn't Goodbye, Clyp.
Monday, September 21, 2020
Downloading from Axfc In The Year of Our Lord 2020
Well, I'm glad I can make this article, but I'm sad that I have to.
Downloading from Axfc In The Year of Our Lord 2020
Monday, February 10, 2020
Do I need an expensive microphone?
Let me get it out of the way - I will always suggest the AT2020 USB. It is more expensive than a comparable microphone from Blue, but I never really liked how the Blue Yeti in particular sounded. Both microphones seem really expensive at over $100 (Yeti seems to have dropped to sub-100 used), but that's because they include an audio interface inside of the body of the microphone.
I didn't know what audio interfaces were, at all, for probably my entire career making an UTAU. An audio interface will allow you to get a higher quality microphone for less money (AT2020 USB goes for $149, but XLR goes for $99 according to Amazon), but the audio interface (especially with phantom power) will likely cost more than just getting a really nice USB microphone.
If you really, really want to spend money on a microphone, ask yourself what you will use it for. If your only answer is UTAU, I would cap spending at under $200 USD and go with an AT2020 USB. People do buy thousand dollar microphones for UTAU. (Of course, used and from eBay so it's closer to $600.) If you have the money, and you can see yourself using it for singing, streaming, or podcasting, then go for it! But please never feel like you need to spend anything for this hobby.
inb4 mae wants to drag us all down to her level - remember that there are different economic levels. You may be able to afford a lot of audio equipment, but not everyone can. This article is to help people who can't afford that audio equipment be happy with what they can get.
Do I need an expensive microphone?
Below, I'm plopping down a comparison between three banks recorded with three different microphones. I made choices I'm not exactly proud of that keep the comparisons from being one to one exactly, but they're close enough.
Which sample do you like best? Which feels the most clear?
To me, it's the second sample. The first sample is a little tinny and in some places abrasive to the ears. The third is muffled (and is recorded at C#3, but that's irrelevant here).
The second sample is clear. It has some issues with sibilants, but that can be edited out using equalizers.
Knowing that, how much do you think each microphone cost?
For the first two, it's a big, whooping zero (additional) cents. The first microphone was my laptop's embedded microphone - the same one I use for all of my "let's play" videos on YouTube. The second microphone is my phone's microphone. The third... I tried out someone's setup and I have no idea how much it cost. But, I believe their audio interface alone might have cost more than my phone, which also plays Stardew Valley and calls my mom.
If you look at this like an audio engineer, the stupidly expensive microphone is better and sounds the closest to real life, of course. But, there's two important notes here. The first is that the fandom tends to celebrate bright vocals compared to darker vocals. The phone microphone UTAU would likely be more popular than the expensive microphone UTAU due to that. The next point is just that the difference in quality is not worth the money to me. It's not night and day - it's just slightly better. This isn't even comparing it to the AT2020 USB, which likely costs only as much as the audio interface itself. (My stand broke, so the best I can do is give you a sample from my UTAU's last bank. The AT2020 bank definitely would be more popular than the expensive microphone's bank due to tone.)
How can I make my current microphone better?
- Turn the volume/sensitivity of your microphone down in volume settings if possible. You'll need to be a bit louder, but it will stop it from picking up a lot of background noise.
- Turn off all fans and heaters.
- Don't put your microphone on the same surface as your computer without some kind of buffer, like a pillow. (Your mileage may very - people told me this, but all I really noticed was my microphone falling on my face.)
- Try using a blanket as a makeshift sound booth. (Never worked for me, but I only tried using OREMO, meaning my computer was under the blanket with me... Getting hotter and making the fan run harder... I was a dumb kid.)
- If possible, hide in a closet. If there aren't enough clothes to buffer sound, put up quilts or blankets.
- Record as far away from your computer as possible.
I did all that and it's still noisy!
Monday, December 2, 2019
RSL English Recording List Review
RSL English Recording List Review
What is the RSL English Recording List?
Voicebank structure
What is missing?
- Does the bank have "- V", "V", "V -"? (RSL does.)
- Does the bank have "CV", "VC"? (RSL does.)
- Does the bank have "- C" and "C -"? (RSL does.)
- Does the bank have all common consonant clusters? (Sadly, not here.)
How is the phonetic system?
vowel list |
How is using the bank?
Who is this for?
Monday, June 17, 2019
Japanese to English Equivalency Chart
Japanese to English Equivalency Chart
Note: Shortly after writing this, I realized that it was impossible for me to create the video without any monetary input from my viewers. If you want the video series I was intending to make happen, please donate to my Patreon.