Wiki Wednesday #336 - Kiyo Mizune / 水音キヨ
Who is Mizune Kiyo?
Art from wiki |
I get less tired as the sun goes down.
Art from Bank |
I was sad so I hugged my cat but I'm allergic so now I just want to sleep. There's proof this guy is meant to be Twine Owata / Owata Twine.
Art from official site |
It's the Fourth of July!
Art from official site |
The katakana says ヲ in the reading section. No idea why, but I'll respect it. Still, I want to mention Tadane Utauo / Utauo Tadane in case anyone is searching with that romanization.
Art from bowlroll |
When it comes to vocal synths, we should strive to remember to just have fun! This article will go into detail about the handling of syllables like "hya" and "kya" in Japanese. If you are happy with how your UTAU or DiffSinger model sings in Japanese, then please ignore this article.
I think there's just one more from this site after this one...
Art from Bank |
Art from wiki |
Technically, the name is Makaron Tanaka / Tanaka Makaron. But you see, Macaron isn't his legal name. He gets called that because of how much he loves macarons.
Art from Bank |
Sometimes I forget that I'm able to read katakana. I'll read it correctly and then say "but I'm bad at katakana so I'm probably wrong." I check and I was right all along.
Art from Bank |
You gotta remember, sometimes people just straight up lie in recipes you find online.
Art from wiki |
It's my first day processing these 2025 Forgotten Friday articles and I've knocked out two months! If I keep this pace, I could be done by Christmas.
Art from Bank |
So, hey guys! I'll just be real with you... I haven't actually gotten this to work all the way to the end. But it was really painful to try and get this all to work so... I'll share my current knowledge with you.
DiffSinger does not need a traditional phonemizer. The timing is all taken care of by the duration model. Technically, with a good enough dsdict, you'll be happy with just using the default DIFFS phonemizer. Heck, you can just type phoneme hints and be happy.
But by creating a phonemizer, you make it so that people can type in any word (or nonsense) and still get results. With the dsdict method, if a word isn't in the dsdict, it just doesn't know what the heck is going on.
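To make that difference concrete, here is a minimal Python sketch. It is not OpenUtau's code or a real dsdict: `TOY_DICT`, `TOY_RULES`, and every function name below are made up for illustration. It only shows the idea that a dictionary lookup gives up on unknown words, while a rule-based step can still produce something for nonsense input.

```python
# Hypothetical toy data: these entries do not come from any real dsdict.
TOY_DICT = {
    "cat": ["k", "ae", "t"],
    "hello": ["hh", "ah", "l", "ow"],
}

# Hypothetical spelling-to-phoneme rules (two-letter chunks are tried first).
TOY_RULES = {
    "sh": "sh",
    "ch": "ch",
    "a": "ae",
    "o": "ow",
    "b": "b",
    "l": "l",
    "p": "p",
    "r": "r",
    "t": "t",
}


def lookup_only(word):
    """Dictionary-style lookup: returns None when the word is not listed."""
    return TOY_DICT.get(word.lower())


def rule_based(word):
    """Greedy longest-match conversion, so any letter string gets some output."""
    word = word.lower()
    phonemes = []
    i = 0
    while i < len(word):
        for size in (2, 1):
            chunk = word[i:i + size]
            if chunk in TOY_RULES:
                phonemes.append(TOY_RULES[chunk])
                i += len(chunk)
                break
        else:
            i += 1  # no rule for this character, so skip it
    return phonemes


def phonemize(word):
    """Use the dictionary when possible, otherwise fall back to the rules."""
    found = lookup_only(word)
    return found if found is not None else rule_based(word)


print(lookup_only("blorp"))  # the dictionary alone has no answer
print(phonemize("blorp"))    # the rule-based fallback still returns phonemes
```

A real phonemizer does a lot more than this, but the shape is the same: known words can come from a dictionary, and everything else goes through a conversion step instead of failing.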
As far as I know, there are three ways to make your phonemizer usable. The first is handing it over to the people in charge of OpenUtau so they can put it into a new release. You really wanna make sure it's good before you do that! The second is recompiling OpenUtau yourself after creating the phonemizer. This would be great for testing, but very annoying to share. The final method is releasing it as a plugin. I feel a little bad that I'm writing this before I've actually completed the project, but I keep running into problems that I'm not equipped to handle, given that I just woke up.
So, if you're interested in learning how to do this, keep reading!
As a precaution: Anaconda and Visual Studio are pretty hefty. If your computer is short on hard drive space, you may want to sit this one out. That's why I've been doing this all on my laptop and not my tablet.
I am a Colab girl. I love Google Colab. However, I was unable to get the Colab that had been provided to work at all... Still, shout-out to the person behind it, because I wouldn't have been able to figure out the correct requirements without it.
Credits:
OpenUtau (and g2p code): Stakira
Google Colab (that I used to figure out the correct requirements): LotteV
Repo to create phonemizer plugin: Tyler (spicytigermeat)
Photosensitivity warning! KIGAI's download page has a flickering background. The contrast is low enough that it doesn't look too dangerous, but I felt motion sickness when it popped up and I didn't expect it.
Art from site |
This name makes my dyslexia hurt.
Art from Bank |