Spooktober 5th Annual Visual Novel Jam

Hosted by Stella @ MakeVisualNovels, Crystal Game Works · #SpooktoberVNJam

206

Entries

2,181

Ratings

Overview Submissions Results

Community116

Screenshots Submission feed

Spooktober 5th Annual Visual Novel Jam community

Regarding the use of TTS

A topic by Black Mage created Aug 18, 2023 Views: 135 Replies: 1

Viewing posts 1 to 2

Black Mage1 year ago (4 edits)

I have an inquiry about the usage of Text To Speech (TTS) programs. I understand that using AI is prohibited, but I see a problem when we're looking at the usage of TTS programs. They are similar to sound synthesizers or programs to compose music or something in that area (pardon my lack of knowledge on what to call them) where it generally uses a thing called soundfont or something that can be regarded as a collection/library of sounds/instruments.

It is true that no human speaks/records the speech generated by the TTS programs, but the same can be said for music created using those programs. Most people would just slap the notes on the program, and convert them to a sound file and that's it. It'll be rare to find someone who plays and records the instrument themself after composing it inside the program.

"But the note used on the program is actually recorded for real by someone". Well, TTS also required the voice provider to record every important sound (alphabets, certain words combination) by themselves. It's just that they never record a full word by themselves, similar to how the music programs never record the full piece by themselves. We can regard a word on TTS as a small composition on music programs.

So, what's the ruling on this matter?

I'm asking because a robotic voice might be hard to record without playing a lot with effects, or maybe someone would like to make a full monotone voiced game.

Stella @ MakeVisualNovelsHost1 year ago

I'm conversing with the judges over it, but here are my initial thoughts. Note that they are not a final ruling and we'll get back to you on it.

TTS, vocaloids, and similar voice simulation and production software should be acceptable with a valid license to use both the software and voice.

Two reasons:

1)TTS is often used as an accessibility feature and is actually built into some visual novel engines.

2)Vocaloids and the like have very well established licensing and rules regarding their usage. One of our major concerns with the current landscape of generative AI is the dubious nature of copyright, both in the source of the training material and in the ownership of the end product, and thankfully many soundfont packs have this well established.

itch.io

Spooktober 5th Annual Visual Novel Jam

Regarding the use of TTS