Register for the Forum
Welcome to the Verse 2 Melody Forum! To access our forum and join the community, please register for a free account. Registration is quick, easy, and free, and gives you full access to all forum features. Once registered with a free account, log in to the forum with the same details. We look forward to your participation!

The Challenges of AI Pronunciation: A Journey with Suno

I've been fascinated by computers since my late teens and the early days of the ZX80. Now I am revelling in the cutting-edge innovations of AI-powered music composition tools like Suno and Udio. This forum celebrates the journey of technological evolution.
User avatar
v2melody
Site Admin
Posts: 7
Joined: Wed Feb 12, 2025 4:56 pm
Location: Spain
Contact:

The Challenges of AI Pronunciation: A Journey with Suno

Post by v2melody »

One of the most fascinating aspects of working with AI tools like Suno is their ability to transform text into music. However, as powerful as these tools are, they’re not without their quirks—especially when it comes to pronunciation. As someone who has spent a lifetime immersed in language and phonetics, I’ve found these challenges both frustrating and enlightening.

The Pronunciation Puzzle: "Maria" in Thomas of the Light Heart

A particularly tricky word I encountered was "Maria" in Owen Seaman's poem Thomas of the Light Heart* The line:
"He seems immensely tickled by a  
Projectile which he calls a 'Black Maria.'"
posed a unique challenge. The AI consistently pronounced "Maria" as the name (/məˈriːə/), instead of the intended pronunciation rhyming with "tyre" or "liar" (/məˈraɪə/), referring to the police wagon. After nearly 30 attempts and countless tweaks, I finally achieved an acceptable result.

What Worked (and What Didn’t)

Here are some methods I tried:

Unsuccessful Attempts:
  • Adding explicit instructions like:
    [SPEAKER NOTE: Pronounce 'Maria' to rhyme with 'tyre' and 'fire'.]
  • Phonetic approximations such as:
    "Black MuhRYEu."  
More Successful Attempts:
  • "Black Ma-rye-a."
    It worked well with the prompt: Military march, brass fanfare, Celtic reel, fiddle, trench ballad, pub shanty, concertina, bugle call, Edwardian parlor song
  • "Black MuhRIYEu."
    Worked well with the prompt: Orchestral depth, military drums, dignified brass, emotional builds, heroic themes, determined rhythms, marching tempos
These prompts seemed to influence not just the tone but also the AI's linguistic interpretation.

The Role of Phonetics and Future Potential

As a retired English teacher with a deep understanding of phonetics, I often wish for greater control over pronunciation in AI tools. Incorporating the International Phonetic Alphabet (IPA) into these systems could be revolutionary. Imagine being able to isolate difficult phonemes or specify exact pronunciations using IPA symbols—it would open up incredible possibilities for creators working in multiple languages or with nuanced texts.

Lessons Learned
  1. Experimentation is Key: Achieving the desired output often requires patience and creativity. Testing different phrasings, prompts, and even musical styles can yield surprising results.
  2. Listen Carefully: Always listen to the entire output. Small glitches can appear unexpectedly and may require additional adjustments.
  3. AI is Evolving: While these challenges can be frustrating, it’s important to remember how far AI has come. Tools like Suno are improving every day, and what feels impossible now may soon become effortless.
A Shared Journey

Working with AI has deepened my appreciation for both technology and language. It’s a reminder that even as machines grow more capable, they still reflect our human imperfections—and our ingenuity in overcoming them. For those navigating similar challenges, I encourage you to share your experiences. Together, we can learn from each other and help shape the future of creative AI tools.

Citations:
[1] https://www.reddit.com/r/SunoAI/comment ... my_native/
[2] https://sunoaiwiki.com/tips/2024-06-25- ... on-issues/
[3] https://github.com/suno-ai/bark/issues/514
[4] https://www.youtube.com/watch?v=cr3gY4feIgw
[5] https://sunoaiwiki.com/tips/2024-05-02- ... g-suno-ai/
[6] https://fltmag.com/suno-ai/
[7] https://howtopromptsuno.com/common-prob ... -my-lyrics
[8] https://www.youtube.com/watch?v=ac-EYIr1ia8