How to Create a Viral Music Video with Sondo AI: The Hwasa’s Maria Formula

By olivia | May 12, 2026

Using “Maria” by Hwasa as a case study, this article breaks down the core formula behind a viral music video—from raw, emotionally striking expression to an introspective “self-dialogue” core, as well as strong visual symbolism and emotion-driven storytelling. It shows that what truly connects with audiences isn’t complex narratives, but amplified emotion. From a creation standpoint, AI music video generators are reshaping music video production—turning abstract emotions into visual scenes, automatically syncing visuals with music, and enabling rapid generation of multiple styles, significantly lowering the barrier to entry.


Why does this music video keep going viral?

“Maria” — Hwasa

Years after its release, it is still gaining views and has surpassed 300M.

But what really made it break out isn’t just the music:

emotion + visuals + self-expression

1. Emotion: Raw and piercing

At its core, the song responds to external judgment:
criticism, denial, attack.

“Maria” isn’t driven by a complex storyline. Instead, it delivers an extremely direct emotional expression. The entire MV revolves around a specific emotional state: the struggle and recovery after being judged, questioned, and rejected.

It feels like a conversation with oneself: “Maria” is both a name and a way of calling out to herself.

2. Core idea: A dialogue with the self

“Maria” is actually her baptismal name—a way she addresses herself.

So the MV isn’t about confronting the outside world, but about facing one’s inner self. Vulnerability and strength, breakdown and rebuilding, coexist and repeat within the same person.

This internal conflict makes it easy for viewers to project themselves into the story, creating a deep emotional resonance.

3. Visuals: Strong symbols + powerful metaphors

The MV is filled with visual metaphors:

  • Surrounded by pencils → verbal attacks

  • Blood and dining table → being “consumed” by the public

  • Hospital / breakdown scenes → mental pressure

Almost every visual serves metaphor. Abstract psychological states are turned into visible scenes: being surrounded, watched, consumed.

The visuals constantly shift between light and dark, control and chaos, creating a persistent sense of unease. This unstable rhythm keeps viewers emotionally engaged—not through plot, but through feeling.

4. Theme: Self-identity

When all these layers come together, the theme becomes clear:

It’s not about external judgment, but about how you face it—and how you rebuild your sense of self.

In other words, the core isn’t conflict, but self-identity.

That’s why it has lasting appeal. It doesn’t belong to one person—it belongs to anyone who has experienced doubt and rejection.

“Maria” represents a shift in music video storytelling:
from narrative-driven → emotion-driven.
And this is exactly where AI excels.

Sondo AI Is Changing Everything

In the past, creating a music video like this required:
actors, locations, lighting, a full team, and lots of time.

Now, with Sondo AI, everything is being redefined.

Tools like Sondo transform creation from a production process into an expression process. You no longer need to physically build scenes. Just describe an emotion—pressure, conflict, release—and Sondo AI generates the visuals. Dreamlike spaces, psychological states, abstract environments—what once required filming can now be created instantly. More importantly, AI can sync visuals to music automatically, aligning rhythm and emotion without complex editing.

1. Emotion → Instant visuals

You don’t need detailed descriptions. Just say:

  • pressure

  • loneliness

  • resistance

  • breakdown

  • rebirth

AI turns them into visuals.

2. Metaphors → One-click generation

What used to require shooting:

  • hospital scenes

  • cold lighting

  • intense symbolic imagery

Visual contrasts:

  • bright vs dark

  • glamorous vs collapsing

Now:
all of these can be generated instantly and directly.

3. Rhythm → Auto-sync

  • music rhythm → visual rhythm

  • emotional shifts → camera changes

No editing skills needed.
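To make the idea of "music rhythm → visual rhythm" concrete, here is a minimal Python sketch of one simple way to line up scene changes with the beat. It is an illustration only, not how Sondo AI works internally: the function names `beat_times` and `cut_points` are hypothetical, and it assumes a steady tempo (a known BPM) rather than real beat detection on an audio file.

```python
def beat_times(bpm: float, duration_s: float) -> list[float]:
    """Return the timestamp (in seconds) of every beat, assuming a steady tempo."""
    if bpm <= 0 or duration_s < 0:
        raise ValueError("bpm must be positive and duration_s non-negative")
    # Number of whole beats that fit in the track, plus the beat at t = 0.
    count = int(duration_s * bpm / 60.0) + 1
    return [round(i * 60.0 / bpm, 3) for i in range(count)]


def cut_points(bpm: float, duration_s: float, beats_per_shot: int = 4) -> list[float]:
    """Place a scene change on every `beats_per_shot`-th beat (hypothetical helper)."""
    if beats_per_shot < 1:
        raise ValueError("beats_per_shot must be at least 1")
    return beat_times(bpm, duration_s)[::beats_per_shot]


if __name__ == "__main__":
    # A 10-second clip at 120 BPM, changing scene every 4 beats (one bar in 4/4).
    print(cut_points(120, 10))  # [0.0, 2.0, 4.0, 6.0, 8.0, 10.0]
```

Lowering `beats_per_shot` gives faster cuts for an intense section, and raising it gives longer, calmer shots, which is a rough stand-in for "emotional shifts → camera changes".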

The success of “Maria” isn’t just about content—it’s a signal of a new creative direction. What truly moves people isn’t complex production, but amplified emotion and authentic expression.

And now, with tools like Sondo, this kind of expression is no longer limited to a few.

Start Your First Music Video with Sondo AI

Use Sondo AI to turn your emotions into a music video people can actually see.

https://www.sondo.ai/
