From Chapter to Cinema: A Vibe Coder’s Guide to AI Animation Using Only Google Labs
How I turned a 1,500-word memoir chapter into a fully animated short film using nothing but AI
I stared at the opening line of my book chapter: “I remember the last time I resolved to never be broke again.”
The book is called “Commas, Not Periods”.
It was the book on finances I wanted 10 years ago made for today.
And the first memory that I wanted to outline was this scene of walking in the snow 15 blocks to an apartment with only 35 cents left to spend.
It was a dream I’d carried for ten years.
I’d written it down, but I wanted people to feel it.
Then I asked myself a question: Could I create a full animated film using only AI tools?
Not hire animators. Not learn After Effects. Not spend months in production.
Just me, my words, and a handful of Google AI tools working together like a tiny studio.
The answer was yes. And this guide will show you exactly how.
What You’ll Need (The Toolkit)
Think of this like assembling a film crew, except every “person” is an AI tool with a specific job:
Screenwriter: Gemini
Character Designer: ImageFX, then Flow when things became weird (more on that later!)
Animator: Flow
Composer: Music FX
Editor: Google Vids
Voice Actor: Yourself!
You don’t need to master any of these. You just need to know what to ask them.
Phase 1: Breaking Your Text Into Scenes
Before you generate a single image, you need to transform your written words into visual instructions. This is where Gemini becomes your director.
The Magic Prompt
Copy this prompt into Gemini, then paste your source text where indicated:
Act as an expert animation screenwriter and director. I have written content that I want to turn into a short animated film.
The text below will serve as the voiceover narration for the film.
Your task is to break this text down into distinct, manageable animation scenes and write detailed visual descriptions for what should be happening on screen during that part of the narration.
**Directorial Notes:**
* **Tone:** [Describe the emotional arc: reflective, vulnerable, hopeful, etc.]
* **Visual Style:** [Describe your preferred look: painterly, realistic, impressionistic, etc.]
* **Key Visual Metaphors:** [List any imagery from your text that should be shown literally or symbolically]
Please provide the output as a table with two columns:
1. **Visual Action Description:** Detailed instructions for the animator describing the environment, character action, lighting, and mood.
2. **Corresponding Narration:** The exact segment of text that will be spoken over those visuals.
**Here is the source text:**
[PASTE YOUR TEXT HERE]What You’ll Get Back
Gemini will return a scene-by-scene breakdown. For my chapter about financial struggle, it gave me scenes like:
Close-up of heavy winter boots trudging through deep snow. Camera is low, focused on the struggle. Wind blows snow violently across the frame.
Interior of a dimly lit bus. The protagonist sits with her head against the freezing window. Other passengers are shadowy silhouettes.

This table becomes your shot list, corresponding to each part of the story. Every row is a clip you’ll generate.
Phase 2: Creating Your Character
Here’s what separates amateur AI films from cohesive ones: character consistency.
If you just type prompts into an image generator, your protagonist will look different in every single frame. Different face shape. Different hair. Different everything.
The solution is creating a “character reference sheet” and using it to anchor every subsequent generation.
Step 1: Define Your Character’s Core Features
Before generating anything, decide on consistent details:
Hair color and style
Eye color
Distinguishing features (freckles, scar, glasses)
Body type
Age (if your story spans time, you’ll need multiple versions)
Step 2: Generate Your Reference Images
Go to ImageFX and use prompts that include your “core character block.”
Here’s the template I used for a character appearing across three time periods:
10 Years Ago (The Struggle):
A medium close-up still from an animated film, rendered in a 3D style with painterly brushstrokes. A young woman, aged roughly 20, with unruly dark curly hair matted with wet snow, warm brown eyes squinting against the wind, and slight freckles across her nose. She is trudging through a heavy nighttime blizzard. She wears an insufficient, thin black retail jacket with a generic “STAFF” name tag, hunched over and shivering violently. Her face is beet red and stinging from the cold, her expression is one of pure exhaustion, misery, and near-defeat, with tears freezing on her cheeks. The background is a swirling impressionistic void of white snow and blurred, cold blue streetlights.5 Years Ago (The Middle):
A medium shot still from an animated film, rendered in a 3D style with painterly brushstrokes. The same woman from the previous image, now aged roughly 25, with the same dark curly hair, brown eyes, and freckles. She is sitting on the floor of a cramped, dimly lit bedroom, looking overwhelmed and anxious, holding her head in her hands. She is surrounded by a chaotic pile of expensive-looking, colorful clothes with tags still on them, and many ornate glass perfume bottles on a cluttered dresser that seem to loom over her. The room feels claustrophobic.Present Day (Stability):
A medium shot still from an animated film, rendered in a 3D style with painterly brushstrokes. The same woman from the previous images, now mature, aged roughly 30. Her dark curly hair is styled neatly, and her brown eyes and freckled face show a calm, confident, and peaceful expression with a gentle smile. She is sitting comfortably at a clean wooden desk, working on an open silver laptop in a modern, airy apartment. Natural sunlight streams through a large window.Notice how each prompt includes “the same woman” and repeats the core features (dark curly hair, brown eyes, freckles). This anchors the AI’s understanding.
Step 3: Save Your Best Results
Generate several variations of each. When you find one that looks right, download it immediately. These images become your “actor headshots” for the next phase.
Note: I got a bit stuck at some points here with the consistency.
For example, here is a side by side of a scene I put into Image FX vs Flow for an important scene:


I now realize that using Flow to create both the images and videos in sync is the better way to go about this.
Phase 3: Bringing Images to Life
This is where the magic happens. You’re going to turn your static character images into moving video clips.
The Critical Step: Image-to-Video
Do NOT just type text prompts into Flow. You must upload your reference image first.
Here’s the workflow:
Open Flow
Look for the “Create Image” input option
Upload the specific character image that matches your scene
Write a motion prompt describing what should happen
Motion Prompt Templates
For each scene type, here’s how to prompt for movement:
Walking/Movement Scene:
A medium tracking shot based on the reference image. The character [action: walks, trudges, runs] slowly [direction]. [Weather/environment details]. The camera moves [direction] as they move. [Lighting description].Example:
A medium tracking shot based on the reference image. The character shivers violently, trudging slowly forward against heavy wind and thick, swirling snow. Her breath condenses in cold puffs. She clutches her thin jacket tighter. The camera moves backwards slowly as she walks toward the viewer. The lighting is dark and cinematic wintry blue.Emotional/Internal Scene:
Slow, creeping zoom into the character’s [body part/face], based on the reference image. [Describe any metaphorical elements: smoke, light, shadows]. They are [emotional state], [physical action]. [Atmosphere description].Example:
Slow, creeping zoom into the character’s distressed face, based on the reference image. The room darkens slightly as wisps of heavy, translucent purple and gold smoke begin to curl upwards from the open perfume bottles on the dresser behind her, slowly clouding the air around her head. She is breathing shallowly, looking overwhelmed and panicked.POV (First-Person) Scene:
First-person POV shot, [style]. A hand [action] with [object]. In the background, [setting description]. [Sensory details].Example:
First-person POV shot, 3D painterly style. A hand holds a silver spoon dipping into a glass jar of glowing orange marmalade. In the background, slightly out of focus, is a view of the London skyline through a cafe window on a sunny morning. Steam rises from a tea cup on the table.Metaphorical/Abstract Scene:
[Camera movement]. The character [action] in [setting]. As they [trigger action], [metaphorical element] begins to [transformation]. [Lighting change].Example:
Slow motion. The character takes a deep breath and expands her chest. As she does, the ghostly chains shatter completely into sparkling dust and float upwards. The golden light around her glows brighter, illuminating the darkness.Some examples here:




Clip Length Reality Check
FLow generates clips of approximately 4-5 seconds. Your narration will be longer. Don’t panic. You have options:
Slow down the clip to 0.5x speed (optical flow smoothing makes this look great)
Loop the clip by duplicating it
Reverse and loop (boomerang effect) for seamless extension
Cut between multiple clips for the same scene
Phase 4: The Soundtrack
Your film needs two audio layers: voice and music.
The Voice (Narration)
Because this is likely personal content (a memoir, a brand story, your expertise), I recommend recording yourself. Nothing carries emotional weight like an authentic voice.
But if you prefer AI narration, here are your free options:
Microsoft Clipchamp (Best Free Quality):
Built into Windows or available at clipchamp.com
Uses Azure’s high-end AI voices
Voices have emotional toggles: “Sad,” “Hopeful,” “Whispering”
CapCut (Fastest):
Text-to-Speech built into the editor
Less expressive, but gets the job done
ElevenLabs (kinda breaks the rules for using only Google tools, premium quality, paid):
If budget allows, this is the gold standard
Most natural-sounding AI voices available
Worth it for polished final products
The Music (Background Score)
Go to MusicFX and generate two distinct tracks:
Track 1: The Struggle
Cinematic ambient soundtrack, melancholic piano, lonely, cold wind textures, slow tempo, minimal, sad emotion, lo-fiTrack 2: The Breakthrough
Cinematic orchestral build-up, hopeful, uplifting strings, warm emotional resolution, inspiring, sunrise feeling, triumphThe transition between these tracks should happen at your story’s turning point. In my film, I switched from sad to hopeful exactly when the “chains break” in the metaphorical scene.
Phase 5: Assembly
Now you have:
Your scene-by-scene script (from Gemini)
Your character images (from ImageFX)
Your video clips (from Flow)
Your voiceover (recorded or AI-generated)
Your music tracks (from MusicFX)
Time to put it together.
The Editing Principle
Audio first. Always.
Drag your voiceover onto the timeline before anything else. This is your spine. Everything else gets built around it.
Music Volume
Set your background music to approximately 10-15% volume (around -20dB). You want viewers to barely notice it. The voice must dominate.
The Complete Workflow (Quick Reference)
1. WRITE your source text (chapter, script, essay)
↓
2. PROMPT Gemini to break it into visual scenes
↓
3. GENERATE character reference images in ImageFX
↓
4. CREATE video clips in Flow (upload reference images first!)
↓
5. GENERATE music tracks in MusicFX
↓
6. RECORD or generate voiceover narration
↓
7. ASSEMBLE in Google Vids
↓
8. EXPORT and shareWhat This Process Taught Me
When I started, I thought AI animation would feel like cheating. Like I was somehow less of a creator because I didn’t draw every frame by hand.
What I discovered was different.
The AI didn’t create the film. I did. The AI just translated my vision faster than I could have done alone.
I still made every creative decision:
Which emotions to emphasize
Which metaphors to visualize
Which moments needed silence versus music
Which version of my character felt most authentic
The tools handled the technical execution. I handled the soul.
And that’s always been what matters most.
Your Turn
You have a story. Maybe it’s a book chapter. Maybe it’s a brand origin story. Maybe it’s a lesson you learned the hard way.
Whatever it is, it deserves to be seen, not just read.
Open Gemini. Paste your text. Ask it to see what you see.
Then watch your words come to life.
Here is my final product for your consideration:
This guide was created as part of the Shameless Code Co. mission to help women move from consumers of AI to creators with it. For more resources on vibe coding and AI-assisted creation, visit shamelesscode.co.







Thanks so much for sharing. I will try that this week.
This is right on time can’t wait to give it a shot over the weekend!