Graphic for TWIL's blog post featuring the synergy of GPT-4 Vision and ElevenLabs, depicting a playful AI interpretation of a meeting narrated in the style of David Attenborough.

Welcome to TWIL, our weekly knowledge-sharing voyage where we spotlight the latest lessons our team has gathered in the ever-evolving landscape of software development. This installment of TWIL brings you an exciting intersection of AI storytelling with GPT Vision and Narration with ElevenLabs. With just a little effort, GPT-4 Vision and ElevenLabs bring a meeting snapshot to life in the — a testament to the playful might of today's technology.

GPT Vision and Narration with ElevenLabs

You might’ve seen this viral video earlier in the week where Charlie Holtz, a developer at Replicate wrote a script that would take pictures with his webcam every 5 seconds, then ask GPT-4 Vision to describe them in the style of David Attenborough, only to then use a voice model from ElevenLabs to read the description.

Sounds like a lot, so here’s the clip:


While amusing and terrifying, Charlie’s demonstration shows us how these tools can be equally powerful and surprisingly simple. Although there are several legal and ethical concerns about modeling someone’s voice without their permission (something ElevenLabs’ TOS outright prohibits) — one has to worry about how these things will be used in the future.

But not before we have our own fun with them… I wanted to hear how GPT-4 Vision would invent an Attenborough-style narration of our team’s standup. So I provided this single screenshot and the results, well, they literally speak for themselves 😂.

Screenshot of Cuttlesoft's engineering standup meeting, used for the demonstration of combining GPT-4 and ElevenLabs for a documentary style narration.

  • OpenAI
  • ElevenLabs
Frank Valcarcel's profile picture
Frank Valcarcel

Cofounder, Director of Operations

Related Posts

OpenAI's hexagonal purple logo against a gradient turquoise background symbolizes the intersection of artificial intelligence and software development. This minimalist design represents OpenAI's significant role in AI development tools and APIs that Cuttlesoft integrates into custom software solutions. The clean geometric pattern reflects the structured approach needed when implementing AI capabilities in enterprise applications, healthcare systems, and government software. As a technology-agnostic development company working with Python, React, and Ruby, Cuttlesoft closely follows OpenAI's developments to enhance our clients' applications with artificial intelligence and machine learning capabilities.
October 4, 2024 • Frank Valcarcel

OpenAI’s DevDay 2024: Four Big API Changes

OpenAI’s DevDay 2024 unveiled four game-changing API updates: a Realtime API for seamless speech-to-speech, Vision Fine-tuning for specialized visual models, Prompt Caching to boost efficiency and reduce costs, and Model Distillation for balancing performance and affordability.

Dynamic team of software developers from Cuttlesoft, highlighting their organizational maturity enabled by their re-branded image.
August 18, 2022 • Frank Valcarcel

The New Cuttlesoft

To reimagine Cuttlesoft’s brand, we partnered with the experts at Focus Lab. With their guidance, we identified the ways Cuttlesoft was failing to meet its full potential.