How I Built CyborgVision.io – An AI-Powered Image Storytelling App.

As a developer, I’m always exploring new tools and technologies in the world of software engineering. Whenever I get a creative itch, I build something. That’s how CyborgVision.io was born—an AI-powered app that transforms user-uploaded images into narrated short stories with different moods.

The Idea Behind CyborgVision.io

The premise of CyborgVision.io is simple but fun. Generate AI-powered short stories from images:
Upload a photo or image
Select a mood & AI voice
Generate a narrated AI-powered description

The app analyzes your image’s content and produces unique, often hilarious descriptions based on the chosen mood. You can save your uploaded image stories, share them, and even explore public stories from other users.

Moods Available in CyborgVision.io

CyborgVision.io offers multiple storytelling moods, including:

Satire (sarcastic & witty, often hilarious)
Analysis (straightforward & factual)
Surreal (exaggerated & humorous)
Narrative (story-driven)
Anecdote (personalized & engaging)
Metaphor (creative & symbolic)
Playful (lighthearted & fun)
Mockery (teasing & humorous)

Each description varies in length—from a single paragraph to multiple paragraphs, making them feel like AI-generated short stories.

Enhancing the Experience with AI Voice Narration.

I wanted to take CyborgVision.io beyond just text descriptions. This led me to explore ElevenLabs’ REST API, a powerful AI voice synthesis tool.

Integrating AI Narration – AI image analysis with voice narration: How It Works

After researching and implementing ElevenLabs, I successfully added AI voice narration to the workflow:

Upload an image
Choose a mood and AI voice
AI analyzes & generates a description
AI voice narrates the description

The combination of funny satire descriptions + expressive AI voices makes for some hilarious results.

Expanding Features: AI Video Generation

To make the experience even richer, I added video generation capabilities. Now, users can turn their narrated image stories into an MP4 video that includes:

The original uploaded image
AI-generated voice narration
Optional copyright-free background music

This makes it easier to share AI-generated stories on social media.

Try Cyborgvision.io

After several weekends of development, CyborgVision.io is now live!

Curious to try it out? Upload an image & let AI tell the story:
Visit CyborgVision.io

Check out this YouTube short for a preview:
Watch on YouTube

Tech Stack Used

Building CyborgVision.io required a combination of backend, frontend, AI, and cloud technologies:

Backend:

C# .NET (running in a Docker container)
PostgreSQL (for structured data)
AWS Lambda (for serverless processing)
AWS DynamoDB (for NoSQL storage)
AWS Lightsail (for hosting)

Frontend:

React.js (for a fast, modern UI)
Tailwind CSS (for styling)

Other Integrations:

AWS S3 (for image & audio storage)
OAuth (Google & Microsoft authentication)
ElevenLabs AI Voices (for speech generation)

Final Thoughts

Building CyborgVision.io has been a fun and rewarding experience. It allowed me to work with AI, cloud services, and interactive storytelling—all while creating something entertaining and shareable.

Ready to see AI describe your images?

Generate AI-powered short stories from images. Try CyborgVision.io today! Click Here