How I Built CyborgVision.io – An AI-Powered Image Storytelling App

As a developer, I’m always exploring new tools and technologies in the world of software engineering. Whenever I get a creative itch, I build something. That’s how CyborgVision.io was born—an AI-powered app that transforms user-uploaded images into narrated short stories with different moods.

The premise of CyborgVision.io is simple but fun. You generate an AI-powered short story from an image in three steps:
✅ Upload a photo or image
✅ Select a mood & AI voice
✅ Generate a narrated AI-powered description

The app analyzes your image’s content and produces unique, often hilarious descriptions based on the chosen mood. You can save your uploaded image stories, share them, and even explore public stories from other users.

CyborgVision.io offers multiple storytelling moods, including:

  • Satire (sarcastic & witty, often hilarious)
  • Analysis (straightforward & factual)
  • Surreal (exaggerated & humorous)
  • Narrative (story-driven)
  • Anecdote (personalized & engaging)
  • Metaphor (creative & symbolic)
  • Playful (lighthearted & fun)
  • Mockery (teasing & humorous)

Descriptions vary in length, from a single paragraph to several paragraphs, which makes them feel like AI-generated short stories.
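To illustrate how a selected mood could steer the output, here is a minimal sketch in Python (chosen for brevity; the real backend is C# .NET). The `MOOD_STYLES` table, the `build_prompt` function, and the prompt wording are all hypothetical and are not the prompts CyborgVision.io actually uses. The idea is only that each mood maps to a tone instruction sent to the model along with the image.

```python
# Hypothetical mood-to-tone table. The tone phrases come from the mood list
# above; the prompt wording is illustrative, not the app's real prompts.
MOOD_STYLES = {
    "satire": "sarcastic and witty",
    "analysis": "straightforward and factual",
    "surreal": "exaggerated and humorous",
    "narrative": "story-driven",
    "anecdote": "personalized and engaging",
    "metaphor": "creative and symbolic",
    "playful": "lighthearted and fun",
    "mockery": "teasing and humorous",
}


def build_prompt(mood: str, max_paragraphs: int = 3) -> str:
    """Return a text instruction to send to a vision model with the image."""
    key = mood.strip().lower()
    if key not in MOOD_STYLES:
        raise ValueError(f"Unknown mood: {mood!r}")
    if max_paragraphs < 1:
        raise ValueError("max_paragraphs must be at least 1")
    return (
        f"Describe the attached image in a {MOOD_STYLES[key]} tone. "
        f"Write between 1 and {max_paragraphs} paragraphs."
    )
```

Keeping the moods in a single table means adding a new mood is a one-line change, and an unknown mood fails early instead of producing a vague prompt.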

I wanted to take CyborgVision.io beyond just text descriptions. This led me to explore ElevenLabs’ REST API, a powerful AI voice synthesis tool.

After researching the ElevenLabs API and integrating it into the app, I successfully added AI voice narration to the workflow:

👉 Upload an image
👉 Choose a mood and AI voice
👉 AI analyzes & generates a description
👉 AI voice narrates the description

The combination of satirical descriptions and expressive AI voices makes for some hilarious results.
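For readers curious about the narration step, here is a rough sketch of a text-to-speech request in Python (the production code is C#, so this is not the app's implementation). It follows the ElevenLabs text-to-speech endpoint and `xi-api-key` header as I understand them from the public docs. The function names and the default `model_id` are my own assumptions, so check the current API reference before relying on any of it.

```python
import json
import urllib.parse
import urllib.request

# Endpoint shape per the public ElevenLabs docs; verify before use.
ELEVENLABS_TTS_URL = "https://api.elevenlabs.io/v1/text-to-speech/{voice_id}"


def build_tts_request(
    text: str,
    voice_id: str,
    api_key: str,
    model_id: str = "eleven_multilingual_v2",  # assumed default model
) -> urllib.request.Request:
    """Build (but do not send) a POST request that asks for MP3 narration."""
    if not text.strip():
        raise ValueError("text must not be empty")
    payload = {"text": text, "model_id": model_id}
    url = ELEVENLABS_TTS_URL.format(voice_id=urllib.parse.quote(voice_id, safe=""))
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "xi-api-key": api_key,
            "Content-Type": "application/json",
            "Accept": "audio/mpeg",
        },
        method="POST",
    )


def synthesize(text: str, voice_id: str, api_key: str) -> bytes:
    """Send the request and return the raw audio bytes (needs a valid key)."""
    request = build_tts_request(text, voice_id, api_key)
    with urllib.request.urlopen(request, timeout=60) as response:
        return response.read()
```

Splitting request construction from sending keeps the first function testable without network access or an API key. The bytes returned by `synthesize` could then be written to a file or uploaded to storage such as S3.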

To make the experience even richer, I added video generation capabilities. Now, users can turn their narrated image stories into an MP4 video that includes:

✔ The original uploaded image
✔ AI-generated voice narration
✔ Optional copyright-free background music

This makes it easier to share AI-generated stories on social media.
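One common way to turn a still image and an audio track into an MP4 is the `ffmpeg` command-line tool. I'm not claiming this is exactly how CyborgVision.io does it. The sketch below only builds the argument list, and the function name, file names, and the 0.2 music volume are illustrative assumptions.

```python
from typing import List, Optional


def build_ffmpeg_command(
    image: str,
    narration: str,
    output: str,
    music: Optional[str] = None,
    music_volume: float = 0.2,  # assumed level so music sits under the voice
) -> List[str]:
    """Return an ffmpeg argument list for image + narration (+ music) -> MP4."""
    cmd = [
        "ffmpeg", "-y",
        "-loop", "1", "-i", image,  # input 0: still image repeated as video
        "-i", narration,            # input 1: narration audio
    ]
    if music is not None:
        cmd += ["-i", music]        # input 2: background music
        # Lower the music volume, then mix it with the narration. The mix
        # ends when the narration (the first amix input) ends.
        mix = (
            f"[2:a]volume={music_volume}[bg];"
            "[1:a][bg]amix=inputs=2:duration=first[mix]"
        )
        cmd += ["-filter_complex", mix, "-map", "0:v", "-map", "[mix]"]
    else:
        cmd += ["-map", "0:v", "-map", "1:a"]
    cmd += [
        "-c:v", "libx264", "-tune", "stillimage",
        "-pix_fmt", "yuv420p",      # widely compatible pixel format
        "-c:a", "aac",
        "-shortest",                # stop when the audio runs out
        output,
    ]
    return cmd
```

To render the video, pass the list to `subprocess.run(cmd, check=True)` on a machine with `ffmpeg` on the PATH. One caveat: `libx264` with `yuv420p` needs even image dimensions, so an image with an odd width or height would need scaling first.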

After several weekends of development, CyborgVision.io is now live! 🎉

💡 Curious to try it out? Upload an image & let AI tell the story:
➡️ Visit CyborgVision.io

📹 Check out this YouTube short for a preview:
Watch on YouTube

Building CyborgVision.io required a combination of backend, frontend, AI, and cloud technologies:

🔹 Backend:

  • C# .NET (running in a Docker container)
  • PostgreSQL (for structured data)
  • AWS Lambda (for serverless processing)
  • AWS DynamoDB (for NoSQL storage)
  • AWS Lightsail (for hosting)

🔹 Frontend:

  • React.js (for a fast, modern UI)
  • Tailwind CSS (for styling)

🔹 Other Integrations:

  • AWS S3 (for image & audio storage)
  • OAuth (Google & Microsoft authentication)
  • ElevenLabs AI Voices (for speech generation)

Building CyborgVision.io has been a fun and rewarding experience. It allowed me to work with AI, cloud services, and interactive storytelling—all while creating something entertaining and shareable.

🚀 Generate AI-powered short stories from images. Try CyborgVision.io today!