AI Audiobook Generation

KukuFM Case Study

70%

Reduction in Cost

98%

Production Time Decrease

Project Overview

Industry

Media

AI Solution

Generative Text-to-Speech

Quick Brief

KuKu FM wanted to reduce the cost of using human voice artists and bring new audio content to market faster.

Customer Pain Points

Reliant on expensive voice artists

Slow to bring new content to market

Unable to scale content delivery

What We Delivered

We built algorithms for parsing transcripts and fed the parsed content into our state-of-the-art TTS modules that generated multi-voice audio content.

End Results

10x faster time to market

Content production more cost-effective

Unlocked content production at scale

Introduction

KuKu FM is India’s largest audio streaming platform with over 2 million subscribers. They offer a wide variety of content, including audiobooks, podcasts, and educational materials, across Indian and other languages.

KuKu FM were looking for ways to reduce both the high cost of hiring voice actors and the time it took for new audiobooks to reach the market.

They needed a Text-to-Speech (TTS) engine that could ingest text and produce high-quality audiobooks at scale, with no manual post-production required.

The Solution

We built algorithms for parsing transcripts and fed the parsed content into our state-of-the-art TTS modules, which generated multi-voice audio content. For each show, the KuKu FM team could choose from a range of generated voices and accents and assign them to characters.
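To make the parsing step concrete, here is a minimal sketch of how a transcript might be split into speaker-tagged segments and mapped to voices before synthesis. It is an illustration only, not the production pipeline. The `SPEAKER: dialogue` transcript convention, the `Segment` type, the `parse_transcript` function, and the voice identifiers are all hypothetical. The TTS call itself is omitted.

```python
import re
from dataclasses import dataclass

# Assumed transcript convention (hypothetical): dialogue lines look like
# "SPEAKER: text" with the speaker name in capitals; any other non-empty
# line is treated as narration.
SPEAKER_LINE = re.compile(r"^([A-Z][A-Z ]*):\s*(.+)$")


@dataclass
class Segment:
    speaker: str  # character name, or "NARRATOR"
    text: str     # text to be synthesized
    voice: str    # identifier of the voice assigned to this speaker


def parse_transcript(
    transcript: str,
    voice_map: dict[str, str],
    default_voice: str = "narrator_voice",
) -> list[Segment]:
    """Split a transcript into segments and attach a voice to each one."""
    segments = []
    for raw_line in transcript.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # skip blank lines
        match = SPEAKER_LINE.match(line)
        if match:
            speaker, text = match.group(1).strip(), match.group(2)
        else:
            speaker, text = "NARRATOR", line
        # Speakers without an assigned voice fall back to the default voice.
        voice = voice_map.get(speaker, default_voice)
        segments.append(Segment(speaker, text, voice))
    return segments


sample = "The rain had not stopped.\nASHA: Where are you going?\nRAVI: Home.\n"
segments = parse_transcript(sample, {"ASHA": "voice_a", "RAVI": "voice_b"})
```

Each resulting segment carries the text and the voice chosen for its character, so a downstream TTS step could synthesize the segments one by one and concatenate the audio in order.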

Episodes are ready for final checks within minutes of being submitted to the platform and can be uploaded for commercial use the same day.

The result?

We can process up to 750 hours of audio in under 24 hours, so new content can be distributed quickly and efficiently to KuKu FM’s 2 million paid subscribers.
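As a rough reading of that figure, the short calculation below converts it into hours of audio produced per hour of elapsed time. The `realtime_factor` helper is a hypothetical name used only for this illustration. The result describes aggregate throughput, which may come from processing many episodes in parallel, and says nothing about the time taken for a single episode.

```python
def realtime_factor(audio_hours: float, wall_clock_hours: float) -> float:
    """Hours of finished audio produced per hour of elapsed time."""
    return audio_hours / wall_clock_hours


# 750 hours of audio in 24 hours of elapsed time.
factor = realtime_factor(750, 24)
print(f"{factor:.2f} hours of audio per elapsed hour")
```

Because the stated processing time is under 24 hours, this value is a lower bound on the aggregate throughput implied by the figure.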

© 2025 Perlon AI Ltd (DBA Perlon Labs). All rights reserved.
