Kapwing

Joshua Grossberg, CTO at Kapwing, discusses Kapwing’s secret to building successful AI-first features, and how that included partnering with AssemblyAI.

1

What are the most important considerations when building with AI?

Our users are the most important consideration when building a new feature. There are some tools or some technology where we say, is it going to be too high-touch for our customers to use?

We try to separate things like, is it a gimmick, is it a stunt? Is it too complex for our user? But then somewhere in between those things is the sweet spot.

When choosing to integrate a new AI model or partner, the way we see it is that we have our core competencies as a company, but then we have to integrate with the outside world and bring other people's core competencies into ours.

2

How do you commit to a specific AI feature?

We look at trends, but we also have personas that we go by.

Sometimes we're chasing growth, where we say this feature attracts a high-growth persona. A person may have a lot of followers on TikTok or Instagram, so it's going to lead to high growth. But oftentimes that person in and of themselves is not a high-revenue persona.

So the type of people who are our high-revenue persona is like a small business person who's making an Instagram ad. Or maybe short-form YouTube tutorials and things like that.

And for that person, they tend to really want types of text-based embellishments. When we see something that is attractive to that person, like really strong word-by-word timings, or just really good transcription, then that makes sense for us.

If you have an hour of content, the difference between 99% accuracy and 97% accuracy, it's a lot of time for that person to review. So you could cut down their workflow from taking half an hour, taking 20 minutes, taking 15 minutes– it's huge, right?
Joshua Grossberg
CTO, Kapwing
We needed a provider that could grow with us," explains Mark. "Our platform's success depended on having unlimited concurrent streams, reasonable pricing, and responsive support—all while protecting our customers' privacy.
Name goes here
Title goes here
3

What's an example of one of Kapwing's AI features?

A big thing that people want, and that correlates very highly with our paying customers, is transcriptions and translations.

People watch videos on mute now a lot. Someone sends me a video and if I'm supposed to watch them on the train without subtitles, I'm not going to watch it. And if the subtitles are engaging, that makes it better.

That's been a major driver of our revenue and a major driver for some of our best customers.

So, one of the things that we started to do to make our transcription editing more powerful was give people precise word timings. And that allows them to do things like trimming with the transcript and things like when you're actually trimming the video, you're tethering the subtitles to the video as opposed to a specific point in time.

And this also allows us to do things like word-by-word animations.

We needed a provider that could grow with us," explains Mark. "Our platform's success depended on having unlimited concurrent streams, reasonable pricing, and responsive support—all while protecting our customers' privacy.
Name goes here
Title goes here
4

Why Kapwing switched to AssemblyAI

We switched over to AssemblyAI because our previous API didn't have accurate enough word timing or foreign language translations. And foreign languages are actually important for us because we get a lot of users from around the world.

AssemblyAI was very helpful about working with us to do experiments to compare both the Word Error Rate and the overall timing accuracy. That, combined with a better price point, were big reasons why we switched.
Joshua Grossberg
CTO, Kapwing
We needed a provider that could grow with us," explains Mark. "Our platform's success depended on having unlimited concurrent streams, reasonable pricing, and responsive support—all while protecting our customers' privacy.
Name goes here
Title goes here
5

What AI tech are you most excited about in the future?

The Generative AI stuff is cool. For us, we're seeing it happening on images, right? But then what's the next step to really augmenting video? I'm not sure it's ready for a lot of paid usage today, but in the future, it could be really compelling.

We needed a provider that could grow with us," explains Mark. "Our platform's success depended on having unlimited concurrent streams, reasonable pricing, and responsive support—all while protecting our customers' privacy.
Name goes here
Title goes here
6

What AI-powered feature is next for Kapwing?

We're exploring ways to leverage AI to speed up video creation, for example, automatically generate highlights or teasers, automatically edit raw footage, and generate voice-overs to simplify the filming process. The goal is to help more businesses and creators to tell stories through video fast and at scale.

We needed a provider that could grow with us," explains Mark. "Our platform's success depended on having unlimited concurrent streams, reasonable pricing, and responsive support—all while protecting our customers' privacy.
Name goes here
Title goes here

Start building with $50 in free credits

Start building with Universal-Streaming and create voice agents that feel natural, responsive, reliable, and genuinely helpful.

A partnership built on support and scale

Zoom

AssemblyAI's Speech AI models are helping Zoom advance their speech-to-text R&D by refining training data for Zoom's AI Companion, strengthening their AI feature performance.

AI Notetakers
Async Speech-to-Text
No items found.

WhatConverts

WhatConverts call tracking platform partners with AssemblyAI Speech-to-Text API to power State-of-the-Art transcription accuracy.

Call Tracking
Async Speech-to-Text
PII-Redaction

Ollang

Multi-agent AI platform transforms production workflows for streaming platforms and broadcasters

Content Creation
Async Speech-to-Text
Automatic Language Detection
Speaker Diarization

Supernormal

Supernormal, a platform for AI meeting notes and voice agents, partnered with AssemblyAI to improve transcription accuracy and multilingual support—and take control of the market.

AI Notetakers
Async Speech-to-Text
Summarization
Topic Detection

Veed

Learn how Veed co-founders Sabba Keynejad and Tim Mamedov built a competitive AI video editing platform with AssemblyAI.

Content Creation
Async Speech-to-Text
No items found.

Siro

Siro, an AI-powered coaching platform for field sales, integrated AssemblyAI to accurately transcribe and identify speakers in the field, leading to significant downstream benefits for its customers.

Revenue/Sales Intelligence and Sales Coaching
Async Speech-to-Text
Automatic Language Detection
Speaker Diarization

Grain

AI notetaker Grain partners with AssemblyAI to power its conversation intelligence offering, boosting its customers’ productivity and satisfaction.

AI Notetakers
Async Speech-to-Text
Automatic Language Detection

Dexa

Dexa is changing the podcasting landscape by making expert knowledge from podcasts instantly accessible and actionable for everyone.

Content Creation
Async Speech-to-Text
Auto Chapters
Speaker Diarization

Edgetier

Learn why EdgeTier chose to partner with AssemblyAI to power critical components of their conversation intelligence platform.

Conversation Intelligence
Async Speech-to-Text
Automatic Language Detection
Speaker Diarization

Delphi

Expertise can be transformative, and Delphi is pioneering a revolutionary approach to knowledge sharing. By harnessing advanced AI technology, the company is creating digital clones of thought leaders, entrepreneurs, and experts, making their insights accessible to a global audience 24/7.

Content Creation
Async Speech-to-Text
Audio Intelligence
Automatic Language Detection
Entity Detection
Speaker Diarization
Topic Detection

Aloware

Aloware’s bet on AI pays off: After integrating AssemblyAI’s leading Speech AI models, the company converts 50% of its client base to its popular AI-powered packages.

Call Centers and Contact Centers
No items found.
No items found.

CallRail

After integrating highly accurate Speech AI models, CallRail saw explosive growth, as well as significant downstream benefits for its customers.

Call Tracking
Async Speech-to-Text
LeMUR
Audio Intelligence
Summarization
Topic Detection
Sentiment Analysis
Key Phrases
LeMUR

Kapwing

Joshua Grossberg, CTO at Kapwing, discusses Kapwing’s secret to building successful AI-first features, and how that included partnering with AssemblyAI.

Content Creation
Async Speech-to-Text
Code Switching

Jiminny

How Jiminny builds with AI models to secure 15% higher win rates for customers, increasing customer satisfaction by at least 51%

Revenue/Sales Intelligence and Sales Coaching
Async Speech-to-Text
LeMUR
Audio Intelligence
Speaker Diarization

Dovetail

Learn how a leading customer intelligence platform integrated Assembly to boost WER by 36% and improve customer experience.

Conversation Intelligence
Async Speech-to-Text
Speaker Diarization

Earmark

How a product management AI startup launched successfully with real-time meeting transcription at scale.

AI Notetakers
Streaming Speech-to-Text
Audio Intelligence
LeMUR
Entity Detection
Automatic Language Detection
Key Phrases
EU Region

Turn voice data into unparalleled product experiences

Partner with the leader in Speech AI to build powerful products with breakthrough industry impact.