New Feature Release: Batch API for Transcribing Pre-Recorded Audio

Collin Borns

Sep 27, 2022

Today we are excited to announce the Speechly Batch API for transcribing massive amounts of pre-recorded audio or video content.

At Speechly, we are known for the speed of our Automatic Speech Recognition and Natural Language Understanding. This is all thanks to our Streaming API.

However, as our user base has continued to grow, so has the demand for new product features. One of the most requested features to date is the ability to use Speechly to transcribe large amounts of pre-recorded audio or video content.

Given that demand, we are excited to release the Speechly Batch API for Enterprise users.

Transcribe Large Amounts of Pre-Recorded Audio and Video with the Speechly Batch API

The Speechly Batch API enables users to easily and privately send large sets of pre-recorded audio or video files to Speechly for Speech Recognition. This makes it easy to complete tasks like Transcription, Moderation or other types of Speech Analysis on large amounts of offline audio or video data.

To use the Speechly Batch API, users simply submit the audio and read the results once it has been processed. The Speechly Batch API can process thousands of hours of audio per hour. You can submit the audio directly to Speechly or provide a Google Cloud Storage or Amazon S3 file URL, making it easy to scale.
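To make the submit-then-read flow described above concrete, here is a minimal Python sketch of the general pattern. It does not call the Speechly Batch API: `FakeBatchService`, `BatchJob`, `wait_for_result`, the status values and the bucket URLs are all hypothetical stand-ins invented for illustration, and the real request and response formats are described in the Speechly Documentation.

```python
import time
from dataclasses import dataclass
from typing import Dict, Optional


# Everything below is hypothetical: class names, statuses and fields are
# illustrative only and are NOT taken from the Speechly Batch API.
@dataclass
class BatchJob:
    job_id: str
    source: str                      # e.g. a gs:// or s3:// file URL
    status: str = "PENDING"
    transcript: Optional[str] = None


class FakeBatchService:
    """In-memory stand-in for a batch transcription backend."""

    def __init__(self) -> None:
        self._jobs: Dict[str, BatchJob] = {}
        self._polls: Dict[str, int] = {}

    def submit(self, source: str) -> str:
        """Register a file for processing and return a job id."""
        job_id = f"job-{len(self._jobs) + 1}"
        self._jobs[job_id] = BatchJob(job_id, source)
        self._polls[job_id] = 0
        return job_id

    def get(self, job_id: str) -> BatchJob:
        """Return the job; it is marked DONE from the second check onward."""
        self._polls[job_id] += 1
        job = self._jobs[job_id]
        if self._polls[job_id] >= 2:
            job.status = "DONE"
            job.transcript = f"<transcript of {job.source}>"
        return job


def wait_for_result(service, job_id, interval_s=0.0, max_polls=10):
    """Poll until the job is DONE, or give up after max_polls checks."""
    for _ in range(max_polls):
        job = service.get(job_id)
        if job.status == "DONE":
            return job
        time.sleep(interval_s)
    raise TimeoutError(f"{job_id} not finished after {max_polls} polls")


# Submit every file first, then read the results back once they are ready.
service = FakeBatchService()
sources = ["gs://example-bucket/a.wav", "s3://example-bucket/b.mp3"]
job_ids = [service.submit(src) for src in sources]
results = [wait_for_result(service, job_id) for job_id in job_ids]
```

Submitting all files before waiting on any of them is what lets a batch workflow scale: the caller never blocks on one file while others are still waiting to be queued.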

With the Speechly Batch API, you can expect the same performance and accuracy as with the Streaming API. We also offer Enterprise customers data annotation services for continued monitoring and improvement of your Speech Recognition performance.

When deploying the Speechly Batch API, customers have the ability to deploy On-Premise or in a Private Cloud. These deployment options help customers leverage Speech Recognition in the most secure and cost-effective manner possible.

Example Scenarios for the Speechly Batch API

The use cases for the Speechly Streaming API and the Speechly Batch API are the same. The difference is that the Streaming API is used for live, online scenarios, while the Batch API is for offline scenarios. This means you should use the Streaming API if you need to process speech in real time as it’s being recorded. Use the Batch API if your audio or video has already been recorded and you need to process it later. Below are a few examples where the Speechly Batch API is ideal:

User-Generated Content Monitoring - Massive amounts of podcasts and video are being recorded and uploaded every day. Speechly makes it easy to transcribe any user-generated audio or video content uploaded to your platform for use cases such as Moderation, Content Categorization, Content Indexing, Measuring Brand Awareness and Strategic Ad Placement.

Meeting Analysis - The Speechly Batch API makes it simple to convert meetings into transcriptions for later analysis. These transcriptions can be used to create Meeting Summaries or to search back through past discussions for specific information.

Customer Support - Speechly can quickly and accurately transcribe recorded customer calls. These transcripts can be used for Measuring Agent Performance, Agent Training, or extracting and documenting relevant customer support information. This is different from monitoring a conversation in Real-Time, where the goal could be to offer the Agent relevant information as the call is happening. For that scenario, the Speechly Streaming API would be better suited.

If you are interested in learning more about how to use the Speechly Batch API, you can read more in our Documentation. If you would like to learn more about getting access to the Speechly Batch API, Contact Us today.
