Changelog

Follow along to see weekly accuracy and product improvements.

Subscribe to updates Follow us on Twitter

July 11, 2022

Hindi Transcription / Custom Webhook Headers

We’ve released our new Hindi transcription model to help you transcribe and analyze your Hindi audio and video files.

Now you can automatically convert any Hindi audio or video file to text by including "language_code": "hi" in your POST request to our /v2/transcript endpoint.

We’ve also added Hindi support for our AI models including Custom Vocabulary (Word Boost), Custom Spelling, Automatic Punctuation / Casing, Profanity Filtering, and more. See the full list of supported models available for Hindi transcriptions here.

To get started with Hindi transcription, visit our walkthrough on Specifying a Language on our AssemblyAI documentation page.

Our Webhook service now supports the use of Custom Headers for authentication.

A Custom Header can be used for added security to authenticate webhook requests from AssemblyAI. This feature allows a developer to optionally provide a value to be used as an authorization header on the returning webhook from AssemblyAI, giving the ability to validate incoming webhook requests.

To use a Custom Header, you will include two additional parameters in your POST request to /v2/transcript: webhook_auth_header_name and webhook_auth_header_value. The webhook_auth_header_name parameter accepts a string containing the header's name which will be inserted into the webhook request. The webhook_auth_header_value parameter accepts a string with the value of the header that will be inserted into the webhook request. See our Using Webhooks documentation to learn more and view our code examples.

July 1, 2022

Improved Speaker Labels Accuracy and Speaker Segmentation

Improved the overall accuracy of the Speaker Labels feature and the model’s ability to segment speakers.

Fix a small edge case that would occasionally cause some transcripts to complete with NULL as the language_code value.

June 24, 2022

Content Moderation and Topic Detection Available for Portuguese

Content Moderation and Topic Detection now available for the Portuguese language.

Improved Inverse Text Normalization of money amounts in transcript text.

Addressed an issue with Real-Time Transcription that would occasionally cause variance in timestamps over the course of a session.
Fixed an edge case with transcripts including Filler Words that would occasionally cause server errors.

June 10, 2022

Automatic Language Detection Available for Dutch and Portuguese

Automatic Language Detection now supports detecting Dutch and Portuguese.

Accuracy of the Automatic Language Detection model improved on files with large amounts of silence.
Improved speaker segmentation accuracy for Speaker Labels.

May 27, 2022

Dutch and Portuguese Support Released

Dutch and Portuguese transcription is now generally available for our /v2/transcript endpoint. See our documentation for more information on specifying a language in your POST request.

May 20, 2022

Content Moderation and Topic Detection Available for French, German, and Spanish

Content Moderation and Topic Detection features are now available for French, German, and Spanish languages.

Improved redaction accuracy for credit_card_number, credit_card_expiration, and credit_card_cvv policies in our PII Redaction feature.

Fixed an edge case that would occasionally affect the capitalization of words in transcripts when disfluencies was set to true.

May 2, 2022

French, German, and Italian Support Released

French, German, and Italian transcription is now publicly available. Check out our documentation for more information on Specifying a Language in your POST request.

Released v2 of our Spanish model, improving absolute accuracy by ~4%.
Automatic Language Detection now supports French, German, and Italian.
Reduced the volume of the beep used to redact PII information in redacted audio files.

April 18, 2022

Miscellaneous Bug Fixes

Fixed an edge case that would occasionally affect timestamps for a small number of words when disfluencies was set to true.
Fixed an edge case where PII audio redaction would occasionally fail when using local files.

April 12, 2022

New Policies Added for PII Redaction and Entity Detection

Added two new policies for PII Redaction and Entity Detection: drivers_license and banking_information.

April 4, 2022

Spanish Language Support, Automatic Language Detection, and Custom Spelling Released

Spanish transcription is now publicly available. Check out our documentation for more information on Specifying a Language in your POST request.
Automatic Language Detection is now available for our /v2/transcript endpoint. This feature can identify the dominant language that’s spoken in an audio file and route the file to the appropriate model for the detected language.
Our new Custom Spelling feature gives you the ability to specify how words are spelled or formatted in the transcript text. For example, Custom Spelling could be used to change all instances "CS 50" to "CS50".

March 28, 2022

Auto Chapters v6 Released

Released Auto Chapters v6, improving the summarization of longer chapters.

March 14, 2022

Auto Chapters v5 Released

Auto Chapters v5 released, improving headline and gist generation and quote formatting in the summary key.

Fixed an edge case in Dual-Channel files where initial words in an audio file would occasionally be missed in the transcription.

March 8, 2022

Regional Spelling Improvements

Region-specific spelling improved for en_uk and en_au language codes.
Improved the formatting of “MP3” in transcripts.
Improved Real-Time transcription error handling for corrupted audio files.

February 28, 2022

Real-Time v3 Released

Released v3 of our Real-Time Transcription model, improving overall accuracy by 18% and proper noun recognition by 23% relative to the v2 model.

Improved PII Redaction and Entity Detection for CREDIT_CARD_CVV and LOCATION.

February 22, 2022

Auto Chapters v4 Released, Auto Retry Feature Added

Added an Auto Retry feature, which automatically retries transcripts that fail with a Server error, developers have been alerted message. This feature is enabled by default. To disable it, visit the Account tab in your Developer Dashboard.

Auto Chapters v4 released, improving chapter summarization in the summary key.
Added a trailing period for the gist key in the Auto Chapters feature.

February 7, 2022

Auto Chapters v3 Released

Released v3 of our Auto Chapters model, improving the model’s ability to segment audio into chapters and chapter boundary detection by 56.3%.
Improved formatting for Auto Chapters summaries. The summary, headline, and gist keys now include better punctuation, casing, and text formatting.

January 31, 2022

Miscellaneous Bug Fixes

Fixed a rare edge case affecting audio duration calculation of a small percentage of multi-channel files that contained no speech.
Miscellaneous bug fixes for Real-Time Transcription.

January 24, 2022

Webhook Status Codes, Entity Detection Improved

POST requests from the API to webhook URLs will now accept any status code from 200 to 299 as a successful HTTP response. Previously only 200 status codes were accepted.
Updated the text key in our Entity Detection feature to return the proper noun rather than the possessive noun. For example, Andrew instead of Andrew’s.

Fixed an edge case with Entity Detection where under certain contexts, a disfluency could be identified as an entity.

January 17, 2022

Punctuation and Casing Accuracy Improved, Inverse Text Normalization Model Updated

Released v4 of our Punctuation model, increasing punctuation and casing accuracy by ~2%.
Updated our Inverse Text Normalization (ITN) model for our /v2/transcript endpoint, improving web address and email address formatting and fixing the occasional number formatting issue.

Fixed an edge case where multi-channel files would return no text when the two channels were out of phase with each other.

January 10, 2022

Support for Non-English Languages Coming Soon

Our Deep Learning team has been hard at work training our new non-English language models. In the coming weeks, we will be adding support for French, German, Italian, and Spanish.