Changelog
Follow along to see weekly accuracy and product improvements.
May 27, 2022
Dutch and Portuguese Support Released
- Dutch and Portuguese transcription is now generally available for our `/v2/transcript` endpoint. See our documentation for more information on specifying a language in your `POST` request.
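As a rough illustration of specifying a language in the `POST` request, the sketch below builds (but does not send) a request to the `/v2/transcript` endpoint. The API host, the `audio_url` and `language_code` field names, the `authorization` header, and the `"nl"` code are assumptions that do not come from this changelog, so check the documentation before relying on them.

```python
import json
import urllib.request

# Assumed values, not taken from this changelog: the API host, the
# `audio_url` / `language_code` field names, and the "nl" language code.
API_URL = "https://api.assemblyai.com/v2/transcript"


def build_transcript_request(audio_url: str, language_code: str, api_key: str) -> urllib.request.Request:
    """Build (without sending) a POST request for a transcription job."""
    payload = {"audio_url": audio_url, "language_code": language_code}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"authorization": api_key, "content-type": "application/json"},
        method="POST",
    )


# Placeholder audio URL and API key, for illustration only.
request = build_transcript_request("https://example.com/interview.mp3", "nl", "YOUR_API_KEY")
print(request.get_method(), request.full_url)
print(request.data.decode("utf-8"))
```

Passing the returned object to `urllib.request.urlopen` would submit the job; the sketch stops short of that so it can be run without credentials.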
May 20, 2022
Content Moderation and Topic Detection Available for French, German, and Spanish
- Content Moderation and Topic Detection features are now available for French, German, and Spanish languages.
- Improved redaction accuracy for the `credit_card_number`, `credit_card_expiration`, and `credit_card_cvv` policies in our PII Redaction feature.
- Fixed an edge case that would occasionally affect the capitalization of words in transcripts when `disfluencies` was set to `true`.
May 2, 2022
French, German, and Italian Support Released
- French, German, and Italian transcription is now publicly available. Check out our documentation for more information on Specifying a Language in your `POST` request.
- Released v2 of our Spanish model, improving absolute accuracy by ~4%.
- Automatic Language Detection now supports French, German, and Italian.
- Reduced the volume of the beep used to redact PII in redacted audio files.
April 18, 2022
Miscellaneous Bug Fixes
- Fixed an edge case that would occasionally affect timestamps for a small number of words when `disfluencies` was set to `true`.
- Fixed an edge case where PII audio redaction would occasionally fail when using local files.
April 12, 2022
New Policies Added for PII Redaction and Entity Detection
- Added two new policies for PII Redaction and Entity Detection: `drivers_license` and `banking_information`.
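A minimal sketch of how the two new policies might be requested for PII Redaction. The policy names come from this entry; the `audio_url`, `redact_pii`, and `redact_pii_policies` field names are assumptions, and the audio URL is a placeholder.

```python
import json


def build_redaction_payload(audio_url: str, policies: list[str]) -> dict:
    """Return a request body that enables PII Redaction for the given policies."""
    if not policies:
        raise ValueError("at least one redaction policy is required")
    return {
        "audio_url": audio_url,
        "redact_pii": True,  # assumed field name
        "redact_pii_policies": list(policies),  # assumed field name
    }


payload = build_redaction_payload(
    "https://example.com/support-call.mp3",
    ["drivers_license", "banking_information"],
)
print(json.dumps(payload, indent=2))
```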
April 4, 2022
Spanish Language Support, Automatic Language Detection, and Custom Spelling Released
- Spanish transcription is now publicly available. Check out our documentation for more information on Specifying a Language in your `POST` request.
- Automatic Language Detection is now available for our `/v2/transcript` endpoint. This feature can identify the dominant language that’s spoken in an audio file and route the file to the appropriate model for the detected language.
- Our new Custom Spelling feature gives you the ability to specify how words are spelled or formatted in the transcript text. For example, Custom Spelling could be used to change all instances of `"CS 50"` to `"CS50"`.
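To make the Custom Spelling example concrete, the sketch below mimics the effect locally on a piece of text. It is only an illustration of the "CS 50" to "CS50" behavior described above, not the API's implementation; the `apply_custom_spelling` helper and its rule format are hypothetical, and matching here is case-sensitive by choice.

```python
import re


def apply_custom_spelling(text: str, rules: dict[str, list[str]]) -> str:
    """Replace each listed variant with its target spelling (whole matches only).

    `rules` maps a target spelling to the variants it should replace.
    This is a hypothetical local helper, not part of the API.
    """
    for target, variants in rules.items():
        for variant in variants:
            pattern = re.compile(r"\b" + re.escape(variant) + r"\b")
            # A callable replacement keeps the target literal, even if it contains backslashes.
            text = pattern.sub(lambda _match, t=target: t, text)
    return text


result = apply_custom_spelling(
    "Welcome to CS 50. CS 50 meets on Mondays.",
    {"CS50": ["CS 50"]},
)
print(result)  # Welcome to CS50. CS50 meets on Mondays.
```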
March 28, 2022
Auto Chapters v6 Released
- Released Auto Chapters v6, improving the summarization of longer chapters.
March 14, 2022
Auto Chapters v5 Released
- Auto Chapters v5 released, improving `headline` and `gist` generation and quote formatting in the `summary` key.
- Fixed an edge case in Dual-Channel files where initial words in an audio file would occasionally be missed in the transcription.
March 8, 2022
Regional Spelling Improvements
- Region-specific spelling improved for `en_uk` and `en_au` language codes.
- Improved the formatting of “MP3” in transcripts.
- Improved Real-Time transcription error handling for corrupted audio files.
February 28, 2022
Real-Time v3 Released
- Released v3 of our Real-Time Transcription model, improving overall accuracy by 18% and proper noun recognition by 23% relative to the v2 model.
- Improved PII Redaction and Entity Detection for `CREDIT_CARD_CVV` and `LOCATION`.
February 22, 2022
Auto Chapters v4 Released, Auto Retry Feature Added
- Added an Auto Retry feature, which automatically retries transcripts that fail with a `Server error, developers have been alerted` message. This feature is enabled by default. To disable it, visit the Account tab in your Developer Dashboard.
- Auto Chapters v4 released, improving chapter summarization in the `summary` key.
- Added a trailing period for the `gist` key in the Auto Chapters feature.
February 7, 2022
Auto Chapters v3 Released
- Released v3 of our Auto Chapters model, improving the model’s ability to segment audio into chapters and chapter boundary detection by 56.3%.
- Improved formatting for Auto Chapters summaries. The `summary`, `headline`, and `gist` keys now include better punctuation, casing, and text formatting.
January 31, 2022
Miscellaneous Bug Fixes
- Fixed a rare edge case affecting audio duration calculation of a small percentage of multi-channel files that contained no speech.
- Miscellaneous bug fixes for Real-Time Transcription.
January 24, 2022
Webhook Status Codes, Entity Detection Improved
- `POST` requests from the API to webhook URLs will now accept any status code from `200` to `299` as a successful HTTP response. Previously only `200` status codes were accepted.
- Updated the `text` key in our Entity Detection feature to return the proper noun rather than the possessive noun. For example, `Andrew` instead of `Andrew’s`.
- Fixed an edge case with Entity Detection where under certain contexts, a disfluency could be identified as an entity.
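The webhook change above amounts to a range check on the status code your endpoint returns. A minimal sketch of that rule, using a hypothetical helper name:

```python
def is_successful_webhook_status(status_code: int) -> bool:
    """Return True for any status code from 200 to 299 inclusive."""
    return 200 <= status_code <= 299


# A webhook handler that replies with 204 (No Content) now counts as a success.
for code in (200, 204, 299, 301, 500):
    print(code, is_successful_webhook_status(code))
```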
January 17, 2022
Punctuation and Casing Accuracy Improved, Inverse Text Normalization Model Updated
- Released v4 of our Punctuation model, increasing punctuation and casing accuracy by ~2%.
- Updated our Inverse Text Normalization (ITN) model for our `/v2/transcript` endpoint, improving web address and email address formatting and fixing the occasional number formatting issue.
- Fixed an edge case where multi-channel files would return no text when the two channels were out of phase with each other.
January 10, 2022
Support for Non-English Languages Coming Soon
- Our Deep Learning team has been hard at work training our new non-English language models. In the coming weeks, we will be adding support for French, German, Italian, and Spanish.
January 3, 2022
Shorter Summaries Added to Auto Chapters, Improved Filler Word Detection
- Added a new `gist` key to the Auto Chapters feature. This new key provides an ultra-short, usually 3 to 8 word summary of the content spoken during that chapter.
- Implemented profanity filtering into Auto Chapters, which will prevent the API from generating a `summary`, `headline`, or `gist` that includes profanity.
- Improved Filler Word (aka, disfluencies) detection by ~5%.
- Improved accuracy for Real-Time Streaming Transcription.
- Fixed an edge case where WebSocket connections for Real-Time Transcription sessions would occasionally not close properly after the session was terminated. This resulted in the client receiving a 4031 error code even after sending a session termination message.
- Corrected a bug that occasionally attributed disfluencies to the wrong utterance when Speaker Labels or Dual-Channel Transcription was enabled.
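As an illustration of the new `gist` key, the sketch below pulls the gist out of each chapter in a completed transcript. The `summary`, `headline`, and `gist` keys are named in this changelog; the top-level `chapters` list and the sample data are assumptions made up for this example.

```python
def chapter_gists(transcript: dict) -> list[str]:
    """Collect the `gist` of every chapter, skipping chapters that lack one."""
    chapters = transcript.get("chapters") or []  # `chapters` key is assumed
    return [chapter["gist"] for chapter in chapters if chapter.get("gist")]


# Hypothetical response fragment, for illustration only.
sample_transcript = {
    "chapters": [
        {"headline": "The team reviews last quarter.", "gist": "Quarterly review", "summary": "..."},
        {"headline": "Plans for the next release.", "gist": "Release planning", "summary": "..."},
    ]
}

print(chapter_gists(sample_transcript))  # ['Quarterly review', 'Release planning']
```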
December 27, 2021
v8.5 Asynchronous Transcription Model Released
- Our Asynchronous Speech Recognition model is now even better with the release of v8.5.
- This update improves overall accuracy by 4% relative to our v8 model.
- This is achieved by improving the model’s ability to handle noisy or difficult-to-decipher audio.
- The v8.5 model also improves Inverse Text Normalization for numbers.
December 20, 2021
New and Improved API Documentation
- Launched the new AssemblyAI Docs, with more complete documentation and an easy-to-navigate interface so developers can effectively use and integrate with our API. Click here to view the new and improved documentation.
- Added two new fields to the `FinalTranscript` response for Real-time Transcriptions. The `punctuated` key is a Boolean value indicating if punctuation was successful. The `text_formatted` key is a Boolean value indicating if Inverse Text Normalization (ITN) was successful.
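A client could use the two new Boolean fields to decide whether a final transcript is ready for display or needs fallback formatting. The sketch below assumes the message arrives as JSON with a `message_type` field equal to `FinalTranscript` and a `text` field; those two names and the sample message are assumptions, while `punctuated` and `text_formatted` come from the entry above.

```python
import json
from typing import Optional


def formatting_status(raw_message: str) -> Optional[dict]:
    """Return the punctuation/ITN flags for a FinalTranscript message, else None."""
    message = json.loads(raw_message)
    if message.get("message_type") != "FinalTranscript":  # assumed field name
        return None
    return {
        "punctuated": bool(message.get("punctuated", False)),
        "text_formatted": bool(message.get("text_formatted", False)),
    }


# Hypothetical message, for illustration only.
sample_message = json.dumps({
    "message_type": "FinalTranscript",
    "text": "We shipped on May 2nd.",
    "punctuated": True,
    "text_formatted": False,
})

print(formatting_status(sample_message))  # {'punctuated': True, 'text_formatted': False}
```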
December 13, 2021
Inverse Text Normalization Added to Real-Time, Word Boost Accuracy Improved
- Inverse Text Normalization (ITN) added for our `/v2/realtime` and `/v2/stream` endpoints. ITN improves formatting of entities like numbers, dates, and proper nouns in the transcription text.
- Improved accuracy for Custom Vocabulary (aka, Word Boosts) with the Real-Time transcription API.
- Fixed an edge case that would sometimes cause transcription errors when `disfluencies` was set to `true` and no words were identified in the audio file.