20 Feb 2018
Speechmatics launches Global English, an accent-agnostic language pack for speech-to-text transcription
Speechmatics has launched its Global English. This is a single English language pack supporting all major English accents for use in speech-to-text transcription. Global English, GE was trained on thousands of hours of spoken data from over 40 countries and tens of billions of words drawn from global sources, making it one of the most comprehensive and accurate accent-agnostic transcription solutions on the market.
When tested against providers of similar solutions, GE consistently produced more accurate transcriptions. Compared directly, GE was between 3% and 55% more accurate than all Google’s Cloud Speech API accent-specific language packs and between 5% and 23% more accurate than IBM’s Cloud US English language pack*.
Traditionally, speech recognition has dealt with variations in language by producing a different language pack for every distinct accent or region. However, this meant a whole new set of models trained on data from that particular subset of speakers of the languages. With the launch of GE, Speechmatics is aiming to democratise speech-to-text transcription to overcome industry-wide issues where there are multiple English accents in one recording. Thus, providing a far more accurate, consistent and cost-effective solution.
Speech recognition has advanced hugely in recent years, making GE possible. The team has been gathering data from a wide range of sources and taking advantage of the astonishing rise in computer power, allowing them to train bigger models, based on more data, capable of supporting more variations. Speechmatics has now built 72 unique languages, more than any other provider on the market, including Amazon, Google, Nuance, Microsoft and IBM. With the modern neural network architectures capable of generalising across variations in speech by using representation learning, Speechmatics were able to generate the accuracy of multiple specialised models all in one language pack.
Benedikt von Thüngen, CEO at Speechmatics, explained: “At Speechmatics, we have historically produced North American, British and Australian versions of the English language packs, as well as domain-specific language packs. Applications include broadcast, compliance, speech analytics, call recording and meeting transcription among others. While a traditional British language pack does indeed perform better on British accented speech than say, a traditional North American language pack would, there’s still tens of distinct British accents to address. And so, we realised we need to come up with what we like to call ‘One Model to Rule Them All’ - an accent-agnostic language pack that is just as accurate at transcribing Australian accent as it is with Scottish.”
Most popular news in Cellular telecomsLarge-scale trial prepares for future 5G networks
National Instruments adopts AccelerComm's 5G NR polar IP
5G New Radio data call with 4G and 5G dual connectivity in China
Seeed announces two IoT development boards using u-blox technology
Ericsson Mobility Report released
Share this page
Want more like this? Register for our newsletter