Common Voice dataset

Each entry in the dataset consists of a unique MP3 and corresponding text file. Many of the 1,368 recorded hours in the dataset also include demographic metadata like age, sex, and accent that can help train the accuracy of speech recognition engines. The dataset currently consists of 1,087 validated hours in 18 languages, but we're always adding more voices and languages.

Languages supported:

10.0/10 (Expert Score) ★★★★★
Product is rated as #1 in category Machine Learning Data Catalog Software
Ease of use
Support
Ease of Setup

Each entry in the dataset consists of a unique MP3 and corresponding text file. Many of the 1,368 recorded hours in the dataset also include demographic metadata like age, sex, and accent that can help train the accuracy of speech recognition engines. The dataset currently consists of 1,087 validated hours in 18 languages, but we’re always adding more voices and languages.

Common Voice dataset
Common Voice dataset

Show more categories

Customer Reviews

Common Voice dataset Reviews

User in Information Technology and Services

Advanced user of Common Voice dataset
★★★★★
Great voice resignation software with data qualities

What do you like best?

I like most about this tool is this is a project to help to make voice recognition open to everyone.common voice data set is unique in its size and diversity.Now you can donate your voice to build an open source voice data set. common voice data set releases largest to-date public domain voice data set including 18 languages.almost 1400 hrs data.

What do you dislike?

There are no dislikes.thank you for the this great and tool. Need to improve security and privacy.Need tp provide better data and more data because competition and openness are healthy for competition. data quality an data quantity need to improve with the help of community efforts and partnership.Many voice resignation devices struggle to understand female voices. this is why is database there is need variety

Recommendations to others considering the product:

Mozilla common voice data set is definitely beneficial for larger as well as small scale organisations.

It have strong community thanks for the effort to the community at voice. Mozilla. Mozilla updated common voice data set contains more than 1400 hrs of speech from 420000 contributors across all over world.

What problems are you solving with the product? What benefits have you realized?

The common voice data set seeks to be a part of the project, its a platform where anyone can donate your voice to an open source data bank.This is community that brings the voices of the world to creators.voice can be as unique as fingerprint,That is the wonder and greatest challenge to virtual assistance.

Review source: G2.com

Leave a reply

Your total score

B2B Software Guide