Voice Data for AI Training – Project of Recording 3,000 Voice Samples Nationwide

OVERVIEW

Industry

Information Technology

Solution

Voice data collection

Background

The project requires a standardized amount of voice-recording data to be delivered within a restricted timeframe
Recordings must meet the project’s script and technical standards

CHALLENGES AND PROBLEM

ChallengeSpecific requirements
Large data volume, urgent timelineComplete 3,000 voice samples in 80 days.
Ensure recording quality according to technical standardsCorrect recording script; noise, reverberation, RMS reaching permissible thresholds.
Diverse voice distributionRegion, gender, age must match committed ratios.
Support both online and offline formsDistribute work flexibly according to actual conditions.

SOLUTION

Project of Recording 3,000 Voice Samples Nationwide

1. Preparation

  • Recruitment of recording personnel from regions according to requirements.
  • Skill training, instruction on using online/offline recording software.
  • Studio testing regarding soundproofing standards, equipment, and noise levels.

2. Recording Activities

  • Offline recording (14 days): Executed in 3 regions, each sample taking 24–32 minutes.
  • Online recording (60 days): Using the partner's compatible recording software.
  • Progress management, quality control of each sample throughout the process.

3. Inspection & Calibration

  • Each sample not meeting requirements will be requested to be re-recorded.
  • Client's audio engineers coordinate to inspect and evaluate quality.
  • Ensure all data meets standards before handover.

Only approved samples will be counted. Any rejected samples will be re-recorded by Bellsystem24 Vietnam until they meet the requirements.

Some preparations for the recording studio

General requirements for the recording studio: The studio must be well soundproofed, with noise and reverberation kept within the allowed technical limits, ensuring professional-grade audio quality The studio must also be prepared and ready for use at least 3 days prior to the recording date.”

Recording studio layout

Recording studio layout

Equipment for each recording room

Equipment for each recording room

RESULT

  • 100% of the project completed on schedule, no delays.
  • 3,000 voice samples collected fully and meeting technical standards.
  • Data distributed diversely by region, gender, and age according to partner requirements.
  • Each recorded sample meets standards, ready to serve as input for AI training.
  • Ensured high data quality, supporting effective AI system development.

Ai training data services →

OTHER CASE STUDIES

Case Studies / Information technology

Voicebot AI Labeling Project – When humans teach machines to understand the Vietnamese language

Case Studies / Entertainment

Automate customer service with Bellsystem24 Vietnam's multi-channel AI chat solution

Case Studies / Automotive industry

Mystery shopping

Managing service quality of car showroom chain with Mystery Shopping service

Scroll to Top

Let BSV help you gain deeper insights through a 1:1 consultation session