To achieve the best results with ReSing Modeler, it’s essential to record a clean, consistent, and expressive dataset, the collection of audio files used to train your AI voice/instrument model. The model will replicate every nuance, strength, and flaw present in your recordings, so quality and consistency are key.
- Recording Content
For an high quality model record around 60 minutes of expressive singing in a single vocal style and language. Include:
- Sustained vowels, pitch runs, and common phrases that cover the full vocal range (low, mid, high).
- Natural inflections, vibrato, and phrasing that represent your intended style.
Keep all recordings monophonic (no harmonies or background vocals) and capture them in a single session using the same microphone and setup to maintain tonal consistency.
- Technical Specifications
- Format: 24-bit / 48 kHz, mono, uncompressed (.wav, .aiff, or .flac)
- Microphone: Professional large-diaphragm condenser
- Interface: Clean, transparent preamp or audio interface without coloration
- Recording Environment
Use an acoustically treated or dampened space to avoid reflections and background noise.
Keep the microphone away from walls or reflective surfaces, and monitor the input through headphones while recording.
The singer should wear closed-back headphones to prevent sound leakage into the mic.