Age manipulation
Age pseudo-label response from the external speech age predictor.
| Cohort | Source / reconstruction | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
|---|---|---|---|---|---|---|
| 20s M | Source Reconstruction GLOBE · GLOBE::train::S_020558::00123229_000000.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 20s F | Source Reconstruction LibriTTS · LibriTTS::7177::7177_258965_000015_000001 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 40s M | Source Reconstruction GLOBE · GLOBE::train::S_002060::00057675_000001.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 40s F | Source Reconstruction GLOBE · GLOBE::train::S_000020::00454691_000017.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 60s M | Source Reconstruction GLOBE · GLOBE::train::S_002429::00306976_000012.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 60s F | Source Reconstruction LibriTTS · LibriTTS::8778::8778_246974_000024_000009 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
Perceived gender presentation manipulation
Model-predicted male-presentation probability. This is a pseudo-label, not demographic ground truth.
| Cohort | Source / reconstruction | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
|---|---|---|---|---|---|---|
| 20s M | Source Reconstruction GLOBE · GLOBE::train::S_011895::00338957_000002.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 20s F | Source Reconstruction VoxCeleb1 · VoxCeleb1::train::id10258::23dSOm3axoU::00003 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 40s M | Source Reconstruction AESRC · AESRC::Indian_G1757::G1757S2334 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 40s F | Source Reconstruction VoxCeleb1 · VoxCeleb1::train::id10968::t3z1N9QWI_8::00005 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 60s M | Source Reconstruction LibriTTS · LibriTTS::4179::4179_25937_000039_000002 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 60s F | Source Reconstruction GLOBE · GLOBE::train::S_013904::00115359_000001.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
Pitch manipulation
Habitual pitch response measured with median log-F0.
| Cohort | Source / reconstruction | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
|---|---|---|---|---|---|---|
| 20s M | Source Reconstruction GLOBE · GLOBE::train::S_020558::00123229_000000.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 20s F | Source Reconstruction LibriTTS · LibriTTS::3889::3889_9915_000007_000001 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 40s M | Source Reconstruction NaturalVoices · NaturalVoices::MSP-PODCAST_3298::MSP-PODCAST_3298_211 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 40s F | Source Reconstruction NaturalVoices · NaturalVoices::MSP-PODCAST_0478::MSP-PODCAST_0478_1 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 60s M | Source Reconstruction LibriTTS · LibriTTS::2660::2660_173260_000014_000001 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 60s F | Source Reconstruction VoxCeleb1 · VoxCeleb1::train::id10693::EW4Cxe52kL4::00006 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
HNR / voice-quality manipulation
Voice-quality response measured with the corrected median HNR estimator.
| Cohort | Source / reconstruction | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
|---|---|---|---|---|---|---|
| 20s M | Source Reconstruction GLOBE · GLOBE::train::S_007523::00435966_000002.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 20s F | Source Reconstruction GLOBE · GLOBE::train::S_004523::00062671_000000.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 40s M | Source Reconstruction GLOBE · GLOBE::train::S_002060::00057675_000001.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 40s F | Source Reconstruction LibriTTS · LibriTTS::6782::6782_61316_000007_000010 | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 60s M | Source Reconstruction GLOBE · GLOBE::val::S_010055::00002970_000003.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |
| 60s F | Source Reconstruction GLOBE · GLOBE::train::S_013904::00115359_000001.v2.vad | -2.0 | -1.0 | 0.0 | +1.0 | +2.0 |