CHAPTER 1 AUDIO DEMONSTRATIONS: INTRODUCTION These audio demonstrations illustrate some of the application areas that are introduced in Chapter 1. Additional examples can be found in the directories of other chapters. ------------------------------------------------------------------- Audio Demo 1.1: Time-scale modification of speech and quasi-periodic audio - Sinewave-based modification with voicing-dependent rate factor (Section 9.5.2) Male Speaker ------------ tfq.tea.org.10k Original tfq.tea.tsmtv0p8.10k Fast tfq.tea.tsmtv0p5.10k Faster tfq.tea.tsmtv1p2.10k Slow tfq.tea.tsmtv1p5.10k Slower Female Speaker -------------- ln.swm.org.10k Original ln.swm.tsmtv0p8.10k Fast ln.swm.tsmtv0p5.10k Faster ln.swm.tsmtv1p2.10k Slow ln.swm.tsmtv1p5.10k Slower Trumpet ------- trumpet.org.10k Original trumpet.tsm0p75.10k Fast trumpet.tsm1p25.10k Slow ------------------------------------------------------------------- Audio Demo 1.2: Time-scale modification of complex non-speech signals - Phase vocoder-based modifcation with event-dependent phase coherence (Section 8.4.1) Falling Can ----------- falling_can1.org.10k Original falling_can1.tsm2p0.10k Slow Bongo Drums ----------- bongo_drums.org.10k Original bongo_drums.tsm2p0.10k Slow Loon ---- loon.org.10k Original loon.tsm2p0.10k Slow ------------------------------------------------------------------- Audio Demo 1.3: Pitch and vocal tract length change - Sinewave-based modification (Section 9.5.2/Exercise 9.11) Male Speaker ------------ cp.seg.org.8k Original cp.seg.PitSpec_low.8k Low pitch/Long vocal tract cp.seg.PitSpec_high.8k High pitch/Short vocal tract Female Speaker -------------- glo.org.8k Original glo.PitSpec_low.8k Low pitch/Long vocal tract glo.PitSpec_high.8k High pitch/short vocal tract Male Speaker ------------ mono_pitch.16k File contains (three utterance pairs): a: Original b: Monotone pitch ------------------------------------------------------------------- Audio Demo 1.4: Speech coding - Examples of sinewave-based (Section 12.5.2) and CELP-based (Section 12.7.3) coding Male Speaker ------------ tfq.tea.org.8k Original tfq.tea.8000.8k CELP-based (g729) coding at 8000 bps tfq.tea.4800.8k Sinewave-based coding at 4800 bps tfq.tea.2400.8k Sinewave-based coding at 2400 bps Female Speaker ------------ mlm.tea.org.8k Original mlm.tea.8000.8k CELP-based (g729) coding at 8000 bps mlm.tea.4800.8k Sinewave-based coding at 4800 bps mlm.tea.2400.8k Sinewave-based coding at 2400 bps ------------------------------------------------------------------- Audio Demo 1.5: Post-processing enhancement - Noise reduction adaptive Wiener filter with adaptivity based on spectral change (Section 13.3.3) Cellular Telephone Noise ------------------------ s4141t03.org.10k Original s4141t03.enh.10k Enhanced Cocktail Party Noise -------------------- party.org_modsnr.10k Original party.enh_modsnr.10k Enhanced Automobile Noise ---------------- auto.org_lowsnr.10k Original auto.enh_lowsnr.10k Enhanced ------------------------------------------------------------------- Audio Demo 1.6: Pre-processing enhancement - Reduction of peak-to-rms based on sinewave analysis/synthesis (Section 9.5.2) - Re-digitized from analog tape so original peak-to-rms is somewhat altered - Each file contains a: Original b: Mild reduction in peak-to-rms (~1.5 dB) c: Large reduction in peak-to-rms (~3.0 dB) post_NoNoise.16k Low-noise case post_WithNoise.16k High-noise case -------------------------------------------------------------------