Audio Processing

Mitarbeiter: Jörn Ostermann, Reemt Hinrichs, Alexander Lange
Einleitung

Das Institut für Informationsverarbeitung (TNT) besitzt langjährige Erfahrung im Bereich der Audiosignalverarbeitung. Es werden unter anderem Verfahren für die Datenreduktion so wie Methoden zur Klassifikation von Audiosignalen entwickelt. Des Weiteren werden Algorithmen für die Lokalisation von Signalereignissen und Verfahren zur Schätzung von Impulsantworten in Festkörpern entworfen. In diesem Exposé finden Sie eine Auswahl unserer aktuellen Forschungstätigkeit.

Aktuelle Forschungsthemen

Audiokodierung mit niedrigster Latenz:

Bei Liveanwendungen wie Konzerten oder Übertragungen über das Internet ist die von einem Kodierungsverfahren hinzugefügte Verzögerung ein Problem. Die am TNT entwickelte Technik ermöglicht nahezu verzögerungsfreie Datenratenreduktion unter Erhaltung hoher Audioqualität.

Schadensfrüherkennung an Rotorblättern von Windenergieanlagen:

Das am TNT entworfene Verfahren erkennt Rotorblattschäden mit deutlich weniger Sensoren als vergleichbare Methoden. Dazu werden Verfahren der Audioklassifikation eingesetzt welcher in Luftschallsignalen Schäden auch bei Nebengeräuschen erkennt.

Kodierung für Cochlea-Implantate:

Für Träger der Hörhilfen der Cochlea-Implantate ist die Sprachverständlichkeit bei Umgebungsgeräuschen schwierig. Die am TNT entwickelten Kompressionsverfahren für Erregungsmuster ermöglichen den Einsatz binauraler Verarbeitungsstrategien. Damit ist es möglich die Sprachverständlichkeit zu erhöhen.

Verwendete Methoden

Irrelevanz reduzierende Kodierung, Klassifikationsverfahren, Künstliche Neuronale Netze, Verlustlose Kodierungsverfahren, Audio Feature Entwurf, Adaptive Vektorquantisierung, Context-Adaptive Binary Arithmetic Coding,Time Difference of Arrival Localization, Head-related transfer function

  • Conference Contributions
    • Reemt Hinrichs, Jörn Ostermann
      Pruning-aware Loss Functions for STOI-Optimized Pruned Recurrent Autoencoders for the Compression of the Stimulation Patterns of Cochlear Implants at Zero Delay
      Asilomar Conference on Signals, Systems, and Computers, October 2024
    • Ronghua Xu, Alexander Lange, Max Käding, Steffen Marx, Jörn Ostermann
      Energy spectral analysis of wire breaks in post-tensioned tendons for wind turbines
      19th EAWE PhD Seminar on Wind Energy, pp. 204-207, September 2023
    • Alexander Lange, Max Käding, Ronghua Xu, Steffen Marx, Jörn Ostermann
      Semi-supervised learning for acoustic emission monitoring of tendons in prestressed concrete bridges
      14th International Workshop on Structural Health Monitoring (IWSHM), Stanford, September 2023
    • Christopher Gebauer, Lars Rumberg, Jörn Ostermann
      Pronunciation Modeling for Children’s Speech
      Elektronische Sprachsignalverarbeitung (ESSV), TUDpress, Dresden, pp. 79--86, March 2023
    • Hanna Ehlert, Edith Beaulac, Maren Wallbaum, Christopher Gebauer, Lars Rumberg, Jörn Ostermann, Ulrike Lüdtke
      Collecting and Annotating Natural Child Speech Data – Challenges and Interdisciplinary Perspectives
      Elektronische Sprachsignalverarbeitung (ESSV), TUDpress, Dresden, pp. 72--78, March 2023
    • Lars Rumberg, Christopher Gebauer, Hanna Ehlert, Maren Wallbaum, Ulrike Lüdtke, Jörn Ostermann
      Uncertainty Estimation for Connectionist Temporal Classification Based Automatic Speech Recognition
      Proc. INTERSPEECH 2023, pp. 4583--4587, August 2023
    • Alexander Lange, Max Käding, Reemt Hinrichs, Jörn Ostermann, Steffen Marx
      Wire Break Detection in Bridge Tendons Using Low-Frequency Acoustic Emissions
      European Workshop on Structural Health Monitoring. EWSHM 2022., Springer, June 2022
    • Alexander Lange, Reemt Hinrichs, Jörn Ostermann
      Localized Damage Detection in Wind Turbine Rotor Blades using Airborne Acoustic Emissions
      9th Asia-Pacific Workshops on Structural Health Monitoring 2022 (APWSHM 2022), December 2022
    • Lars Rumberg, Christopher Gebauer, Hanna Ehlert, Maren Wallbaum, Lena Bornholt, Jörn Ostermann, Ulrike Lüdtke
      kidsTALC: A Corpus of 3- to 11-year-old German Children’s Connected Natural Speech
      Proceedings INTERSPEECH 2022 – 23rd Annual Conference of the International Speech Communication Association, ISCA, September 2022
    • Lars Rumberg, Christopher Gebauer, Hanna Ehlert, Ulrike Lüdtke, Jörn Ostermann
      Improving Phonetic Transcriptions of Children’s Speech by Pronunciation Modelling with Constrained CTC-Decoding
      Proceedings INTERSPEECH 2022 – 23rd Annual Conference of the International Speech Communication Association, ISCA, September 2022
    • Lars Rumberg, Hanna Ehlert, Ulrike Lüdtke, Jörn Ostermann
      Age-Invariant Training for End-to-End Child Speech Recognition using Adversarial Multi-Task Learning
      Proceedings INTERSPEECH 2021 -- 22th Annual Conference of the International Speech Communication Association, August 2021
    • Sönke Südbeck, Thomas Krause, Jörn Ostermann
      Non-Line-of-Sight Time-Difference-of-Arrival Localization with Explicit Inclusion of Geometry Information in a Simple Diffraction Scenario
      IEEE MMSP 2020 - IEEE International Workshop on Multimedia Signal Processing, September 2020
    • Reemt Hinrichs, Tom Gajecki, Jörn Ostermann, Waldo Nogueira
      Coding of Electrical Stimulation Patterns for Binaural Sound Coding Strategies for Cochlear Implants
      41st International Engineering in Medicine and Biology Conference, July 2019
    • Thomas Krause, Jörn Ostermann
      Acoustic Emission Localization Using Airborne Sound: Where Did the Wind Turbine Rotor Blade Crack?
      9th European Workshop on Structural Health Monitoring (EWSHM), July 2018
    • Stavroula Tsiapoki, Thomas Krause, Moritz W. Häckell, Raimund Rolfes, Jörn Ostermann
      Combining a Vibration-Based SHM-Scheme and an Airborne Sound Approach for Damage Detection on Wind Turbine Rotor Blades
      8th European Workshop on Structural Health Monitoring, July 2016
    • Stephan Preihs, Jörn Ostermann
      Globally Optimized Dynamic Bit-Allocation Strategy for Subband ADPCM-Based Low Delay Audio Coding
      40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, April 2015
    • Stephan Preihs, Christoph Wacker, Jörn Ostermann
      Adaptive Pre- and Post-Filtering for a Subband ADPCM-based Low Delay Audio Codec
      2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, October 2015
    • Stephan Preihs, Jörn Ostermann
      Using Cascaded Global Optimization for Filter Bank Design in Low Delay Audio Coding
      139th AES Convention [peer-reviewed], New York, October 2015
    • Thomas Krause, Stephan Preihs, Jörn Ostermann
      Acoustic Emission Damage Detection for Wind Turbine Rotor Blades Using Airborne Sound
      10th International Workshop on Structural Health Monitoring (IWSHM) , September 2015
    • Thomas Krause, Stephan Preihs, Jörn Ostermann
      Detection of Impulse-Like Airborne Sound for Damage Identification in Rotor Blades of Wind Turbines
      7th European Workshop on Structural Health Monitoring (EWSHM), July 2014
    • Thomas Krause, Stephan Preihs, Jörn Ostermann
      Airborne Sound Based Damage Detection for Wind Turbine Rotor Blades Using Impulse Detection in Frequency Bands
      1st International Wind Engineering Conference (IWEC), September 2014
    • Stephan Preihs, Fabian-Robert Stöter, Jörn Ostermann
      Low Delay Error Concealment for Audio Signals
      46th AES Conference on Audio Forensics, Denver, June 2012
    • Stephan Preihs, Jörn Ostermann
      Error Robust Low Delay Audio Coding based on Subband ADPCM
      131st AES Convention, New York, October 2011
    • Sascha Disch, Bernd Edler
      Multiband perceptual modulation analysis, processing and synthesis of audio signals
      ICASSP International Conference on Acoustics, Speech and Signal Processing , IEEE CNF, Taipei, Taiwan, April 2009
    • Sascha Disch, Bernd Edler
      An iterative segmentation algorithm for audio signal spectra depending on estimated local centers of gravity
      12th International Conference on Digital Audio Effects (DAFx-09), September 2009
    • Tom Bäckström, Sascha Disch
      Parametric AM/FM Decomposition for Speech and Audio Coding
      Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA'09), October 2009
    • Sascha Disch, Bernd Edler
      An Amplitude- and Frequency Modulation Vocoder for Audio Signal Processing
      11th International Conference on Digital Audio Effects (DAFx-08), September 2008
    • Waldo Nogueira, Tamás Harczos, Bernd Edler, Joern Ostermann, Andreas Büchner
      Automatic Speech Recognition with a Cochlear Implant Front-End
      Interspeech, August 2007
    • W Nogueira, A Kátai, T Harczos, F Klefenz, A Büchner, B Edler
      An Auditory Model based Strategy for Cochlear Implants
      Annual International Conference of the IEEE Engineering in Medicine and Biology Society EMBC , Annual International Conference of the IEEE Engineering in Medicine and Biology Society EMBC, Lyon, August 2007
    • Waldo Nogueira, Martina Brendel, Bernd Edler, Andreas Buechner
      A Novel Signal Processing Strategy for Current Steering in Cochlear Implants
      2007 Conference on Implantable Auditory Prostheses, Granlibakken, USA, July 2007
    • T Harczos, W Nogueira, G Szepannek, F Klefenz
      Comparative evaluation of successive cochlear modeling stages as possible frontends for automatic speech recognition
      19th Int. Congress on Acoustics ICA, Madrid, Spain, September 2007
    • Sascha Disch, Jürgen Herre, Julius Kammerl
      Audio watermarking using subband modulation spectra
      ICASSP International Conference on Acoustics, Speech and Signal Processing, Vol. 1, pp. 245-248, Honolulu, Hawaii, April 2007
    • Waldo Nogueira, Andreas Giese, Bernd Edler, Andreas Büchner
      Wavelet Packet Filterbank for Speech Processing Strategies in Cochlear Implants
      Int. Conf. on Acoustics, Speech, and Signal Processing, IEEE, Toulouse, France, May 2006
    • Waldo Nogueira, Bernd Edler, Caroline Frohne-Büchner, Martina Brendel, Andreas Büchner
      Sinusoidal Analysis of Audio for Current Steering Strategies in Cochlear Implants
      CI-2006 - 9th International Conference on Cochlear Implants, Vienna, Austria, June 2006
    • Amparo Albalate, Waldo Nogueira, Bernd Edler, Andreas Büchner
      Signal Analysis by using Adaptive Filterbanks in Cochlear Implants
      IEEE Biomedical Circuits and Systems Conference (BIOCAS 2006), London, G.B, 2006
    • Oliver Niemeyer, Bernd Edler
      Detection and Extraction of Transients for Audio Coding
      120th AES Convention, Audio Engineering Society, p. Preprint 6811, Paris, May 2006
    • Waldo Nogueira, Andreas Büchner, Bernd Edler
      Fundamental Frequency Coding in NofM Strategies for Cochlear Implants
      118th AES Convention, Audio Engineering Society, p. Preprint 6515, Barcelona, May 2005
    • Nikolaus Meine, Bernd Edler
      Improved Quantization and Lossless Coding for Subband Audio Coding
      118th AES Convention, Audio Engineering Society, p. Preprint 6468, Barcelona, May 2005
    • Oliver Niemeyer, Bernd Edler
      Efficient Coding of Excitation Patterns Combined with a Transform Audio Coder
      118th AES Convention, Audio Engineering Society, p. Preprint 6466, Barcelona, May 2005
    • Heiko Purnhagen, Nikolaus Meine, Bernd Edler
      Sinusoidal Coding Using Loudness-Based Component Selection
      Int. Conf. on Acoustics, Speech, and Signal Processing, Orlando, May 2002
    • Heiko Purnhagen, Bernd Edler, Nikolaus Meine
      Error Protection and Concealment for HILN MPEG-4 Parametric Audio Coding
      110th AES Convention, Audio Engineering Society, p. Preprint 5300, Amsterdam, May 2001
    • Heiko Purnhagen, Nikolaus Meine, Bernd Edler
      Speeding up HILN - MPEG-4 Parametric Audio Encoding with Reduced Complexity
      109th AES Convention, Audio Engineering Society, p. Preprint 5177, Los Angeles, September 2000
    • Bernd Edler, Heiko Purnhagen
      Parametric Audio Coding
      5th International Conference on Signal Processing (ICSP 2000), Beijing, August 2000
    • Frank Baumgarte, Bernd Edler
      Ein Psychophysiologisches Gehörmodell zur Nachbildung von Wahrnehmungsschwellen für die Audiocodierung
      DFG-Abschlußkolloquium, München, October 2000
    • Bernd Edler, Gerald Schuller
      Audio Coding Using a Psychoacoustic Pre- and Post-Filter
      Int. Conf. on Acoustics, Speech, and Signal Processing, Istanbul, June 2000
    • Bernd Edler, Christof Faller, Gerald Schuller
      Perceptual Audio Coding Using a Time-Varying Linear Pre- and Post-Filter
      109th AES Convention, Audio Engineering Society, p. Preprint 5274, Los Angeles, September 2000
    • Gerald Schuller, Bernd Edler, Adele Doser
      A Method for Alias Reduction in Cascaded Filter Banks
      9th IEEE DSP Workshop, IEEE, Hunt, TX, October 2000
    • Heiko Purnhagen, Bernd Edler, Charalampos Ferekidis
      Object-Based Analysis/Synthesis Audio Coder for Very Low Bit Rates
      104th AES Convention, Audio Engineering Society, p. Preprint 4747, Amsterdam, May 1998
    • Bernd Edler, Heiko Purnhagen
      Concepts for Hybrid Audio Coding Schemes Based on Parametric Techniques
      105th AES Convention, Audio Engineering Society, p. Preprint 4808, San Francisco, September 1998
    • Heiko Purnhagen, Bernd Edler
      Objektbasierter Analyse/Synthese Audio Coder für sehr niedrige Datenraten
      ITG-Fachtagung "Codierung für Quelle, Kanal und Übertragung", Aachen, March 1998
    • Bernd Edler
      Very Low Bit Rate Audio Coding Developement
      14th International AES Conference "internet.aes.org", Seattle, June 1997
    • Bernd Edler
      Overview on the Current Development of MPEG-4 Audio Coding
      4th International Workshop on Systems, Signals and Image Processing, Poznan, May 1997
    • Bernd Edler, Heiko Purnhagen, Charalampos Ferekidis
      ASAC - Analysis/Synthesis Audio Codec for Very Low Bit Rates
      100th AES Convention, Audio Engineering Society, p. Preprint 4179, Copenhagen, May 1996
    • Bernd Edler
      Current Status of the MPEG-4 Audio Verification Model Development
      101th AES Convention, Audio Engineering Society, p. Preprint 4376, Los Angeles, November 1996
    • Thomas Sporer, Karlheinz Brandenburg, Bernd Edler
      The Use of Multirate Filter Banks for High Quality Digital Audio
      EUSIPCO '92, Bruxelles, September 1992
    • Karlheinz Brandenburg, Ernst Eberlein, Jürgen Herre, Bernd Edler
      Comparison of Filter Banks for High Quality Audio Coding
      IEEE Int. Symp. on Circuits and Systems, San Diego, May 1992
    • Bernd Edler
      Codierung von Audiosignalen mit überlappender Transformation und adaptiven Fensterfunktionen
      Kleinheubacher Tagung, Kleinheubach, October 1989
    • Bernd Edler
      Prädiktive Teilbandcodierung mit Vektorquantisierung für hochqualitative Audiosignale
      8. ITG-Fachtagung Hörrundfunk, Mainz, November 1988
  • Journals
    • Ronghua Xu, Raul Enrique Beltran-Gutierrez, Max Käding, Alexander Lange, Steffen Marx, Jörn Ostermann
      Frequency dependent amplitude response of different couplant materials for mounting piezoelectric sensors
      NDT & E International, Elsevier, Vol. 141, January 2024
    • Ulrike Lüdtke, Juan Bornman, Febe de Wet, Ulrich Heid, Jörn Ostermann, Lars Rumberg, Jeannie Van der Linde, Hanna Ehlert
      Multidisciplinary Perspectives on Automatic Analysis of Children’s Language Samples: Where Do We Go from Here?
      Folia Phoniatrica et Logopaedica, Karger Publishers, Vol. 75, No. 1, pp. 1--12, 2023
    • Reemt Hinrichs, Tom Gajecki, Jörn Ostermann, Waldo Nogueira
      A subjective and objective evaluation of a codec for the electrical stimulation patterns of cochlear implants
      Journal of the Acoustic Society of America, March 2021
    • Thomas Krause, Jörn Ostermann
      Damage Detection for Wind Turbine Rotor Blades Using Airborne Sound
      Structural Control and Health Monitoring, February 2020
    • Nicolle van Schijndel, Julien Bensa, Mads Christensen, Catherine Colomes, Bernd Edler, Richard Heusdens, Jesper Jensen, Jensen Søren, Bastiaan Kleijn, Valery Kot, Balazs Kovesi, Jonas Lindblom, Dominique Massaloux, Omar Niamut, Frederik Norden, Jan Plasberg, Renat Vafin, Steven van de Par, David Virette, Oliver Wübbolt
      Adaptive RD Optimized Hybrid Sound Coding
      J. Audio Eng. Soc., Audio Engineering Society, Vol. 56, No. 10, pp. 787-809, October 2008
    • Waldo Nogueira, Andreas Büchner, Thomas Lenarz, Bernd Edler
      A Psychoacoustic "NofM"-type Speech Coding Strategy for Cochlear Implants
      Journal on Applied Signal Processing, Special Issue on DSP in Hearing Aids and Cochlear Implants, Eurasip, Vol. 127, No. 18, pp. 3044-3059, November 2005
    • Bernd Edler
      Audiocodierung in MPEG-4
      Telekommunikation Aktuell, Verlag für Wissenschaft und Leben, Erlangen, July 2004
    • Gerald Schuller, Bin Yu, Dawei Huang, Bernd Edler
      Perceptual Audio Coding Using Adaptive Pre- and Post-filters and Lossless Compression
      IEEE Transactions on Speech and Audio Processing, IEEE, Vol. 10, No. 6, pp. 379-390, September 2002
    • Laura Contin, Bernd Edler, David Meares, Pete Schreiner
      Tests on MPEG-4 Audio Codec Proposals
      Signal Processing: Image Communication, Vol. 9, No. 4, pp. 327-342, May 1997
    • Miodrag Temerinac, Bernd Edler
      Overlapping Block Transform: Window Design, Fast Algorithm and an Image Coding Experiment
      IEEE Trans. on Communications, IEEE, Vol. 43, No. 9, September 1995
    • Miodrag Temerinac, Bernd Edler
      LINC: A Common Theory of Transform and Subband Coding
      IEEE Trans. on Communications, IEEE, Vol. 41, No. 2, pp. 266-274, February 1993
    • Miodrag Temerinac, Bernd Edler
      A Unified Approach to Lapped Orthogonal Transforms
      IEEE Trans. on Image Processing, IEEE, Vol. 1, No. 1, pp. 111-116, January 1992
    • Bernd Edler
      Aliasing Reduction in Subbands of Cascaded Filter Banks with Decimation
      Electronics Letters, IEE, Vol. 28, No. 12, pp. 1104-1105, June 1992
    • Bernd Edler
      Codierung von Audiosignalen mit überlappender Transformation und adaptiven Fensterfunktionen
      Frequenz, Schiele und Schön, Vol. 43, No. 9, pp. 252-256, September 1989
  • Technical Report
    • Jörn Ostermann, Reemt Hinrichs
      Links und rechts verbinden
      Unimagazin, Leibniz Universität Hannover, No. 1, June 2020
    • Jörn Ostermann Reemt Hinrichs
      Signal Coding for Binaural Signal Processing in Cochlear Implants
      Binaire, October 2019
    • Thomas Krause, Jörn Ostermann
      Schäden an Rotorblättern akustisch erkennen
      ti! Technologie-Informationen 3 2016 Unter Strom, September 2016
    • Thomas Krause, Jörn Ostermann
      Dem Geräusch auf der Spur - Wie Roboter hören, wo etwas passiert
      Unimagazin - Forschungsmagazin der Leibniz Universität Hannover, pp. 10-13, December 2016
    • Waldo Nogueira, Bernd Edler
      Audiosignalverarbeitung für Cochlea-Implantate
      Technologie-Informationen, Technologietransfer aus Hochschulen Innovation Niedersachsen, Vol. 4, p. 6, 2006