What is the Audio Enhancement Module?

AI-based speech optimisation

New Module:
Because this module is newly developed, we are still testing and debugging it extensively. Some issues may nonetheless have escaped our internal and external testing. Please report any bugs or errors using our support form. Thank you very much!

Module Description

The Audio Enhancement module improves the sound quality of your audio and video recordings. It removes unwanted background noise, reduces distortion, and enhances speech clarity. It can also restore missing frequencies for a more natural, clearer sound. The enhanced audio can be exported as a .wav file.

With two specialized model architectures – FINCH and LARK – you can choose the appropriate enhancement for your needs:

Feature | FINCH | LARK
Primary Function | Noise reduction and distraction removal | Frequency restoration and clarity enhancement
Key Enhancements | Eliminates background noise, reduces unwanted sound artifacts | Restores missing frequencies, improves audio depth and richness
Best For | Clean audio output in noisy environments | Enhancing low-quality recordings for improved clarity
Use Cases | Podcasts, interviews, field recordings | Music remastering, archival audio restoration
Effect on Audio | Creates a crisp, distraction-free sound | Adds warmth, detail, and depth

How Does It Work?

  1. Select the Media File: Choose the media file you want to analyze.
  2. Activate the Audio Enhancement Module: In the left column, select the "Audio Enhancement" module.
  3. Define the Model & Parameters: Choose the model for analysis from the available options, set the parameters, and click the yellow "Add Module" button.
  4. Start the Analysis: You can either add more modules or begin the analysis immediately by clicking "Start Analysis".
  5. Review & Download the Results: Listen to the improved version directly or download the file as an optimized audio output.

What Parameters are available?

  • Model Architecture (FINCH or LARK)
    Choose between the two enhancement models:
    • FINCH

      Removes background noise and unwanted distractions from audio recordings. Ideal for interviews, meetings, and field recordings. 

    • LARK

      Repairs missing frequencies, enhances clarity, and adds depth for a more natural sound. Optimal for music, film productions, and high-quality speech recordings.

  • Loudness Target Level (-70 to -5 LUFS)
    Defines the target loudness level, measured in LUFS (Loudness Units relative to Full Scale).
  • Loudness Peak Limit (-9 to 0 dBTP)
    Sets the maximum permitted peak level to avoid distortion, measured in dB True Peak (dBTP).
  • Enhancement Level (0–1)
    Determines the intensity of the enhancement.
    • 1.0 – Maximum enhancement

    • 0.8 – Strong enhancement

    • 0.6 – Medium enhancement

    • 0.2 – Light enhancement

Note:
Higher enhancement levels may introduce artifacts, especially when applied to heavily distorted original recordings. Testing different values is recommended.
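
The parameter ranges listed above can be captured in a small validation sketch, for example to check settings before entering them in the interface. This is an illustration only: the class name, field names, and default values are hypothetical and do not correspond to any published API of the module. Only the allowed ranges come from the parameter list.

```python
from dataclasses import dataclass

# The two model architectures named in this article.
MODELS = ("FINCH", "LARK")


@dataclass(frozen=True)
class EnhancementSettings:
    """Hypothetical container for the four module parameters."""

    model: str = "FINCH"
    loudness_target_lufs: float = -23.0  # assumed default; allowed: -70 to -5 LUFS
    peak_limit_dbtp: float = -1.0        # assumed default; allowed: -9 to 0 dBTP
    enhancement_level: float = 0.6       # assumed default; allowed: 0 to 1

    def __post_init__(self) -> None:
        # Reject any value outside the documented ranges.
        if self.model not in MODELS:
            raise ValueError(f"model must be one of {MODELS}, got {self.model!r}")
        if not -70.0 <= self.loudness_target_lufs <= -5.0:
            raise ValueError("loudness target must be between -70 and -5 LUFS")
        if not -9.0 <= self.peak_limit_dbtp <= 0.0:
            raise ValueError("peak limit must be between -9 and 0 dBTP")
        if not 0.0 <= self.enhancement_level <= 1.0:
            raise ValueError("enhancement level must be between 0 and 1")
```

For example, EnhancementSettings(model="LARK", enhancement_level=0.8) is accepted, while EnhancementSettings(loudness_target_lufs=-3.0) raises a ValueError because -3 LUFS lies outside the documented range.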

LUFS (Loudness Units relative to Full Scale):
LUFS is a standardized loudness measurement used in broadcasting, streaming, and audio production. It reflects perceived loudness rather than just peak levels, which helps keep volume consistent across different media platforms. LUFS is widely used in television, radio, and online streaming services to normalize audio levels.

dB True Peak (dBTP):
dBTP measures the absolute peak level of an audio signal, including inter-sample peaks that may cause clipping or distortion when played back on different systems. It is commonly used in mastering and broadcasting to ensure that audio does not exceed safe levels, preventing digital distortion in various playback environments.
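
To show how a loudness target and a peak limit interact, here is a minimal arithmetic sketch. It assumes a plain static gain, which shifts loudness and true peak by the same number of dB. It is not the module's actual processing, which is not documented here, and the function names are hypothetical.

```python
def normalization_gain_db(
    measured_lufs: float,
    measured_peak_dbtp: float,
    target_lufs: float,
    peak_limit_dbtp: float,
) -> float:
    """Return the static gain (dB) that moves loudness toward the target
    without pushing the true peak above the limit."""
    loudness_gain = target_lufs - measured_lufs           # gain needed to hit the target
    peak_headroom = peak_limit_dbtp - measured_peak_dbtp  # gain available before the limit
    return min(loudness_gain, peak_headroom)


def db_to_linear(gain_db: float) -> float:
    """Convert a gain in dB to a linear amplitude factor."""
    return 10 ** (gain_db / 20)
```

For a recording measured at -28 LUFS with a true peak of -6 dBTP, a target of -16 LUFS and a limit of -1 dBTP, the loudness gain would be 12 dB but only 5 dB of headroom is available, so the function returns 5.0. Under this static-gain assumption the result sits at -23 LUFS; reaching the full target would require dynamic processing such as a limiter.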


Displaying the Results:

Module Section

On the right side of the player, you’ll see a section with detailed results for each module used in the analysis. Clicking on the module name opens a dropdown with specific parameters, useful for troubleshooting.


Result Cards

Once the analysis is complete, the enhanced audio is available as a playable preview and as a downloadable file.

Export Options:

  • Listen to the enhanced file directly in the player.

  • Download the improved audio file for further processing or archiving.
    The available output is currently a standard .wav file.
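
After downloading, the basic properties of the .wav file can be checked with Python's standard wave module. This is a minimal sketch: the file name is hypothetical, and it assumes the export is PCM-encoded, since the wave module raises wave.Error for formats it cannot read, such as floating-point WAV files.

```python
import wave


def describe_wav(path: str) -> dict:
    """Return channel count, sample rate, bit depth, and duration of a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": rate,
            "bit_depth": wav.getsampwidth() * 8,  # sample width is reported in bytes
            "duration_s": frames / rate,
        }


# Usage (hypothetical file name):
# print(describe_wav("enhanced.wav"))
```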