Audio Mixer Strategist
All-in-one agent covering multi, track, audio, mixing. Includes structured workflows, validation checks, and reusable patterns for ffmpeg clip team.
Audio Mixer Strategist
Your agent for audio processing workflows β covering mixing, mastering, format conversion, and audio manipulation using FFmpeg and command-line audio tools.
When to Use This Agent
Choose Audio Mixer Strategist when:
- Mixing multiple audio tracks or stems into a final output
- Applying audio effects (normalization, EQ, compression, reverb)
- Converting between audio formats (WAV, MP3, FLAC, AAC, OGG)
- Processing podcast audio (noise reduction, leveling, splitting)
- Automating batch audio processing workflows
Consider alternatives when:
- You need video editing β use a video processing agent
- You need music generation β use a music AI agent
- You need real-time audio streaming β use a streaming architecture agent
Quick Start
# .claude/agents/audio-mixer.yml name: Audio Mixer Strategist model: claude-sonnet tools: - Read - Write - Edit - Bash - Glob - Grep description: Audio processing agent for mixing, effects, format conversion, and batch audio workflows using FFmpeg
Example invocation:
claude "Create an FFmpeg command to mix a voice track and background music β voice should be -3dB louder, music ducked during speech, with final output normalized to -14 LUFS for podcast distribution"
Core Concepts
Audio Processing Pipeline
| Stage | Operation | FFmpeg Filter |
|---|---|---|
| Input | Load audio files | -i input.wav |
| Normalize | Level matching | loudnorm=I=-14:TP=-1 |
| EQ | Frequency adjustment | equalizer=f=100:t=h:w=200:g=3 |
| Compress | Dynamic range control | acompressor=threshold=-20dB:ratio=4 |
| Mix | Combine multiple tracks | amix=inputs=2:duration=longest |
| Fade | Fade in/out | afade=t=in:st=0:d=2 |
| Output | Encode and save | -c:a libmp3lame -q:a 2 |
Common FFmpeg Audio Commands
# Convert WAV to MP3 at 192kbps ffmpeg -i input.wav -c:a libmp3lame -b:a 192k output.mp3 # Normalize audio to -14 LUFS (podcast standard) ffmpeg -i input.wav -af loudnorm=I=-14:TP=-1:LRA=11 output.wav # Mix voice + music with ducking ffmpeg -i voice.wav -i music.wav \ -filter_complex "[1]volume=0.3[bg];[0][bg]amix=inputs=2:duration=first" \ output.wav # Extract audio from video ffmpeg -i video.mp4 -vn -c:a copy audio.aac
Configuration
| Parameter | Description | Default |
|---|---|---|
output_format | Target audio format (wav, mp3, flac, aac) | mp3 |
sample_rate | Audio sample rate | 44100 |
bit_depth | Bit depth for WAV output | 24 |
loudness_target | Target LUFS level | -14 |
channels | Output channel count (mono, stereo) | stereo |
Best Practices
-
Process in lossless format, encode to lossy only at final output. Work with WAV or FLAC during editing and mixing. Converting to MP3 between processing steps compounds quality loss. Encode to the distribution format only as the final step.
-
Use loudnorm filter for broadcast/podcast normalization. The
loudnormfilter implements EBU R128 loudness standard, which is required for podcast platforms (Spotify: -14 LUFS, Apple Podcasts: -16 LUFS, YouTube: -14 LUFS). -
Apply effects in the correct order. Standard order: noise reduction β EQ β compression β normalization β encoding. Reversing the order (compressing before EQ) produces different, often inferior results.
-
Always use the two-pass loudnorm for accurate normalization. Single-pass loudnorm estimates loudness but can overshoot. Two-pass (
-filter:a loudnorm=I=-14:dual_mono=true:print_format=jsonfirst pass, then apply measured values) produces precise results. -
Preview with short clips before processing large files. Test FFmpeg filter chains on a 30-second sample before processing a 2-hour file. This catches filter errors and lets you tune parameters quickly.
Common Issues
Output audio has clipping or distortion after mixing. When combining multiple tracks, their levels sum and can exceed 0 dBFS. Reduce individual track volumes before mixing, or use dynaudnorm or acompressor after mixing to control peaks.
FFmpeg filter syntax errors are hard to debug. Complex filter graphs with multiple inputs and outputs are sensitive to syntax. Build filter chains incrementally β add one filter at a time and verify output. Use -filter_complex_script for complex graphs to keep commands readable.
Audio quality is poor after format conversion. Default codec settings may use low quality. Explicitly set quality: -c:a libmp3lame -q:a 0 (highest VBR quality) or -c:a libmp3lame -b:a 320k (highest CBR). For AAC, use -c:a aac -b:a 256k or the higher-quality libfdk_aac codec.
Reviews
No reviews yet. Be the first to review this template!
Similar Templates
API Endpoint Builder
Agent that scaffolds complete REST API endpoints with controller, service, route, types, and tests. Supports Express, Fastify, and NestJS.
Documentation Auto-Generator
Agent that reads your codebase and generates comprehensive documentation including API docs, architecture guides, and setup instructions.
Ai Ethics Advisor Partner
All-in-one agent covering ethics, responsible, development, specialist. Includes structured workflows, validation checks, and reusable patterns for ai specialists.