Understanding How the AI Works
AudioNova uses a large language model (LLM) β the same kind of AI that powers tools like ChatGPT β to interpret your mixing prompts.
Unlike traditional audio plugins, it doesnβt use fixed knobs or settings. Instead, it βunderstandsβ what youβre saying and decides what to do based on context.
π§ How the Language Model Interprets Promptsβ
When you send a prompt like:
Add reverb to the vocals, but keep them intimate and close
Hereβs whatβs happening under the hood:
- The model reads your text
- It parses your intent β not just keywords, but tone and phrasing
- It converts that into a high-level audio command
- That command is translated to backend DSP or routing logic (via code)
π¬ What the AI is Good Atβ
-
Understanding adjectives
βwarm
,punchy
,dreamy
,gritty
,clean
-
Understanding musical context
βclub-ready
,bedroom pop
,film score energy
-
Sequencing logic
βcompress β EQ β delay
-
Handling nuanced comparisons
βMake the mix feel like Bon Iver but with EDM drums
𧩠What the AI Might Struggle Withβ
-
Contradictions
βMake the vocals dry and reverberant
β too ambiguous, clarify your intent -
Overloaded prompts
βMake the snare louder, dry, wide, slower, then echo and more vintage but still tight
β break it into steps -
Uncommon slang
β Use industry-standard terms when possible -
Literal vs abstract
βMake it blue and sleepy
might work... or not. Add audio-specific phrasing.
π§ͺ Model Behavior: Things to Knowβ
- Prompts are stateless within each message β but sessions retain some context
- Prompts are interpreted by patterns, not by rules β itβs learning from language, not knobs
- Parentheses
()
may emphasize or group weight - Brackets
[]
can add context or references
π§ Prompting Principles (Recap)β
Principle | Description |
---|---|
Clarity over cleverness | Be specific about the result you want |
Break up instructions | Step-by-step is better than all at once |
Use references | Artists, genres, gear, or FX plugins |
Think like a producer | Describe intention, not settings |
π¬ Example: Prompt Deconstructedβ
Make the vocals smoother, a bit warmer, with subtle reverb and compression
The AI will likely interpret this as:
- Smoothing = compression + de-essing
- Warmth = EQ boost at 200β600Hz
- Subtle reverb = low mix level, short tail
- Compression = medium ratio, soft attack
AudioNova is powered by language β and the more clearly you speak to it, the more it becomes your mix engineer. ππ§