Skip to content

Instantly share code, notes, and snippets.

@percyvega
Last active October 6, 2021 01:51
Show Gist options
  • Save percyvega/3fb74554792ddacc0c4a6d3121aa5f61 to your computer and use it in GitHub Desktop.
Save percyvega/3fb74554792ddacc0c4a6d3121aa5f61 to your computer and use it in GitHub Desktop.

When one records, it should easily reach -12db, and only as loud as -6db, otherwise it will over-modulate.

For podacasts Convert it to mono and then encode at 64kbps joint stereo MP3 to get the best size / quality / compatibility ratio. Joint stereo would be essentially mono in this case but some old players don't support mono very well.

Cables:

XLR cables (3 prongs)
	Needs an interface to connect to computer.

USB cables
	Does not need an interface to connect to computer (connects directly).

Microphones:

Types:
	-Condenser:
			-Needs an interface that has phantom power (+48v).
			-Best for voice (especially large-diafragm condensers).
			-Condenser microphones are able to react to subtle sounds.
			-Not suited for high-volume recording.
	-Dynamic:
			-Does not need an interface that has phantom power.
			-Best for live performance, drums, guitar, amplifiers.
			-Mainly used in situations where loud audio is to be picked up.

Pattern modes:
	-Stereo: Uses both the left and right channels to capture a wide, realistic sound image—perfect for recording acoustic guitar or choir.
	-Cardioid (unidirectional): Well-suited to podcasts, vocal performances, voice-overs and instruments. Cardioid mode records sound sources that are directly in front of the microphone, delivering rich, full-bodied sound.
	-Omnidirectional: Picks up sound equally from all around the mic. It’s best used in situations when you want to capture the ambience of "being there"—like a live recording of a band's performance, a multi-person podcast or a conference call.
	-Bidirectional: Records from both the front and the rear of the microphone—ideal for recording a duet or a two-person interview.

Audacity Noise Reduction Noise Reduction (dB): 12-24 Sensitivity: 6.00 Frequency smoothing (bands): 0 Noise: Reduce Normalize Remove DC Offset Normalize maximum amplitude to -1.0 dB Compressor Threshold: -12 dB Noise Floor: -40 dB Ratio: 2:1 Attach Time: 0.20 secs Release Time: 1.0 secs Make-up gain

Wave An uncompressed or lossless format, whereas MP3 is compressed or lossy. Technically .wav is just a container format and can hold various types of compressed or uncompressed audio, but typically you'll see it containing LPCM uncompressed audio (the same as on audio CDs). With .wav files, you are essentially getting a raw bitstream representation of the audio signal in digital form. An analog sound produced in the real world contains essentially an infinite amount of information because it is a wave that is constantly changing (see below). In order to bring these sounds into the digital domain you need to sample the signal at various intervals to make an approximation of the sound. For .wav the audio signal is usually sampled at 44,100 times per second or more, and each sampled value is recorded so the sound wave can be reproduced. The analog sound wave is sampled at every T interval to make a digital approximation of the original signal. For digital audio T is usually 1/44,100th of a second or less.

MP3 Compressed in order to squeeze the same audio information into a smaller file size. The .wav format is great for very faithful representations of the analog signal, but as you probably know that's usually at the cost of larger file sizes. Compressed audio (and video in a similar fashion) is designed to reduce the file size while still maintaining a respectable level of fidelity. In layman's terms, compression attempts to remove the unnecessary data from the stream and reduce the signal to its most necessary components. With MP3, the compression and encoding algorithms use a model of how we hear to analyze the audio in the frequency domain and remove and unnecessary information. For instance, due to auditory masking if there are two sounds at close frequencies, often we will only hear the louder one if the difference in volume between the two is significant. So for MP3, the lower volume sound could be discarded and the audio would sound essentially he same to our ears.

In practice, both .wav and MP3 have their uses. For production, .wav is the standard because it will almost always be a 100% accurate, bit-for-bit reproduction of the source material. MP3s can be a decent alternative at high enough bitrates. The bitrate is the measure of how many bits per second the MP3 encoding will use, meaning the higher the bitrate, the closer the MP3 will be to the original, uncompressed stream. Bitrate is usually measured in kilobits per second (kbps). I like high audio quality for my digital music collection, so when I have the option I usually encode MP3s at a constant 256 or 320kbps. That is the higher end of what MP3s are capable of, and unfortunately a lot of digital music out there is not encoded that high. When the bitrate comes down, you can usually hear it first in the high frequencies, for example the cymbals from a drum kit will sound off. 160kbps is tolerable, but anything lower than that and you will really start to notice. But again, with a high enough bitrate the differences between MP3 and .wav are barely distinguishable, especially to an untrained listener (most listeners).

For .wav files we mostly look at bit depth and sample rate or frequency. Bit depth is the number of bits used to encode each sampled value. The sample rate tells how many times per second the audio is sampled. CD (.wav) and MP3 are encoded at 44,100 Hz sample rate (Hertz meaning "cycles per second"). Newer computers and audio hardware/software are now accommodating higher sample rates, including 48kHz or 96kHz. For .wav the bit depth is usually 16-bits or 24-bits on the newer systems. For most purposes, when using .wav, 16-bit and 44.1kHz is sufficient, but if you have the capabilities it's usually worth upgrading to 24-bit, 48kHz.

Some example file sizes for a stereo five minute recording:

  • .wav, 16-bit, 44.1kHz: 50 MB
  • .wav, 24-bit, 48kHz: 82 MB
  • .wav, 24-bit, 96kHz: 164 MB
  • MP3, 128kbps, 44.1kHz: 4.5 MB
  • MP3, 192kbps, 44.1kHz: 7 MB
  • MP3, 320kbps, 44.1kHz: 11 MB
  • FLAC, 24-bit, 44.1kHz: 28 MB
  • FLAC, 24-bit, 48kHz: 31 MB
  • FLAC, 24-bit, 96kHz: 61 MB

There is also a variable bitrate option for MP3 encoding, which should offer slightly smaller file sizes for the same quality. It uses an encoding scheme that changes (varies) the bitrate for different portions of the song depending on complexity and how many samples would be needed to recreate a given section faithfully. [Edit: FLAC] There is a third category as well: lossless compression. FLAC is a good example of this, and has the quality and fidelity of a .wav file but with smaller file sizes (still considerably larger than MP3 though). One downside of FLAC, especially in a production setting, is that not all software can play it. I use it for certain albums in my personal digital music collection where I want a more faithful reproduction than MP3. But for production I usually stick with .wav and MP3.

I keep my samples and loops as .wav if I can help it, and usually encode my finished tracks to MP3 for distribution. If you have the option, go with .wav. But if there are space constraints or you only have an MP3 version of something, as long as it's a high enough bitrate (at least 256kpbs is ideal), most people can't tell the difference.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment