# In Keras, a Convolution layer requires an additional dimension, which is
# used for the channels of the various filters.
# When we have e.g. a 2D dataset, the shape is (data_points, rows, cols),
# but Convolution2D requires the shape (data_points, rows, cols, 1).
# Otherwise it fails with an error like "Exception: Input 0 is incompatible
# with layer convolution2d_5: expected ndim=4, found ndim=3".
#
# Originally I reshaped the data beforehand, but that only complicates things.
#
# An easier and more elegant solution is to add a Reshape layer at the input
# of the network!
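The idea above can be sketched as follows, using tf.keras and a hypothetical 28x28 dataset (the sizes and filter count are illustrative, not from the original gist):

```python
import numpy as np
from tensorflow.keras.layers import Conv2D, Input, Reshape
from tensorflow.keras.models import Model

# hypothetical 2D dataset: 100 "images" of 28x28 without a channel axis
x = np.random.rand(100, 28, 28).astype('float32')

inputs = Input(shape=(28, 28))
# add the trailing channel axis inside the model instead of in preprocessing
reshaped = Reshape((28, 28, 1))(inputs)
outputs = Conv2D(8, (3, 3), padding='same')(reshaped)

model = Model(inputs, outputs)
y = model.predict(x)
print(y.shape)  # (100, 28, 28, 8)
```

The model now accepts the raw (data_points, rows, cols) arrays directly, so no reshaping is needed outside the model.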
""" | |
What pitch classes are playing? | |
video: https://www.youtube.com/watch?v=DOJyjMQHP8U | |
We computed a chromagram, ie. a sequence of pitch class vectors in | |
time using the Python tfr library (https://github.com/bzamecnik/tfr) | |
and animated it with matplotlib and moviepy. The tfr library computes | |
very sharp spectrograms and allows to transform frequencies to pitches. | |
Pitches are folded to classes by ignoring the octave producing |
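The octave-folding step mentioned above can be sketched in plain NumPy (this is a minimal illustration of the folding idea, not the tfr library's actual API):

```python
import numpy as np

def fold_to_pitch_classes(pitchgram):
    """Fold a (frames, pitches) matrix into a (frames, 12) chromagram
    by summing energy across octaves (pitch class = pitch index mod 12)."""
    frames, n_pitches = pitchgram.shape
    chromagram = np.zeros((frames, 12))
    for p in range(n_pitches):
        chromagram[:, p % 12] += pitchgram[:, p]
    return chromagram
```

Two pitches an octave apart (indices 0 and 12, say C4 and C5) thus contribute to the same pitch class bin.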
import matplotlib as mpl
mpl.use('Agg')
import matplotlib.pyplot as plt
import moviepy.editor as mpy
from moviepy.video.io.bindings import mplfig_to_npimage
import numpy as np
from scipy.signal import medfilt
import tfr
# --- parameters ---
# video and description: https://youtu.be/GX33y67CN-w
import matplotlib as mpl
mpl.use('Agg')
import matplotlib.pyplot as plt
import moviepy.editor as mpy
from moviepy.video.io.bindings import mplfig_to_npimage
import numpy as np
from scipy.signal import medfilt
import tfr
import matplotlib as mpl
mpl.use('Agg')
import matplotlib.pyplot as plt
import moviepy.editor as mpy
from moviepy.video.io.bindings import mplfig_to_npimage
import numpy as np
import os
from scipy.signal import medfilt
import tfr
""" | |
When traing ML models on text we usually need to represent words/character in one-hot encoding. | |
This can be done in preprocessing, however it may make the dataset file bigger. Also when we'd | |
like to use an Embedding layer, it accepts the original integer indexes instead of one-hot codes. | |
Can be move the one-hot encoding from pre-preprocessing directly into the model? | |
If so we could choose from two options: use one-hot inputs or perform embedding. | |
A way how to do this was suggested in Keras issue [#3680](https://github.com/fchollet/keras/issues/3680). |
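A minimal sketch of in-model one-hot encoding, written here with tf.keras and a Lambda layer (the vocabulary size and sequence length are hypothetical):

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras.layers import Input, Lambda
from tensorflow.keras.models import Model

vocab_size = 10  # hypothetical vocabulary size

# integer-encoded sequences go in; one-hot vectors are produced inside the model
inputs = Input(shape=(5,), dtype='int32')
one_hot = Lambda(lambda x: tf.one_hot(x, vocab_size))(inputs)
model = Model(inputs, one_hot)

x = np.array([[0, 1, 2, 3, 4]], dtype='int32')
print(model.predict(x).shape)  # (1, 5, 10)
```

Swapping the Lambda for an Embedding layer keeps the same integer-index input format, which is exactly the flexibility described above.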
""" | |
In this example we show how to select and separately process multiple input | |
features within Keras layers. | |
Let's say we have a model with two categorical features and we can to embed | |
or one-hot encode each one separately. Normally in the Functional API we | |
would make two Input layers, one for each feature, then connect Embedding | |
to each, merge them and then add some more Dense/LSTM/... layers. In this | |
case we need to provide the model.predict() with a list of input arrays | |
instead of just one. It becomes a bit cumbersome if you need to index and |
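The single-input alternative can be sketched as follows: one Input layer holds both categorical features side by side, and Lambda layers slice out each feature inside the model (the cardinalities and embedding sizes here are hypothetical):

```python
import tensorflow as tf
from tensorflow.keras.layers import (Concatenate, Dense, Embedding, Input,
                                     Lambda)
from tensorflow.keras.models import Model

# single input holding two integer categorical features side by side
inputs = Input(shape=(2,), dtype='int32')

# slice out each feature within the model instead of splitting the data
feature_a = Lambda(lambda x: x[:, 0])(inputs)
feature_b = Lambda(lambda x: x[:, 1])(inputs)

emb_a = Embedding(input_dim=100, output_dim=8)(feature_a)
emb_b = Embedding(input_dim=50, output_dim=4)(feature_b)

merged = Concatenate()([emb_a, emb_b])
outputs = Dense(1, activation='sigmoid')(merged)
model = Model(inputs, outputs)
```

model.predict() now takes a single (batch, 2) integer array rather than a list of two arrays.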
____________________________________________________________________________________________________
Layer (type)                     Output Shape          Param #     Connected to
====================================================================================================
input_1 (InputLayer)             (None, 32, 10)        0
____________________________________________________________________________________________________
lstm_1 (LSTM)                    (None, 32, 10)        840         input_1[0][0]
____________________________________________________________________________________________________
add_1 (Add)                      (None, 32, 10)        0           input_1[0][0]
                                                                   lstm_1[0][0]
____________________________________________________________________________________________________
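The summary above describes a residual connection around a sequence-returning LSTM; it can be reconstructed as the following tf.keras sketch (the 840 parameters match an LSTM with 10 units on 10 input features: 4 * ((10 + 10) * 10 + 10)):

```python
from tensorflow.keras.layers import Add, Input, LSTM
from tensorflow.keras.models import Model

# residual block: the LSTM output is added element-wise to its own input
inputs = Input(shape=(32, 10))
lstm = LSTM(10, return_sequences=True)(inputs)  # 4*((10+10)*10 + 10) = 840 params
added = Add()([inputs, lstm])

model = Model(inputs, added)
model.summary()
```

Because the Add layer requires matching shapes, the LSTM must use return_sequences=True and as many units as there are input features.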
""" | |
When classifying upon a sequence usually we stack some LSTM returning sequences, | |
then one LSTM returning a point, then Dense with softmax activation. | |
Is it possible instead to give the last non-sequential LSTM a softmax activation? | |
The answer is yes. | |
In this example we have 3 sequential layers and one layer producing the final result. |
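A minimal sketch of that architecture in tf.keras (the sequence length, feature count, unit sizes, and number of classes are hypothetical):

```python
import numpy as np
from tensorflow.keras.layers import Input, LSTM
from tensorflow.keras.models import Model

n_classes = 10

inputs = Input(shape=(20, 8))  # hypothetical sequence length and features
# three sequence-returning LSTM layers
x = LSTM(32, return_sequences=True)(inputs)
x = LSTM(32, return_sequences=True)(x)
x = LSTM(32, return_sequences=True)(x)
# final LSTM returns a single vector, with softmax as its activation,
# so no separate Dense layer is needed to get one value per class
outputs = LSTM(n_classes, activation='softmax')(x)

model = Model(inputs, outputs)
probs = model.predict(np.random.rand(4, 20, 8).astype('float32'))
print(probs.shape)  # (4, 10)
```

Note that inside an LSTM the activation is applied to the cell state and then scaled by the output gate, so the outputs are not guaranteed to sum exactly to 1 the way a Dense softmax head's would; this sketch only mirrors the construction the example describes.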