Evaluating the MuseGAN jazz sample model
In this section, you review the loss function and generated sample metrics that are
provided in the AWS DeepComposer console for the sample MuseGAN
jazz genre model
Note
In the AWS DeepComposer console, the generated samples metrics are available only for pre-trained models. These metrics include Pitches used, Pitch classes used, Empty bar rate, Polyphonic range, In scale ratio, and Drum in pattern.
To evaluate the pre-trained MuseGAN jazz genre model
-
Open the AWS DeepComposer console
. -
In the navigation pane, choose Models.
-
On the Models page, under Sample model, choose the Jazz pre-trained model.
-
On the training results page for the model, choose Loss function to review the loss function graph.
The generator loss and discriminator loss are superimposed onto the same graph, but on different scales. The scale of the generator loss is shown on the left side and the scale of the discriminator loss is shown on the right side.
In this training job, the generator loss plateaus around the 50th epoch, when it stops significantly improving its ability to generate realistic music.
-
Choose Sample output to listen to accompaniment tracks that would be generated had inference been performed at that specific epoch.
For every 50th epoch, you can listen to the accompaniment tracks that could have been generated.
The discriminator loss shows similar behavior, but is less noisy after it plateaus.
-
Choose the Generated samples tab.
You can choose one of the available metrics: Pitches used, Pitch classes used, Empty bar rate, Polyphonic range, In scale ratio, Drum in pattern.
To view the definition of a generated sample metric, choose the corresponding tab. For each generated sample, a graph shows output samples compared with ground truth samples. Over time, the generated samples should converge to the target line. The plot in the tab shows the metric plotted for the generated samples at each epoch (scatter) vs. the ground truth samples (line), respectively. The ground truth samples allow image data to be related to real images as opposed to images provided by inference.