A loss function compares model outputs to the true targets and measures how far off the predictions are. For a loss function to be compatible with the standard supervised training loop, the following properties must hold.
Firstly, the loss function should accept the model outputs and targets, and return a single scalar value. Given a data iterator `dataiter` and a model `model`:
```julia
xs, ys = first(dataiter)    # one batch of inputs and targets
ŷs = model(xs)              # model predictions for that batch
lossfn(ŷs, ys) isa Number   # the loss is a single scalar
```
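As a concrete sketch of this property, here is what it might look like with a toy `Dense` model, randomly generated data, and `Flux.Losses.mse` standing in for `lossfn`; the names, shapes, and choice of loss below are illustrative assumptions, not part of the text above.

```julia
using Flux

model  = Dense(3 => 2)            # toy model: 3 features in, 2 outputs
xs     = rand(Float32, 3, 5)      # a batch of 5 samples with 3 features each
ys     = rand(Float32, 2, 5)      # matching targets
lossfn = Flux.Losses.mse          # mean squared error as the loss function

ŷs = model(xs)
lossfn(ŷs, ys) isa Number         # true: the loss is a single scalar
```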
The loss function must also be differentiable, so that gradients can be calculated during training. See the section on models for more on how to check this.
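One way to check this, sketched below by reusing the toy `model`, `xs`, `ys`, and `lossfn` from the example above, is to ask for a gradient of the loss with respect to the model; if the call succeeds, the loss is differentiable at that point.

```julia
using Flux

# Differentiate the loss with respect to the model.
# If this runs and returns gradients, lossfn is differentiable here.
grads = Flux.gradient(m -> lossfn(m(xs), ys), model)
```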
Flux.jl comes with many commonly used loss functions built into its `Flux.Losses` submodule. See the Flux.jl loss functions reference for a complete list.
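For instance, two of the built-in losses can be used as follows; the data here is made up purely for illustration.

```julia
using Flux
using Flux.Losses: crossentropy, logitcrossentropy

y  = Flux.onehotbatch([1, 3, 2, 1], 1:3)   # one-hot targets, 3 classes × 4 samples
ŷp = softmax(randn(Float32, 3, 4))         # predicted probabilities
ŷl = randn(Float32, 3, 4)                  # raw logits

crossentropy(ŷp, y)        # expects probabilities
logitcrossentropy(ŷl, y)   # expects logits; numerically more stable
```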
You can also write your own loss functions. If you stick to non-mutating array operations, there is a good chance that your loss will be differentiable and also compatible with GPU arrays from CUDA.jl.
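As a hedged example of such a hand-written loss, here is a pseudo-Huber form built only from broadcasted, non-mutating operations; the function name and formula are illustrative and not defined by Flux.

```julia
# A custom loss using only non-mutating, broadcasted array operations,
# so it should differentiate with Zygote and run on CUDA.jl arrays.
pseudo_huber(ŷ, y) = sum(sqrt.(1 .+ (ŷ .- y) .^ 2) .- 1) / length(y)

# It is used exactly like the built-in losses (reusing the toy model above):
pseudo_huber(model(xs), ys) isa Number   # true
```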