Momentum

struct defined in module Flux.Optimise


			Momentum(η = 0.01, ρ = 0.9)

Gradient descent optimiser with learning rate η and momentum ρ.

Parameters

  • Learning rate (η): Factor by which gradients are scaled before updating the weights.

  • Momentum (ρ): Controls how strongly previous updates carry over into the current one, accelerating gradient descent in the prominent direction and in effect damping oscillations.
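
How the two parameters interact can be illustrated with a minimal sketch of the classical momentum update. It is written in plain Julia and assumes nothing about Flux internals: the helper `momentum_step!`, the velocity buffer `v`, and the toy quadratic objective are hypothetical names chosen for illustration, not part of the Flux API.

```julia
# Hypothetical helper (not part of Flux): one classical momentum step.
# x: parameters, v: velocity buffer (same shape as x), g: gradient at x.
function momentum_step!(x, v, g; η = 0.01, ρ = 0.9)
    @. v = ρ * v - η * g   # keep a fraction ρ of the old velocity, add the scaled gradient step
    @. x = x + v           # move the parameters along the velocity
    return x
end

# Toy objective f(x) = sum(x .^ 2) / 2, whose gradient is x itself.
x = [1.0, -2.0]
v = zero(x)
for _ in 1:200
    g = copy(x)            # gradient of the toy objective at the current x
    momentum_step!(x, v, g)
end
# x has now moved close to the minimiser at the origin.
```

With ρ = 0 the velocity carries no history and the step reduces to plain gradient descent; values of ρ closer to 1 let earlier gradients influence the update for longer.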

Examples

			opt = Momentum()

			opt = Momentum(0.01, 0.99)

Methods

There are 2 methods for Flux.Optimise.Momentum: