Connexions

You are here: Home » Content » Back propagation mathematics
Content Actions
Lenses

What is a lens?

Lenses

A lens is a custom view of Connexions content. You can think of it as a fancy kind of list that will let you see Connexions through the eyes of organizations and people you trust.

What is in a lens?

Lens makers point to Connexions materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

Who can create a lens?

Any individual Connexions member, a community, or a respected organization.

This content is ...
Affiliated with (?)
This content is either by members of the organizations listed or about topics related to the organizations listed. Click each link to see a list of all content affiliated with the organization.
  • This module is included inLens: Rice University ELEC 301 Project Lens
    By: Rice University ELEC 301As a part of collection:"ECE 301 Projects Fall 2003"

    Click the "Rice University ELEC 301 Projects" link to see all content affiliated with them.

    Rice University ELEC 301 Projects
  • This module is included inLens: Rice University OpenCourseWare
    By: OpenCourseWare ConsortiumAs a part of collections:"Music Classification by Genre", "ECE 301 Projects Fall 2003"

    Click the "Rice University OCW" link to see all content affiliated with them.

    Rice University OCW
Tags

(?)

These tags come from the endorsement, affiliation, and other lenses that include this content.

Back propagation mathematics

Module by: Krzysztof Cyran

Summary: This module explains mathematical background of the most popular learning algorithm, called back propagation. It is widely used in multilayer perceptrons training but it's usage is not limited to this particular type of neural network.

Error definition

The back propagation method is the example of the wide class of training methods based on the information covered in the gradient of error function. The independent variables in this minimization are weights of neural network and the considered error to be minimized is the root mean square one.
Let us consider the training set composed of LL ordered pairs, of the following form: { ( x ( 1 ) , d ( 1 ) ),( x ( 2 ) , d ( 2 ) ),...,( x ( L ) , d ( L ) ) } { ( x ( 1 ) , d ( 1 ) ),( x ( 2 ) , d ( 2 ) ),...,( x ( L ) , d ( L ) ) } Furthermore, let us define the total error EE generated on outputs of neural network after presenting the entire training set, as: E= l=1 L E ( l ) E= l=1 L E ( l ) where: E ( l ) = m=1 M E m ( l ) = 1 2 m=1 M ( d m ( l ) y m ( l ) ) 2 E ( l ) = m=1 M E m ( l ) = 1 2 m=1 M ( d m ( l ) y m ( l ) ) 2 As was already told, the independent variables in the minimization of error EE are weights w ij w ij Subscript w ij Since even for the relatively small networks the number of weigths is big, in real applications, the training of the neural network is the minimization of the scalar field over the vector space with hundreds or (more often) thousands dimensions. One of the minizmiazation techniques for such problem is the steapest descent method
01x2dx x 0 1 x 2 n=12×21/226390n+11034n!9801×3964nn!4-1 n 1 2 2 12 26390 n 1103 4 n 9801 396 4 n n 4 -1 n = 1 2 2 ( 26390 n + 1103 ) ( 4 n ) ! 9801 396 4 n n ! 4 n = 1 2 2 ( 26390 n + 1103 ) ( 4 n ) ! 9801 396 4 n n ! 4 n = 1 2 2 ( 26390 n + 1103 ) ( 4 n ) ! 9801 396 4 n n ! 4 n = 1 2 2 ( 26390 n + 1103 ) ( 4 n ) ! 9801 396 4 n n ! 4 n 1 2 2 12 26390 n 1103 4 n 9801 396 4 n n 4 -1 n=12×21/226390n+11034n!9801×3964nn!4-1 n 1 2 2 12 26390 n 1103 4 n 9801 396 4 n n 4 -1

Comments, questions, feedback, criticisms?

Send feedback