Overfitting

In statistics, overfitting is fitting a statistical model that has too many parameters. An absurd and false model may fit perfectly if the model has enough complexity by comparison to the amount of data available. Overfitting is generally recognized to be a violation of Occam's razor. The concept of overfitting is important also in machine learning. Usually a learning algorithm is trained using some set of training examples, i.e. exemplary situations for which the desired output is known. The learner is assumed to reach a state where it will also be able to predict the correct output for other examples, thus generalizing to situations not presented during training (based on its inductive bias). However, especially in cases where learning was performed too long or where training examples are rare, the learner may adjust to very specific random features of the training data, that have no causal relation to the target function. In this process of overfitting, the performance on the training examples still increases while the performance on unseen data becomes worse. In both statistics and machine learning, in order to avoid overfitting, it is necessary to use additional techniques (e.g. cross-validation, early stopping), that can indicate when further training is not resulting in better generalization. In treatment learning, overfitting is avoided by using a minimum best support value.

 

<< PreviousWord BrowserNext >>
yrj visl
ley line
upper rhine
autobahn (album)
minimoog
polymethyl methacrylate
extropianism
proton (company)
dj grandmaster flash
darren aronofsky
traquair house
panopticon
stoke upon trent
national trails
liquefy
green league
edward morley
future of the car
strychnine
commerzbank tower
queenstown, new zealand
primordial black hole
chorology
miguel de unamuno
preikestolen
flcl
stoke on trent
el hazard
alfa romeo gtv6
barbarossa
franois darlan
lpga
your mileage may vary
laboratory
paul graham
automation
sharpless epoxidation
units unit conversion program
fortnight
platoon (automobile)
mechanization
apc
whistler (radio)
filter design