The second point to note about the update equation is the subtraction of lambda*P from the update direction. This effectively regularizes the solution from growing too large; a procedure that is often applied in machine-learning algorithms to promote generalization to unseen data. The scaling factor lambda is user defined and is usually problem dependent. However, a small value typically works well for learning patch models for facial feature detection.