The panellists participated in three 90-min training sessions. In each training session, three of the six fortified samples were alternatively presented (VS, RM, and VL in the first session; RS, VM andRLinthesecondsession; RS,VM,andRLinthethirdsession)to expose the panellists tothe entire range of the products. During the f irst training session, the panellists spontaneously elicited attributes and indicated the definitions that would be used in the study. In subsequent sessions, the panellists were provided by the panel leader with some standards and refined the list of words. The panellists selected the descriptors that were most suitable for describing the sensory differences among the samples and for which it was possible to use actual physical objects as reference standards or at least a precise written definition. The final list of the 17 attributes, definitions and standards used for the final assessment of the samples are reported in Table 1. From the second training session, a score sheet was given to quantify the perceived intensity of the descriptors on a discrete nine-point scale (1 ¼ extremely weak, 9 ¼ extremely intense). After the training phase, the panellists participated in four evaluation sessions (approximately 45 min each) and evaluated the intensity of the above mentioned 17 attributes. All of the evaluations were conducted under red light to mask the appearance of the samples. The six enriched tomato purees were analysed in four replicates and presented in a balanced and randomized order within each replicate. The panellists were required to rinse their mouths prior to each evaluation. A 30 s restwasenforced betweensamples, and the panellists were required to eat plain crackers and rinse their mouths with still water during this interval. A 10 min break was enforced between the third and fourth samples to promote the complete psychological rest and concentration of the panellists until the end of the test.