Expected numbers of cases for each block group were
computed by applying reference incidence rates for the
same years (as estimated from the four administrative
departments of the study area, plus the departments of
Doubs and Hérault) to the person-years of each area strat-ified by gender and 5-year age classes.
Models were fitted to the grouped data with Poisson
regression analysis. Logarithms of observed and expected
cases were linked with a set covariate values in a linear
model that dealt with Poisson overdispersion. To build
flexible models, penalized regression splines (based on 25
knots) were used [20]. Since the dioxin concentration dis-tribution was right-skewed, a square root transformation
was applied to force the exposure data to follow a normal
distribution. Log-transformed industry × years, log-trans-formed population density, socio-economic level, and
NO2 concentration were entered as continuous variables
in the models. Urbanisation (yes/no, by collapsing the 3
urban categories), and administrative departments
(department of Isère as reference category) were treated as
categorical variables.
Multivariate models were run. Dioxin exposure (variable
under scrutiny) and administrative departments (to
account for their heterogeneity) were forced into the mod-els. Then, the 5 remaining independent variables were
introduced as spline functions. To assess the smoothing
added value, we followed the ad hoc approach proposed
by Wood et al. [20]. If the estimated degrees of freedom
were close to 1 (and a linear function was therefore esti-mated), and if the generalized cross validation (GCV)
score was higher than the GCV for the unsmoothed
model, then the original unsmoothed variable was pre-ferred to the spline-smoothed version. Finally, using
Akaike's information criterion for covariate selection, a
backward stepwise selection was applied.