4.3. Logistic regression models for non-captive stations
It can be seen from Table 7 that the correlation between distance
and travel time, travel mode and time, purpose and InboundOut and
InBoundOut and travelFeeD are 0.5, 0.45, 0.5 and
0.35 respectively. Therefore, travel time and InboundOut were
removed for model selection.
Three variables were identified to be significant from the best
fitting logistic regression model for non-captive stations (see
Table 8). There are 559 records for the non-captive stations
(Table 2), but the sample size for this regression model is 486 with
73 missing records being removed for the purpose of the analysis.
The most influential variable is travel cost (from a chosen station to
a destination). The less the cost of travelling from a chosen station
to a destination, the more likely a chosen station will be a nonnearest
station, which is consistent with the results from the
overall model. However, different from the model for all chosen
station, cost (origin to station) was found to be significant. The less
the cost from origin to the chosen station, the less likely it is that
chosen stations will be the nearest station.